Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
-
Updated
Aug 16, 2024 - Python
Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
Conversational Image Recognition Chatbot
Extract structured menu information from images into JSON by E2E Vision-Language model fine-tuning pipeline or LLM.
Add a description, image, and links to the image-text-to-text topic page so that developers can more easily learn about it.
To associate your repository with the image-text-to-text topic, visit your repo's landing page and select "manage topics."