This project is a conversational AI application that integrates image recognition capabilities. It uses a pre-trained model from the transformers
library to generate responses based on user queries and images.
- Load and display images
- Send text queries along with images
- Receive AI-generated responses based on the input image and query
- GUI built with Tkinter
-
Run the GUI application:
python gui.py
-
Use the GUI to load an image, enter a query, and get a response from the AI.
app.py
: Contains the core logic for loading the model and processing images and queries.gui.py
: Implements the graphical user interface using Tkinter..gitignore
: Specifies files and directories to be ignored by Git.
transformers
torch
Pillow
tkinter
This project is licensed under the MIT License. See the LICENSE
file for more details.