RAG-based Restaurant Chatbot

A Retrieval Augmented Generation (RAG) based chatbot that answers user questions about restaurants using data scraped from restaurant websites.

Project Overview

This project consists of the following components:

Web Scraper: Collects restaurant data including menus, locations, operating hours, etc.
Knowledge Base: Processes and stores scraped data for efficient retrieval
RAG Chatbot: Uses Hugging Face models to retrieve and generate responses
User Interface: Streamlit app for interacting with the chatbot

Setup Instructions

Prerequisites

Python 3.8+
Git

Installation

Clone the repository:

git clone https://github.com/keyaaness/zomato-rag-chatbot.git
cd zomato-rag-chatbot

Create a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install the required packages:
```
pip install -r requirements.txt
```

Usage

Running the Web Scraper

To collect restaurant data:

python src/scraper/main.py

This will scrape data from the configured restaurant websites and save it to data/raw/.

Building the Knowledge Base

To process the scraped data and build the knowledge base:

python src/knowledge_base/build_kb.py

This will process the raw data and create a structured knowledge base in data/processed/.

Running the Chatbot Interface

To launch the Streamlit interface:

streamlit run src/app.py

Then open your browser and navigate to http://localhost:8501 to interact with the chatbot.

Project Structure

zomato-rag-chatbot/
├── data/                      # Data directory
│   ├── raw/                   # Raw scraped data
│   └── processed/             # Processed data for knowledge base
├── src/                       # Source code
│   ├── scraper/               # Web scraping module
│   ├── knowledge_base/        # Knowledge base processing
│   ├── rag/                   # RAG implementation
│   ├── utils/                 # Utility functions
│   └── app.py                 # Streamlit application
├── notebooks/                 # Jupyter notebooks for exploration
├── tests/                     # Test files
├── README.md                  # Project documentation
└── requirements.txt           # Package dependencies

Features

Scrapes data from multiple restaurant websites
Structured knowledge base for efficient retrieval
Natural language query processing
Conversational interface via Streamlit
Handles various types of restaurant-related queries
Remembers conversation context

Limitations

Only works with data from scraped restaurants
May not handle highly complex or ambiguous queries
Limited to text-based information (no image processing)

Future Improvements

Expand the number of restaurants in the database
Implement more advanced retrieval mechanisms
Add sentiment analysis for restaurant reviews
Improve handling of ambiguous queries
Create a mobile application interface

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
.gitignore		.gitignore
DOCUMENTATION.md		DOCUMENTATION.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG-based Restaurant Chatbot

Project Overview

Setup Instructions

Prerequisites

Installation

Usage

Running the Web Scraper

Building the Knowledge Base

Running the Chatbot Interface

Project Structure

Features

Limitations

Future Improvements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

keyaaness/cuisine-query

Folders and files

Latest commit

History

Repository files navigation

RAG-based Restaurant Chatbot

Project Overview

Setup Instructions

Prerequisites

Installation

Usage

Running the Web Scraper

Building the Knowledge Base

Running the Chatbot Interface

Project Structure

Features

Limitations

Future Improvements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages