This project is a lightweight Retrieval-Augmented Generation (RAG) system that enables document-based question answering with a local Large Language Model (LLM). It combines LangChain, the Chroma vector database, and Hugging Face models to process PDF documents, retrieve the most relevant chunks, and generate grounded answers.
- Offline-capable: Runs fully locally with Hugging Face models (e.g., LLaMA 3, Falcon).
- RAG Architecture: Answers are grounded in your uploaded documents, not hallucinated.
- PDF Upload Support: Automatically splits and embeds text for semantic search.
- Frontend + API: Streamlit UI plus a FastAPI backend for an end-to-end user experience.
```
doc_qa_project/
├── main.py             # FastAPI backend (upload & QA API)
├── frontend/app.py     # Streamlit frontend UI
├── model_loader.py     # Loads the Hugging Face model locally
├── rag_chain.py        # RAG logic: split, embed, retrieve, generate
├── requirements.txt    # Python dependencies
├── .gitignore
└── README.md
```
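rag_chain.py implements the core flow: split the PDF into chunks, embed them into Chroma, retrieve the most relevant chunks for a question, and generate an answer. Below is a minimal sketch of that flow, assuming LangChain's community loaders and a small sentence-transformers embedding model; chunk sizes and class choices are illustrative, not the project's exact code:

```python
# Illustrative sketch of rag_chain.py, not the project's exact code.
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

def index_pdf(path: str) -> Chroma:
    docs = PyPDFLoader(path).load()              # 1. load pages from the PDF
    chunks = RecursiveCharacterTextSplitter(
        chunk_size=1000, chunk_overlap=100       # assumed sizes
    ).split_documents(docs)                      # 2. split into chunks
    embeddings = HuggingFaceEmbeddings(          # 3. embed locally
        model_name="sentence-transformers/all-MiniLM-L6-v2"
    )
    return Chroma.from_documents(chunks, embeddings)  # 4. store in Chroma

# 5. at question time: retrieve top-k chunks and pass them to the local LLM
# retriever = index_pdf("paper.pdf").as_retriever(search_kwargs={"k": 4})
```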
Install the dependencies (Python 3.9+ in a virtual environment is recommended):

```bash
pip install -r requirements.txt
```
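For example, to create and activate a virtual environment before installing (a typical setup, not a project requirement):

```bash
python -m venv .venv
source .venv/bin/activate
```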
- Set your API key (only needed if you opt into OpenAI embeddings):

```bash
export OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxx
```
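Without this key, the system can stay fully offline by using local embeddings. A hedged sketch of how rag_chain.py might choose between the two backends (the helper name and fallback model are assumptions):

```python
import os

from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_openai import OpenAIEmbeddings

def get_embeddings():
    """Hypothetical helper: use OpenAI embeddings only if a key is set."""
    if os.getenv("OPENAI_API_KEY"):
        return OpenAIEmbeddings()  # remote embeddings via the OpenAI API
    # local fallback keeps the pipeline offline-capable
    return HuggingFaceEmbeddings(
        model_name="sentence-transformers/all-MiniLM-L6-v2"
    )
```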
- Start the FastAPI backend:

```bash
uvicorn main:app --host 0.0.0.0 --port 8000
```
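main.py exposes the upload and question-answering routes. A rough sketch of their likely shape (route names, payload fields, and the `run_rag_chain` stub are illustrative assumptions, not the project's confirmed API):

```python
# Illustrative sketch of main.py; routes and payloads are assumptions.
from fastapi import FastAPI, UploadFile
from pydantic import BaseModel

app = FastAPI()

class Question(BaseModel):
    question: str

def run_rag_chain(question: str) -> str:
    # stub: the real project retrieves chunks and calls the local LLM
    return "(answer generated from retrieved chunks)"

@app.post("/upload")
async def upload(file: UploadFile):
    pdf_bytes = await file.read()
    # real project: split, embed, and store pdf_bytes in the Chroma index
    return {"status": "indexed", "filename": file.filename, "bytes": len(pdf_bytes)}

@app.post("/ask")
async def ask(q: Question):
    return {"answer": run_rag_chain(q.question)}
```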
- Start the Streamlit frontend:

```bash
streamlit run frontend/app.py
```
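The frontend simply calls the backend over HTTP. A minimal illustrative version of frontend/app.py (the endpoint paths mirror the hypothetical backend sketch above and may differ from the real API):

```python
# Illustrative sketch of frontend/app.py; endpoint paths are assumptions.
import requests
import streamlit as st

API = "http://localhost:8000"

st.title("Document Q&A")

uploaded = st.file_uploader("Upload a PDF", type="pdf")
if uploaded is not None:
    requests.post(f"{API}/upload",
                  files={"file": (uploaded.name, uploaded.getvalue())})
    st.success("Document indexed.")

question = st.text_input("Ask a question about your document")
if question:
    resp = requests.post(f"{API}/ask", json={"question": question})
    st.write(resp.json()["answer"])
```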
To use gated Hugging Face models (e.g., LLaMA 3), create a `.env` file in the root directory:

```
HUGGINGFACE_TOKEN=your_token_here
```

Then export it in your shell:

```bash
export HUGGINGFACE_TOKEN=your_token_here
```
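model_loader.py presumably passes this token through to `transformers`; a sketch under that assumption (the default model ID and `device_map` choice are illustrative, and `device_map="auto"` additionally requires the `accelerate` package):

```python
# Illustrative sketch of model_loader.py; defaults are assumptions.
import os

from transformers import AutoModelForCausalLM, AutoTokenizer

def load_model(name: str = "meta-llama/Meta-Llama-3-8B-Instruct"):
    token = os.getenv("HUGGINGFACE_TOKEN")  # required for gated models
    tokenizer = AutoTokenizer.from_pretrained(name, token=token)
    model = AutoModelForCausalLM.from_pretrained(
        name,
        token=token,
        device_map="auto",  # needs `accelerate`; places weights automatically
    )
    return tokenizer, model
```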
Supported models:

- `meta-llama/Meta-Llama-3-8B-Instruct`
- `tiiuae/falcon-7b-instruct`
- Any `AutoModelForCausalLM`-compatible Hugging Face model
Example questions:

- “What are the key takeaways from this document?”
- “What conclusions does the author draw in Section 3?”
- “Does this paper mention recent industry trends?”
Planned improvements:

- Support multiple documents
- Add support for Markdown and Word files
- Integrate response-time logging
- Add a Dockerfile for containerized deployment
This project is released under the MIT License.

Author: Ruoyi Zhu
Feel free to star ⭐ the repo, open issues, or contribute!