Building AI course project
DocChatAI is a Retrieval-Augmented Generation (RAG) chatbot that allows users to upload PDFs, ask questions, and generate structured reports. It uses semantic search and LLMs to provide accurate, context-aware responses and summaries from documents.
- Upload PDFs and ask questions
- Context-aware answers using RAG
- Generate structured summary reports
- Download reports as text files
- Python 3.10
- Streamlit
- LangChain
- FAISS
- HuggingFace Embeddings
- Gemini API (Gemini-2.5-Flash)
Retrieval performance was evaluated using a curated set of question–document pairs on user-uploaded PDFs.
- Top-3 Retrieval Accuracy: 100%
- Embedding Model: sentence-transformers/all-MiniLM-L6-v2
- Vector Store: FAISS
- A query is considered correct if the expected document appears among the top-3 retrieved chunks.
pip install -r requirements.txt
streamlit run app.py