Multi-Document AI Assistant

A powerful AI assistant capable of answering questions from multiple PDF manuals with source citations using LangChain, FAISS, and Google Gemini 2.0 Flash Lite. Give it a try here! (Might take a second to load.)

Note: Still a WIP, the retrieval component could frankly be a lot better. Also bear in mind a lower end LLM model is used due to cost constraints so please verify any answer you get from this.

Features

Multi-PDF Processing: Upload and process multiple PDF documents
Intelligent Text Chunking: Smart text segmentation for optimal context retrieval
FAISS Vector Store: Fast and efficient similarity search
Source Citations: Every answer includes references to source documents
Streamlit Web Interface: User-friendly web application
Google Gemini 2.0 Flash Lite Integration: Advanced language model for accurate responses

Project Structure

app.py: Main Streamlit application
pdf_processor.py: PDF processing and text extraction
vector_store.py: FAISS vector store management
qa_system.py: Question answering system with LangChain
utils.py: Utility functions
requirements.txt: Python dependencies

How It Works

Document Processing: PDFs are parsed and text is extracted
Text Chunking: Text is split into manageable chunks with overlap
Embedding Generation: Chunks are converted to vector embeddings
Vector Storage: Embeddings are stored in FAISS for fast retrieval
Question Answering: User questions are processed through the retrieval-augmented generation pipeline
Source Citation: Relevant source documents are cited in responses

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.devcontainer		.devcontainer
.gitignore		.gitignore
README.md		README.md
app.py		app.py
config.py		config.py
pdf_processor.py		pdf_processor.py
qa_system.py		qa_system.py
requirements.txt		requirements.txt
utils.py		utils.py
vector_store.py		vector_store.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multi-Document AI Assistant

Features

Project Structure

How It Works

About

Uh oh!

Releases

Packages

Languages

dkleitsas/RAG_PDF_Assistant

Folders and files

Latest commit

History

Repository files navigation

Multi-Document AI Assistant

Features

Project Structure

How It Works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages