A Streamlit web app that enables users to upload a PDF and interactively query its contents using language models and embeddings.
pdf-chat-assistant is a tool for generating executive summaries and extracting information from PDF documents through natural language questions. Users can upload a PDF and ask questions; the app will provide AI-generated answers based strictly on the document's context.
- Upload and process PDF documents
- Extract text, split into chunks, and embed them for efficient search
- Query the document using natural language and get accurate, context-based answers
- Built with Streamlit for a simple, responsive web UI
- Uses HuggingFace embeddings, FAISS vector store, and LangChain for advanced retrieval
# Clone the repository
https://github.com/lavya30/pdf-chat-assistant.git
cd pdf-chat-assistant
# (Optional) Create a new virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install dependencies
pip install -r requirements.txtstreamlit run streamlit_app.py- Open your browser to the local Streamlit address provided in the terminal.
- Upload a PDF in the app. Enter your queries in the textbox.
See requirements.txt for all packages, including:
- streamlit
- langchain
- langchain-community
- langchain-groq
- faiss-cpu
- sentence-transformers
- python-dotenv
- pypdf
Contributions are welcome! Feel free to fork this repo, open issues, or submit pull requests.
MIT License
Maintainer: lavya30
GitHub: https://github.com/lavya30