Skip to content

AI agent which can read the doc and answer your quries

License

Notifications You must be signed in to change notification settings

shashankshet/AskTheDoc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

AskTheDoc

Read Complete Blog

AI agent which can read the doc and answer your quries

SAskTheDoc is an intelligent chatbot that can read and understand uploaded PDF documents, and then answer user questions based on the content. It leverages cutting-edge technologies such as Google Generative AI, LangChain, and FAISS for embedding and retrieval, all wrapped inside a sleek Streamlit UI.

πŸš€ Features

  • πŸ“š Upload one or more PDFs
  • πŸ” Extracts and preprocesses text from PDFs
  • ✨ Displays word count and most frequent terms
  • πŸ’¬ Asks questions about the documents
  • πŸ€– Uses Google Generative AI (Gemini) for intelligent answers
  • 🧠 Named Entity Recognition (NER) with SpaCy
  • πŸ“Š Displays response time for each answer
  • ☁️ Vector storage with FAISS
  • πŸ›  Built with Streamlit, LangChain, NLTK, SpaCy, and more

🧰 Tech Stack

  • Frontend/UI: Streamlit
  • LLM: Google Gemini (gemini-1.5-flash)
  • Embeddings: GoogleGenerativeAIEmbeddings (embedding-001)
  • Text Splitting & QA Chain: LangChain
  • Vector Store: FAISS
  • NER: SpaCy (en_core_web_sm)
  • PDF Reader: PyPDF2
  • Preprocessing: NLTK
  • Visualization: WordCloud, Pandas

πŸ“ Installation

git clone https://github.com/your-username/smart-guide-agent.git
cd smart-guide-agent

πŸ”§ Install Requirements

pip install -r requirements.txt

πŸ“¦ Required NLTK & SpaCy Models

python -m nltk.downloader punkt stopwords wordnet
python -m spacy download en_core_web_sm

πŸ” Setup Google API Key

Get your API key from Google AI Studio. Create a .env file in the root directory:

GOOGLE_API_KEY=your_google_api_key

▢️ Run the App

streamlit run app.py

About

AI agent which can read the doc and answer your quries

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages