RAG Assistant

A Retrieval-Augmented Generation (RAG) based assistant that answers queries from an Operating Systems lab manual using local LLM inference with Ollama (Phi-3) and ChromaDB for semantic retrieval.

Overview

This project implements a context-aware AI assistant that retrieves relevant sections from an OS lab manual and generates precise, grounded responses.

Unlike naive LLM chat systems, this assistant:

Retrieves relevant context chunks from a vector database
Feeds them into a local LLM (Phi-3 via Ollama)
Produces factually grounded answers

Components

astAPI Backend

Handles user queries via REST API
Manages the RAG pipeline
Integrates retrieval + generation

ChromaDB

Stores embeddings of OS lab manual
Performs semantic similarity search
Returns top-k relevant chunks

Ollama (Phi-3)

Local LLM for inference
Generates answers using retrieved context
Ensures privacy (no external API calls)

Data Source

Operating Systems Lab Manual
Preprocessed into text chunks
Embedded and stored in ChromaDB

Workflow

User sends query to FastAPI
Query is embedded and matched in ChromaDB
Top-k relevant chunks are retrieved
Context is passed to Phi-3 via Ollama
LLM generates a grounded response

Features

Retrieval-Augmented Generation (RAG) pipeline
Semantic search using vector embeddings
Fully local LLM (Ollama Phi-3)
Fast API-based interaction
Domain-specific QA (OS lab manual)

Tech Stack

Backend: FastAPI
LLM: Phi-3 (via Ollama)
Vector DB: ChromaDB
Embeddings: Sentence Transformers / Ollama embeddings
Language: Python

Getting Started

1. Clone the Repository

bash id="z9f3r1" git clone https://github.com/yourusername/rag-assistant.git cd rag-assistant

2. Install Dependencies

bash id="d2l8xp" pip install -r requirements.txt

3. Start Ollama (Phi-3)

bash id="6c2mzk" ollama run phi3

4. Run FastAPI Server

bash id="d91kqf" uvicorn main:app --reload

5. Access API Docs

id="x7v9al" http://localhost:8000/docs

Example Query

id="n3k2pq" "What is deadlock and how can it be prevented?"

The system retrieves relevant sections and generates a context-aware answer.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.idea		.idea
Backend		Backend
db/chroma_db		db/chroma_db
docs		docs
.DS_Store		.DS_Store
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Assistant

Overview

Components

astAPI Backend

ChromaDB

Ollama (Phi-3)

Data Source

Workflow

Features

Tech Stack

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Start Ollama (Phi-3)

4. Run FastAPI Server

5. Access API Docs

Example Query

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG Assistant

Overview

Components

astAPI Backend

ChromaDB

Ollama (Phi-3)

Data Source

Workflow

Features

Tech Stack

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Start Ollama (Phi-3)

4. Run FastAPI Server

5. Access API Docs

Example Query

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages