GitHub - harshit110927/RAGrealTime

 🧠 Real-Time Slack-integrated Retrieval-Augmented Generation (RAG) System

This project is a real-time, production-level RAG system designed to **learn from live Slack chats**, store them in a **local SQLite database**, and **respond instantly** to user queries within Slack channels.
It uses **semantic search** with **FAISS** and **SentenceTransformers**, a **CrossEncoder reranker**, and **TinyLLaMA** for final response generation — all deployed locally in a resource-efficient Docker container.

---

 🚀 Features

- ✅ Real-time chat ingestion from specified Slack channels
- 🔍 Semantic and fuzzy search with CrossEncoder reranking
- 🧠 Local knowledge base auto-updated via SQLite
- 🧾 Responses generated using TinyLLaMA 1.1B (Flan-T5 fallback supported)
- 🐳 Dockerized for easy deployment
- 📥 Supports live updates with no downtime
- ⚡ Slack bot integration using `slack_bolt`

---

 🧩 Project Structure & Modes

This project is **modular** and can operate in two modes:

| Component           | Description                                                                 |
|---------------------|-----------------------------------------------------------------------------|
| `app.py`            | **Standalone CLI RAG engine**. Can be used without Slack and integrated with|
|                     |any platform like MS Teams, internal chat tools, or REST APIs. Great for     |
|                     |testing, batch indexing, or non-Slack environments.                          |
| `slack_listener.py` | **Slack-integrated version**. Listens to Slack messages in real time and    |
|                     | uses `app.py` as the backend to serve answers directly within Slack threads.|
|                     | Acts as a plug-and-play example of real-world usage.                        |

 ✅ You can integrate `app.py` into any custom frontend, chatbot, or third-party system.

---

 🛠️ Tech Stack

| Layer         | Tools/Libs Used                                 |
|--------------|--------------------------------------------------|
| Embedding     | `sentence-transformers/all-MiniLM-L6-v2`        |
| Vector Search | `FAISS`                                         |
| Reranking     | `cross-encoder/ms-marco-MiniLM-L-6-v2`          |
| LLM           | `TinyLLaMA-1.1B-Chat-v1.0` via `transformers`   |
| Bot Layer     | `Slack Bolt SDK`                                |
| Storage       | `SQLite`                                        |
| Deployment    | `Docker`, `Python 3.10+`                        |

---

 📦 Installation & Local Setup

### 1. Clone the repo
```bash
git clone https://github.com/yourusername/realtime-rag.git
cd realtime-rag

2. Create `.env` file

cp .env.example .env

Edit .env and set the following:

SLACK_BOT_TOKEN=your-bot-token
SLACK_APP_TOKEN=your-app-level-token
MONITORED_CHANNELS=channel1,channel2

3. Build Docker image

docker build -t realtime-rag .

▶️ Running the Slack Bot

docker run --env-file .env --memory=6g --memory-swap=7g -it realtime-rag

Once you see:

✅ Slack listener started...
⚡️ Bolt app is running!

You're live!

⚙️ Running the Local RAG Engine (No Slack)

To test the CLI-based version or integrate with other platforms like Microsoft Teams:

python app.py

This will:

Watch live_input.txt for new input (via Watchdog)
Store data in SQLite
Serve answers via CLI prompts

🗃️ File Structure

.
├── app.py                  # Local/CLI mode RAG engine
├── slack_listener.py       # Slack listener + integration
├── rag_engine.py           # Encapsulated RAG logic (embed, search, rerank)
├── db_handler.py           # SQLite DB handler
├── Dockerfile              # Docker build file
├── requirements.txt
├── .env                    # Your environment variables
└── README.md

📈 Future Improvements

Add streaming generation
Deploy via LangServe or Kubernetes
Web dashboard for analytics
External plugin support for Teams, Discord, etc.

📄 License

This project is licensed under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

2. Create `.env` file

3. Build Docker image

▶️ Running the Slack Bot

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
data		data
.gitattributes		.gitattributes
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
db_handler.py		db_handler.py
rag_engine.py		rag_engine.py
requirements.txt		requirements.txt
slack_listener.py		slack_listener.py

Folders and files

Latest commit

History

Repository files navigation

2. Create .env file

3. Build Docker image

▶️ Running the Slack Bot

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Create `.env` file

Packages