Skip to content

Omc12/RAG-stock-news

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Hybrid Stock News RAG (From Scratch)

This project implements a Retrieval Augmented Generation (RAG) system for stock news analysis using a hybrid retrieval strategy.

Features

  • Google News RSS ingestion
  • Semantic retrieval using embeddings
  • Keyword retrieval using BM25
  • Hybrid scoring for better Recall@K
  • Grounded LLM responses
  • Retrieval-first evaluation mindset

Tech Stack

  • Python
  • sentence-transformers
  • BM25 (rank-bm25)
  • OpenAI / Gemini
  • No RAG frameworks used

Why Hybrid RAG?

Embedding search captures meaning. BM25 captures exact terms (tickers, numbers). Hybrid improves recall and coverage.

How to Run

pip install -r requirements.txt
python src/main.py

About

This project implements a Retrieval Augmented Generation (RAG) system for stock news analysis using a hybrid retrieval strategy.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages