feat: finalize hybrid search pipeline with query expansion and RRF #9

Open
sam131725 wants to merge 3 commits into devrev:main from sam131725:main

sam131725 commented Mar 26, 2026

Overview

This PR contains the final search results for the DevRev Search Benchmark. The pipeline has been optimized for high Recall@10 and MRR using a multi-stage retrieval strategy.

Key Components

  • Query Expansion: Uses llama3.2:3b to expand user queries for better semantic matching.
  • Hybrid Retrieval: Combines keyword (BM25) and semantic (qwen3-embedding) scores.
  • Fusion: Merges the results using Reciprocal Rank Fusion (RRF).
  • Re-ranking: Selects the final top 10 with the ms-marco-MiniLM cross-encoder.
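
For readers unfamiliar with the fusion step, here is a minimal sketch of Reciprocal Rank Fusion in Python. It is not the code from this PR: the function name `reciprocal_rank_fusion`, the document IDs, and the constant `k=60` (a commonly used default) are assumptions made for illustration only.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists that
    contain it, where rank starts at 1. k=60 is an assumed default,
    not a value taken from this PR.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical ranked outputs from the two retrievers.
bm25_ranking = ["doc_a", "doc_b", "doc_c"]
dense_ranking = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
print(fused)
```

Because RRF uses only rank positions, it needs no score normalization between BM25 and the embedding model. In a pipeline like the one described, the fused list would then be passed to the cross-encoder for final re-ranking.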

Files Included

  • test_queries_results.json: Official submission file.

work-item: ISS-1

@sam131725
Author

re-run validation

@prakhar7651
Contributor

Do you have a file to be evaluated?

@sam131725
Author

Please evaluate test_queries_results.json

@prakhar7651
Contributor

Hey!
These are your scores.
Recall@10: 0.3156
Precision@10: 0.2154
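
For context on how to read these numbers, the following is a rough sketch of the standard per-query definitions of Recall@10 and Precision@10. The benchmark's actual evaluation script is not shown in this thread, so the function names, the sample IDs, and the assumption that per-query values are averaged over all queries are illustrative only.

```python
def recall_at_k(retrieved, relevant, k=10):
    """Fraction of the relevant documents that appear in the top k."""
    if not relevant:
        return 0.0
    hits = len(set(retrieved[:k]) & set(relevant))
    return hits / len(relevant)


def precision_at_k(retrieved, relevant, k=10):
    """Fraction of the top k slots occupied by relevant documents."""
    hits = len(set(retrieved[:k]) & set(relevant))
    return hits / k


# Hypothetical single query: 10 retrieved IDs, 4 relevant IDs.
retrieved = ["d1", "d2", "d3", "d4", "d5", "d6", "d7", "d8", "d9", "d10"]
relevant = {"d2", "d5", "d11", "d12"}

recall = recall_at_k(retrieved, relevant)        # 2 of 4 relevant found
precision = precision_at_k(retrieved, relevant)  # 2 of 10 slots relevant
print(recall, precision)
```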
