Add hybrid triple-path retrieval pipeline with cross-encoder reranking #6

Open
i-anishR-droid wants to merge 3 commits into main from enhanced-pipeline

Conversation

@i-anishR-droid commented Mar 12, 2026

Built a custom hybrrid retrieval pipeline for DevRev Search, replacing the baseline FAISS-only approach with a multi-signal retrieval system.

  • Dense Retrieval: Snowflake/snowflake-arctic-embed-l-v2.0 (1024-dim) embeddings indexed with FAISS IndexFlatIP (cosine similarity on normalized vectors) over ~65K knowledge base documents
  • Sparse Retrieval (dual): BM25Okapi on full-text (title + cleaned body) and a separate BM25Okapi on titles only (2x boosted RRF weight) for high-precision title matching
  • Fusion: Reciprocal Rank Fusion (RRF, k=60) across all three retrieval paths, run over up to 3 rule-based query expansions per input query
  • Reranking: cross-encoder/ms-marco-MiniLM-L-6-v2 cross-encoder on the top-60 fused candidates, returning the top-10
  • Text Cleaning: Strips b'...' byte-string artifacts, normalizes escaped unicode, collapses whitespace
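The fusion step above can be sketched in plain Python. This is a minimal illustration of weighted Reciprocal Rank Fusion (score contribution `w / (k + rank)` per path), not the PR's actual code; the function name and example document IDs are made up for the demo, with the title path given the 2x weight described above.

```python
def rrf_fuse(ranked_lists, weights=None, k=60):
    """Weighted Reciprocal Rank Fusion: score(d) = sum_i w_i / (k + rank_i(d))."""
    weights = weights or [1.0] * len(ranked_lists)
    scores = {}
    for ranking, w in zip(ranked_lists, weights):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + w / (k + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)

# Toy rankings from the three retrieval paths (illustrative IDs)
dense  = ["d3", "d1", "d2"]
bm25   = ["d1", "d3", "d4"]
titles = ["d2", "d1", "d5"]

# Title path weighted 2x, as in the pipeline description
fused = rrf_fuse([dense, bm25, titles], weights=[1.0, 1.0, 2.0], k=60)
```

With `k=60` the rank differences are small relative to `k`, so RRF rewards documents that appear on multiple paths more than documents that top a single path.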

System Details:

  • System Description: Hybrid search pipeline combining dense semantic embeddings (Snowflake/snowflake-arctic-embed-l-v2.0, 1024-dim) via FAISS IndexFlatIP, with dual sparse lexical retrieval (full-text BM25 + title-only BM25 with 2x boosted weight), fused using Reciprocal Rank Fusion across rule-based query expansions (up to 3 variants), then reranked with a cross-encoder (cross-encoder/ms-marco-MiniLM-L-6-v2).
  • System Type: Hybrid / RAG Retriever
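The reranking stage could be structured roughly as below. This is a sketch, not the PR's implementation: `rerank` is a hypothetical helper, and the scoring callable is injected so the logic is testable without downloading the model. The commented lines assume the `CrossEncoder.predict` API from the sentence-transformers library (a list of (query, passage) pairs in, a list of scores out).

```python
def rerank(query, candidates, score_fn, top_k=10):
    """Score each (query, doc) pair and keep the top_k by descending score.

    `score_fn` is expected to behave like sentence_transformers
    CrossEncoder.predict: pairs in, relevance scores out.
    """
    scores = score_fn([(query, doc) for doc in candidates])
    order = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return [candidates[i] for i in order[:top_k]]

# In the real pipeline (assumed usage, not verified here):
# from sentence_transformers import CrossEncoder
# ce = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
# top10 = rerank(query, fused_top60, ce.predict, top_k=10)
```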

Open Source: Fully open source; all models, retrieval infrastructure, and pipeline code are publicly available.

ISS-1

- Add run_pipeline.py: standalone pipeline with Snowflake arctic-embed-l-v2.0
  dense embeddings, dual BM25 (full-text + title-only), RRF fusion, and
  cross-encoder reranking (ms-marco-MiniLM-L-6-v2); runs 92 test queries in ~2 min
- Update devrev_search.ipynb: refactored Section 9 as infrastructure setup,
  added Section 10 Multi-Query Triple-Path Retrieval strategy
- Add submission outputs: test_queries_results.json/.parquet (latest run),
  enhanced and old variants for comparison
- Ignore large embeddings_*.npy files via .gitignore

Made-with: Cursor
@prakhar7651
Contributor

Hey!
These are your scores.
Recall@10: 0.3549
Precision@10: 0.3434

@prakhar7651
Contributor

Can you also try without cross encoder? And try with various other boost configs? Also did you experiment with RRF_K values?

@i-anishR-droid
Author

> Can you also try without cross encoder? And try with various other boost configs? Also did you experiment with RRF_K values?

I'm trying it without the cross-encoder now, testing around 14 different weight combinations for the dense, BM25, and title paths (e.g., dense-only, BM25-only, title-only, equal weights, title-boost-3x/4x, dense-boost-2x), both with and without the reranker, and experimenting with RRF_K values of 10, 20, 30, 40, 60, 80, and 100 to find the sweet spot for rank fusion, again both with and without reranking.

For this submission I only used rrf_k=60; no RRF_K experimentation was done.
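One way to enumerate that sweep is a simple config grid. This is an illustrative sketch, not the author's code: the preset names and (dense, bm25, title) weight tuples are guesses based on the combinations mentioned above, and each dict would be passed to the pipeline runner.

```python
from itertools import product

RRF_KS = [10, 20, 30, 40, 60, 80, 100]

# (dense, bm25_fulltext, bm25_title) weight presets; names are illustrative
WEIGHT_PRESETS = {
    "dense_only":     (1.0, 0.0, 0.0),
    "bm25_only":      (0.0, 1.0, 0.0),
    "title_only":     (0.0, 0.0, 1.0),
    "equal":          (1.0, 1.0, 1.0),
    "title_boost_2x": (1.0, 1.0, 2.0),
    "title_boost_3x": (1.0, 1.0, 3.0),
    "title_boost_4x": (1.0, 1.0, 4.0),
    "dense_boost_2x": (2.0, 1.0, 1.0),
}

# Every preset x every RRF_K x with/without the cross-encoder
configs = [
    {"name": name, "weights": w, "rrf_k": k, "rerank": r}
    for (name, w), k, r in product(WEIGHT_PRESETS.items(), RRF_KS, [False, True])
]
# 8 presets x 7 k values x 2 rerank settings = 112 runs
```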

@prakhar7651
Contributor

Did you use any framework for benchmarking all these different configs?

@i-anishR-droid
Author

> Did you use any framework for benchmarking all these different configs?

No, I haven't used any benchmarking framework. Everything (config management, metric computation, result tracking) is inline Python.
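The inline metric computation amounts to a few lines per query. A minimal sketch of per-query Precision@k and Recall@k (function name is illustrative; the reported Recall@10 of 0.3549 and Precision@10 of 0.3434 would be these values averaged over the 92 test queries):

```python
def precision_recall_at_k(retrieved, relevant, k=10):
    """Per-query Precision@k and Recall@k for one ranked result list."""
    top = retrieved[:k]
    hits = len(set(top) & set(relevant))
    precision = hits / k                                  # hits among top-k slots
    recall = hits / len(relevant) if relevant else 0.0    # hits among all relevant docs
    return precision, recall
```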
