Improve retrieval quality with simple reranking
Currently we rely on vector similarity alone to pick top-k documents. In some cases this returns chunks that are close semantically but not the most useful for answering the query.
A simple improvement would be to introduce a reranking step after retrieval:
- expand the candidate pool slightly
- rerank using additional signals (e.g. keyword overlap, metadata)
- select top-k after reranking
This should improve answer quality without adding much complexity.
I can work on an initial version of this if this direction makes sense.
Improve retrieval quality with simple reranking
Currently we rely on vector similarity alone to pick top-k documents. In some cases this returns chunks that are close semantically but not the most useful for answering the query.
A simple improvement would be to introduce a reranking step after retrieval:
This should improve answer quality without adding much complexity.
I can work on an initial version of this if this direction makes sense.