Conversation
|
@nimit2801 Please help with the PR description validation. |
|
Please use this json for results: |
|
hey @srini047 Kindly add this in your PR description: https://app.devrev.ai/devrev/works/ISS-269621 |
|
Hey! |
|
Hey! |
What are the older scores then? |
Previous calculations had some errors. We have added the correct scores. Please check. |
|
Looking at the quality of submissions and eagerness for folks to contribute, we're extending the deadline to April 7th. Evaluations would be still going on. Please keep contributing. |
Implementation:
BM42 Hybrid Retrieval with Haystack + Qdrant - Hybrid search system combining sparse (BM42) and dense (mxbai-embed-large-v1) embeddings for information retrieval on the DevRev knowledge base.
Indexing Pipeline:
DocumentCleaner → FastembedSparseDocumentEmbedder (BM42)
→ SentenceTransformersDocumentEmbedder (1024-dim)
→ DocumentWriter (Qdrant)
Retrieval Pipeline:
FastembedSparseTextEmbedder (BM42) + SentenceTransformersTextEmbedder
→ QdrantHybridRetriever (RRF fusion)
Work item: ISS-269621