
DevRev Search Challenge Submission #4

Open
shrey2003 wants to merge 9 commits into devrev:main from shrey2003:main

Conversation


@shrey2003 shrey2003 commented Mar 11, 2026

I built a custom hybrid retrieval pipeline using SOTA models rather than the baseline FAISS approach.

System Details:
System Description: Hybrid Search pipeline combining dense embeddings (VoyageAI voyage-4-large 2048-dim) via Alibaba's Zvec database, with sparse lexical retrieval (BM25), fused together and passed through a cross-encoder reranker (VoyageAI Rerank-2.5).
System Type: Hybrid / RAG Retriever
Open Source: Not open source (rerankers and the embedding models are closed source, vector db and sparse retrieval is open source)

Looking forward to seeing the results on the leaderboard!

https://app.devrev.ai/devrev/works/ISS-269621/

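The description above says the dense and BM25 rankings are "fused together" but doesn't name the fusion method. A minimal sketch of one common choice, reciprocal rank fusion (RRF); the doc IDs and the k=60 constant are illustrative assumptions, not details from this submission:

```python
# Reciprocal rank fusion (RRF): merge several ranked lists of doc IDs
# into one list by summing 1 / (k + rank) per list (rank is 1-based).
# k=60 is the constant from the original RRF paper, not this pipeline's.
def rrf_fuse(rankings, k=60):
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; this list would then go to the reranker.
    return sorted(scores, key=scores.get, reverse=True)

dense_top = ["d3", "d1", "d7", "d2"]  # hypothetical dense-retrieval ranking
bm25_top = ["d1", "d9", "d3", "d4"]   # hypothetical BM25 ranking
fused = rrf_fuse([dense_top, bm25_top])
print(fused[:3])  # → ['d1', 'd3', 'd9']
```

Documents ranked highly by both retrievers (d1, d3) float to the top, which is the behaviour that makes a fused candidate list a good input for a cross-encoder reranker.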
@shrey2003 (Author)

@nimit2801 @prakhar7651 The description validation workflow is failing, and I can't find a suitable template in the README. Do I have to follow an official template for the submission?

@nimit2801 (Collaborator)

hey @shrey2003

Kindly add this in your PR description: https://app.devrev.ai/devrev/works/ISS-269621

shrey2003 changed the title to DevRev Search Challenge Submission (Mar 12, 2026)
@prakhar7651 (Contributor)

Hey Shreya!
Thanks for the submission, your results look promising on our benchmarks!
Can you confirm your topK is set to something greater than 50? We want to evaluate recall@k for various k values.

@shrey2003 (Author)

Hey @prakhar7651 thanks for your evaluation!
Top k is currently set to 10 for generating the test query results. Let me know if you have any further questions.
(screenshot attached)

@shrey2003 (Author)

shrey2003 commented Mar 26, 2026

As per the instructions this json contains my results: test_queries_results.json @prakhar7651 @nimit2801

@prakhar7651 (Contributor)

Hey!
These are your scores.
Recall@10: 0.2771
Precision@10: 0.3076
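For context on these numbers, recall@k and precision@k are typically computed per query and then averaged; a minimal sketch with hypothetical toy data (the actual evaluation script isn't shown in this thread). Note that precision@k divides by a fixed k, and evaluating recall@k for several k values requires at least max(k) results per query, which is why topK needs to exceed the largest k evaluated:

```python
def recall_precision_at_k(results, relevant, k=10):
    """Mean recall@k and precision@k over queries.

    results:  {query_id: ranked list of retrieved doc IDs}
    relevant: {query_id: set of relevant (gold) doc IDs}
    """
    recalls, precisions = [], []
    for qid, ranked in results.items():
        rel = relevant[qid]
        hits = sum(1 for doc in ranked[:k] if doc in rel)
        recalls.append(hits / len(rel) if rel else 0.0)
        precisions.append(hits / k)
    n = len(results)
    return sum(recalls) / n, sum(precisions) / n

# Hypothetical toy data; k=2 for brevity.
results = {"q1": ["d1", "d2", "d3"], "q2": ["d9", "d4"]}
relevant = {"q1": {"d1", "d5"}, "q2": {"d4"}}
r, p = recall_precision_at_k(results, relevant, k=2)
print(r, p)  # → 0.75 0.5
```

Recall and precision at the same k can diverge like this whenever queries have more (or fewer) relevant documents than k.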

@shrey2003 (Author)

@prakhar7651 Thanks for the evaluation. I've improved my script and rerun the results; I found a few bugs that were affecting them. Can I resubmit after improving?

@prakhar7651 (Contributor)

Yes, you can. Let me know when you're done and tell me which file to evaluate.

@shrey2003 (Author)

test_queries_results_new.json @prakhar7651 this is the corrected file. Please evaluate it. Thanks!

@prakhar7651 (Contributor)

In this commit - 41725c563d3f5690747ed7d57d88a579405efd57, you didn't make any changes in the notebook and submitted a new file. Is this expected?

@shrey2003 (Author)

shrey2003 commented Mar 31, 2026

@prakhar7651 Yes, I ran the results with a corrected Python script. I'll update the notebook if the results turn out better, which is why I didn't upload it yet. Can you check how the new result is performing?

@shrey2003 (Author)

shrey2003 commented Mar 31, 2026

@prakhar7651 Can you evaluate this now? I think today is the last day for submission.

@shrey2003 (Author)

@nimit2801

@prateekjain2606

@shrey2003 can you please update your code before we release the results for the latest submission, so we can verify the code and ensure the results are reproducible?

@shrey2003 (Author)

@prateekjain2606 I have already pushed the run_submission.py file, please check.

@shrey2003 (Author)

shrey2003 commented Apr 1, 2026

@prakhar7651 Can you evaluate this too? I have already added my submission file and code here. If you can, please tell me the scores of both files; I just want to check whether there is any improvement:
test_queries_results_new.json (this is with my updated method) and test_queries_results.json

@prakhar7651 (Contributor)

Here are your updated scores,

Recall: 0.4497
Precision: 0.2935

The scores we previously posted for your old submission were incorrect (an error on our side); these are the true values for your old submission:

Recall: 0.1822
Precision: 0.1315

@prakhar7651 (Contributor)

Looking at the quality of submissions and folks' eagerness to contribute, we're extending the deadline to April 7th. Evaluations will still be ongoing. Please keep contributing.

@shrey2003 (Author)

shrey2003 commented Apr 1, 2026

@prakhar7651 Now this looks good! I was quite surprised earlier to see it perform so badly, since across almost every benchmark and my own tests, Voyage has the best reranker and embedding models out there; no one can compete with them. There was an issue in my earlier code. I'll try more combinations and see if I can improve the scores further.
