Hi, thanks for the great work on this project!
I'm interested in running evaluation (retrieval+reranking) on a custom set of samples and would like to process them in the same way as the existing datasets. Could you point me to the code or script that was used to generate the processed datasets, such as swe-bench-lite and locbench?
Thanks in advance!
Hi, thanks for the great work on this project!
I'm interested in running evaluation (retrieval+reranking) on a custom set of samples and would like to process them in the same way as the existing datasets. Could you point me to the code or script that was used to generate the processed datasets, such as swe-bench-lite and locbench?
Thanks in advance!