Hi, I was comparing the performance of Pyserini to the Indri toolkit and found significant differences. I compared NDCG@10, NDCG@5, MAP@50 and P@5 on a sample of 30 queries from the 2019 TREC retrieval challenge.
Replies: 2 comments 21 replies
Yes, it is not a surprise that different retrieval engines produce slightly different results. How big are your differences in the various metrics?
I would first try to reproduce our scores exactly with the default parameters (default index stemmer, BM25), then change one thing at a time (BM25 to QL, the stemmer, etc.). That way you'll find out where you went wrong.
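
In case it helps, here is a rough sketch of that "change one thing at a time" loop. It is not Pyserini or Indri code: the setting names and values in `BASELINE` and `ALTERNATIVES` are hypothetical placeholders, and `evaluate` is a function you would have to write yourself to run retrieval with a given configuration and return a single metric (say NDCG@10).

```python
from typing import Callable, Dict, Iterator, List, Tuple

Config = Dict[str, str]
Change = Tuple[str, str]

# Hypothetical settings, for illustration only; substitute whatever
# your two toolkits actually expose.
BASELINE: Config = {"ranker": "bm25", "stemmer": "porter"}
ALTERNATIVES: Dict[str, List[str]] = {
    "ranker": ["ql"],
    "stemmer": ["krovetz", "none"],
}


def one_at_a_time(
    baseline: Config, alternatives: Dict[str, List[str]]
) -> Iterator[Tuple[Change, Config]]:
    """Yield configurations that differ from the baseline in exactly one setting."""
    for key, values in alternatives.items():
        for value in values:
            config = dict(baseline)  # copy, so the baseline is never mutated
            config[key] = value
            yield (key, value), config


def ablate(
    baseline: Config,
    alternatives: Dict[str, List[str]],
    evaluate: Callable[[Config], float],
) -> Dict[Change, float]:
    """Return the metric difference (variant minus baseline) for each single change."""
    base_score = evaluate(baseline)
    return {
        change: evaluate(config) - base_score
        for change, config in one_at_a_time(baseline, alternatives)
    }
```

You would call `ablate(BASELINE, ALTERNATIVES, evaluate)` once per metric; the change with the largest delta is the first place to look for the mismatch between the two toolkits.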