preprintArXiv.orgJul 10, 2025GREEN OA

Overview of the TREC 2021 deep learning track

Microsoft (United States) · University College London · +1 more institution

Indexed inarxivdatacite

Abstract

This is the fifth year of the TREC Deep Learning track. As in previous years, we leverage the MS MARCO datasets that made hundreds of thousands of human-annotated training labels available for both passage and document ranking tasks. We mostly repeated last year's design, to get another matching test set, based on the larger, cleaner, less-biased v2 passage and document set, with passage ranking as primary and document ranking as a secondary task (using labels inferred from passage). As we did last year, we sample from MS MARCO queries that were completely held out, unused in corpus construction, unlike the test queries in the first three years. This approach yields a more difficult test with more headroom for…

Citation impact

57
total citations
FWCI
Percentile
References
0
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Pooling
  • Ranking (information retrieval)
  • Deep learning
  • Test set
  • Artificial intelligence
  • Task (project management)
  • Training set
No related works found for this paper.