Overview of the TREC 2021 deep learning track

Craswell, Nick; Mitra, Bhaskar; Yılmaz, Emine; Campos, Daniel; Daniel, Campos,; Jimmy, Lin,; M., Voorhees, Ellen; Ian, Soboroff,

doi:10.48550/arxiv.2507.08191

preprintArXiv.orgJul 10, 2025GREEN OA

Overview of the TREC 2021 deep learning track

NCNick Craswell BMBhaskar Mitra EYEmine Yılmaz DCDaniel CamposCDCampos, Daniel

Microsoft (United States) · University College London · +1 more institution

Indexed inarxivdatacite

Abstract

This is the fifth year of the TREC Deep Learning track. As in previous years, we leverage the MS MARCO datasets that made hundreds of thousands of human-annotated training labels available for both passage and document ranking tasks. We mostly repeated last year's design, to get another matching test set, based on the larger, cleaner, less-biased v2 passage and document set, with passage ranking as primary and document ranking as a secondary task (using labels inferred from passage). As we did last year, we sample from MS MARCO queries that were completely held out, unused in corpus construction, unlike the test queries in the first three years. This approach yields a more difficult test with more headroom for…

Citation impact

57

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

8

NC
Nick CraswellCorresponding
Microsoft (United States)
BM
Bhaskar Mitra
Microsoft (United States)
EY
Emine Yılmaz
University College London
DC
Daniel Campos
Microsoft (United States)
CD
Campos, Daniel
National Institute of Standards and Technology

Topics & keywords

Topics

Keywords

Computer science
Pooling
Ranking (information retrieval)
Deep learning
Test set
Artificial intelligence
Task (project management)
Training set

No related works found for this paper.