preprintarXiv (Cornell University)Jan 28, 2022GREEN OA

DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

Indexed inarxivdatacite

Abstract

We present in this paper a novel query formulation using dynamic anchor boxes for DETR (DEtection TRansformer) and offer a deeper understanding of the role of queries in DETR. This new formulation directly uses box coordinates as queries in Transformer decoders and dynamically updates them layer-by-layer. Using box coordinates not only helps using explicit positional priors to improve the query-to-feature similarity and eliminate the slow training convergence issue in DETR, but also allows us to modulate the positional attention map using the box width and height information. Such a design makes it clear that queries in DETR can be implemented as performing soft ROI pooling layer-by-layer in a cascade manner.…

Citation impact

396
total citations
FWCI
Percentile
References
0
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Pooling
  • Benchmark (surveying)
  • Data mining
  • Geocoding
  • Information retrieval
  • Artificial intelligence
  • Cartography
No related works found for this paper.