DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Indexed in: arXiv, DataCite
Abstract
We present a novel query formulation for DETR (DEtection TRansformer) that uses dynamic anchor boxes, and we offer a deeper understanding of the role of queries in DETR. The new formulation uses box coordinates directly as queries in the Transformer decoder and updates them dynamically, layer by layer. Using box coordinates not only provides explicit positional priors that improve query-to-feature similarity and eliminate the slow training convergence of DETR, but also allows the positional attention map to be modulated by the box width and height. This design makes it clear that queries in DETR can be implemented as performing soft ROI pooling layer by layer in a cascade manner.…
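The layer-by-layer update of box queries described in the abstract can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration rather than the authors' implementation: the function names (`refine_anchors`, `inverse_sigmoid`), the `(cx, cy, w, h)` layout normalized to [0, 1], and the choice to apply each layer's offsets in logit space so that boxes stay inside the unit square. The per-layer offsets would come from a decoder layer in a real model; here they are passed in as plain arrays.

```python
import numpy as np


def inverse_sigmoid(p, eps=1e-5):
    """Map values in (0, 1) to logit space, clipping to avoid infinities."""
    p = np.clip(p, eps, 1.0 - eps)
    return np.log(p / (1.0 - p))


def sigmoid(z):
    """Map logits back to (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))


def refine_anchors(anchors, deltas_per_layer):
    """Refine anchor boxes one decoder layer at a time (hypothetical helper).

    anchors: array of shape (num_queries, 4) holding (cx, cy, w, h) in [0, 1].
    deltas_per_layer: iterable of (num_queries, 4) offset arrays, one per
        layer; in a real decoder these would be predicted, here they are
        supplied by the caller.
    Returns the list of anchor arrays: the initial one plus one per layer.
    """
    history = [anchors]
    for deltas in deltas_per_layer:
        # Assumed update rule: add offsets in logit space, then squash back.
        anchors = sigmoid(inverse_sigmoid(anchors) + deltas)
        history.append(anchors)
    return history


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    initial = np.array([[0.5, 0.5, 0.2, 0.3]])  # one query box
    # Random stand-ins for the offsets three decoder layers might predict.
    offsets = [0.1 * rng.standard_normal((1, 4)) for _ in range(3)]
    for layer, boxes in enumerate(refine_anchors(initial, offsets)):
        print(layer, boxes.round(3))
```

Keeping the full history of boxes, rather than only the final one, mirrors the cascade view in the abstract: each layer starts from the box the previous layer produced.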
Citation impact
- Total citations: 396
- FWCI: not available
- Percentile: not available
- References: 0
Authors: 8
Topics & keywords
Topics
Keywords
- Computer science
- Pooling
- Benchmark (surveying)
- Data mining
- Geocoding
- Information retrieval
- Artificial intelligence
- Cartography