preprintarXiv (Cornell University)Mar 7, 2022GREEN OA

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

Indexed inarxivdatacite

Abstract

We present DINO (\textbf{D}ETR with \textbf{I}mproved de\textbf{N}oising anch\textbf{O}r boxes), a state-of-the-art end-to-end object detector. % in this paper. DINO improves over previous DETR-like models in performance and efficiency by using a contrastive way for denoising training, a mixed query selection method for anchor initialization, and a look forward twice scheme for box prediction. DINO achieves $49.4$AP in $12$ epochs and $51.3$AP in $24$ epochs on COCO with a ResNet-50 backbone and multi-scale features, yielding a significant improvement of $\textbf{+6.0}$\textbf{AP} and $\textbf{+2.7}$\textbf{AP}, respectively, compared to DN-DETR, the previous best DETR-like model. DINO scales well in both…

Citation impact

756
total citations
FWCI
Percentile
References
0
Citations per year

Authors

8

Topics & keywords

Keywords
  • Initialization
  • End-to-end principle
  • Object (grammar)
  • Code (set theory)
  • Algorithm
  • Computer science
  • Reduction (mathematics)
  • Discrete mathematics
No related works found for this paper.