articleFeb 26, 2025Closed access

RT-DETRv3: Real-Time End-to-End Object Detection with Hierarchical Dense Positive Supervision

Baidu (China)

Indexed incrossref

Abstract

RT-DETR is the first real-time end-to-end transformer-based object detector. Its efficiency comes from the frame-work design and the Hungarian matching. However, compared to dense supervision detectors like the YOLO se-ries, the Hungarian matching provides much sparser su-pervision, leading to insufficient model training and diffi-cult to achieve optimal results. To address these issues, we proposed a hierarchical dense positive supervision method based on RT-DETR, named RT-DETRv3. Firstly, we in-troduce a CNN-based auxiliary branch that provides dense supervision that collaborates with the original decoder to enhance the encoder's feature representation. Secondly, to address insufficient decoder training, we…

Citation impact

75
total citations
FWCI
75.69
Percentile
100%
References
28
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • End-to-end principle
  • Object (grammar)
  • Artificial intelligence
No related works found for this paper.