articleFeb 26, 2025Closed access
RT-DETRv3: Real-Time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Indexed incrossref
Abstract
RT-DETR is the first real-time end-to-end transformer-based object detector. Its efficiency comes from the frame-work design and the Hungarian matching. However, compared to dense supervision detectors like the YOLO se-ries, the Hungarian matching provides much sparser su-pervision, leading to insufficient model training and diffi-cult to achieve optimal results. To address these issues, we proposed a hierarchical dense positive supervision method based on RT-DETR, named RT-DETRv3. Firstly, we in-troduce a CNN-based auxiliary branch that provides dense supervision that collaborates with the original decoder to enhance the encoder's feature representation. Secondly, to address insufficient decoder training, we…
Citation impact
75
total citations
- FWCI
- 75.69
- Percentile
- 100%
- References
- 28
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- End-to-end principle
- Object (grammar)
- Artificial intelligence
No related works found for this paper.