March 7, 2022 | Open Access

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

Abstract

We present DINO (DETR with Improved deNoising anchOr boxes), a state-of-the-art end-to-end object detector. DINO improves over previous DETR-like models in performance and efficiency by using a contrastive approach to denoising training, a mixed query selection method for anchor initialization, and a look-forward-twice scheme for box prediction. DINO achieves 49.4 AP in 12 epochs and 51.3 AP in 24 epochs on COCO with a ResNet-50 backbone and multi-scale features, yielding a significant improvement of +6.0 AP and +2.7 AP, respectively, compared to DN-DETR, the previous best DETR-like model. DINO scales well in both model size and data size. Without bells and whistles, after pre-training on the Objects365 dataset with a SwinL backbone, DINO obtains the best results on both COCO val2017 (63.2 AP) and test-dev (63.3 AP). Compared to other models on the leaderboard, DINO uses a significantly smaller model and less pre-training data while achieving better results. Our code will be available at https://github.com/IDEACVR/DINO.

Authors

Hao Zhang

Feng Li

Shilong Liu
