[PDF] YOLOv10: Real-Time End-to-End Object Detection

Skip to search formSkip to main contentSkip to account menu

Semantic ScholarSemantic Scholar's Logo

Corpus ID: 269983404

@inproceedings{Wang2024YOLOv10RE, title={YOLOv10: Real-Time End-to-End Object Detection}, author={Ao Wang and Hui Chen and Lihao Liu and Kai Chen and Zijia Lin and Jungong Han and Guiguang Ding}, year={2024}, url={https://api.semanticscholar.org/CorpusID:269983404}}

Ao Wang, Hui Chen, Guiguang Ding
Published 23 May 2024
Computer Science, Engineering

A new generation of YOLO series for real-time end-to-end object detection, dubbed YOLOv10, is presented and the holistic efficiency-accuracy driven model design strategy for YOLOs is introduced, which greatly reduces the computational overhead and enhances the capability.

[PDF] Semantic Reader

1 Citation

Figures and Tables from this paper

table 1
figure 2
figure 3
table 4
figure 4
table 5
table 7
table 13
table 14
table 15

Ask This Paper
BETA
AI-Powered

Our system tries to constrain to information found in this paper. Results quality may vary. Learn more about how we generate these answers.

Feedback?

One Citation

SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving

Yiming CuiCheng HanDongfang Liu

Computer Science, Engineering

2024

This work introduces a stepwise spatial global-local aggregation network that combines the local information from the neighboring frames and global semantics from the current frame to eliminate the feature degradation in video object detection.

[PDF]

74 References

DETRs Beat YOLOs on Real-time Object Detection

Wenyu LvShangliang Xu Yi Liu

Computer Science

ArXiv

2023

This paper proposes the Real-Time DEtection TRansformer (RT-DETR), the first real-time end-to-end object detector to the best knowledge that addresses the above dilemma and designs an efficient hybrid encoder to expeditiously process multi-scale features by decoupling intra-scale interaction and cross-scale fusion to improve speed.

93
Highly Influential

[PDF]

DAMO-YOLO : A Report on Real-Time Object Detection Design

Xianzhe XuYiqi JiangWeihua ChenYi-Li HuangYuanhang ZhangXiuyu Sun

Computer Science

ArXiv

2022

DAMO-YOLO is extended from YOLO with some new technologies, including Neural Architecture Search (NAS), efficient Reparameterized Generalized-FPN (RepGFPN), a lightweight head with AlignedOTA label assignment, and distillation enhancement to improve performance to a higher level.

[PDF]

Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism

Chengcheng WangWei He Yunhe Wang

Computer Science

NeurIPS

2023

This study provides an advanced Gatherand-Distribute mechanism (GD) mechanism, which is realized with convolution and self-attention operations, and implements MAE-style pretraining in the YOLO-series for the first time, allowing Y OLOseries models could be to benefit from unsupervised pretraining.

20
Highly Influential

[PDF]

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Chengqi LyuWenwei Zhang Kai Chen

Computer Science

ArXiv

2022

An efficient real-time object detector is designed that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection and introduces soft labels when calculating matching costs in the dynamic label assignment to improve accuracy.

[PDF]

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection

Yuming ChenXinbin YuanRuiqi WuJiabao WangQibin HouMingg-Ming Cheng

Computer Science

ArXiv

2023

The core design is based on a series of investigations on how convolutions with different kernel sizes affect the detection performance of objects at different scales, resulting in a new strategy that can strongly enhance multi-scale feature representations of real-time object detectors.

8
Highly Influential

[PDF]

End-to-End Object Detection with Fully Convolutional Network

Jianfeng WangLin SongZeming LiHongbin SunJian SunN. Zheng

Computer Science

2021 IEEE/CVF Conference on Computer Vision and…

2021

A Prediction-aware One-To-One (POTO) label assignment for classification is introduced to enable end-to-end detection, which obtains comparable performance with NMS and a simple 3D Max Filtering is proposed to utilize the multi-scale features and improve the discriminability of convolutions in the local region.

[PDF]

What Makes for End-to-End Object Detection?

Pei SunYi Jiang Ping Luo

Computer Science

ICML

2021

It is pointed out that one-to-one positive sample assignment is the key factor, while, one- to-many assignment in previous detectors causes redundant predictions in inference, and the concept of score gap is introduced to explore the effect of matching cost.

[PDF]

End-to-End Object Detection with Transformers

Nicolas CarionFrancisco MassaGabriel SynnaeveNicolas UsunierAlexander KirillovSergey Zagoruyko

Computer Science

ECCV

2020

This work presents a new method that views object detection as a direct set prediction problem, and demonstrates accuracy and run-time performance on par with the well-established and highly-optimized Faster RCNN baseline on the challenging COCO object detection dataset.

8,870

[PDF]

MOTR: End-to-End Multiple-Object Tracking with TRansformer

Fangao ZengBin DongTiancai WangCheng ChenX. ZhangYichen Wei

Computer Science

ECCV

2022

MOTR is proposed, which extends DETR and introduces track query to model the tracked instances in the entire video to enhance temporal relation modeling and serve as a stronger baseline for future research on temporal modeling and Transformer-based trackers.

[PDF]

You Only Look Once: Unified, Real-Time Object Detection

Joseph RedmonS. DivvalaRoss B. GirshickAli Farhadi

Computer Science

2016 IEEE Conference on Computer Vision and…

2016

Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

29,480
Highly Influential

[PDF]

...

Related Papers

Showing 1 through 3 of 0 Related Papers

[PDF] YOLOv10: Real-Time End-to-End Object Detection | Semantic Scholar (2024)

Figures and Tables from this paper

Ask This PaperBETAAI-Powered

One Citation

74 References

Related Papers

References

Ask This Paper
BETA
AI-Powered