1 December 2023 Application of improved YOLOv5 model on object detection across multiple datasets
Zongyi Shao, Rui Li, Tianfu He
Proceedings Volume 12940, Third International Conference on Control and Intelligent Robotics (ICCIR 2023); 129403A (2023) https://doi.org/10.1117/12.3010576
Event: Third International Conference on Control and Intelligent Robotics (ICCIR 2023), 2023, Sipsongpanna, China
Abstract
This paper presents a series of enhancements to YOLOv5, a member of the well-established YOLO (You Only Look Once) family of object detection models. The resulting detector performs strongly across diverse datasets. The main change replaces selected convolutions in the model using an enhanced reparameterization technique tailored to convolutional models. Combined with other effective enhancement strategies, the improved YOLOv5n model achieves a mean average precision (mAP) of 77.8 on the VOC2007 dataset, an 18% gain over the original model (version 6.0). This result places the improved YOLOv5n ahead of the state-of-the-art YOLOv8 model, while also improving frames per second (FPS) compared to its predecessor. A comprehensive set of experimental results supports the effectiveness of the approach and shows that the improved YOLOv5 is better suited to the requirements of various application domains within object detection.
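The abstract does not describe the enhanced reparameterization technique in detail, so the following is only a minimal sketch of the generic idea behind structural reparameterization (in the style popularized by RepVGG), not the authors' method. It assumes a training-time block with three parallel branches (a 3x3 convolution, a 1x1 convolution, and an identity shortcut) and shows how they can be merged into one 3x3 convolution for inference. Batch normalization folding is omitted for brevity, and the names `conv2d` and `fuse_branches` are hypothetical helpers written for this illustration.

```python
import numpy as np


def conv2d(x, w, b):
    """Naive stride-1 'same' convolution (cross-correlation).

    x: (C_in, H, W), w: (C_out, C_in, k, k) with odd k, b: (C_out,)
    """
    c_out, _, k, _ = w.shape
    pad = k // 2
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
    _, height, width = x.shape
    out = np.empty((c_out, height, width))
    for i in range(height):
        for j in range(width):
            patch = xp[:, i:i + k, j:j + k]  # (C_in, k, k)
            out[:, i, j] = np.tensordot(w, patch, axes=3) + b
    return out


def fuse_branches(w3, b3, w1, b1):
    """Merge 3x3, 1x1 and identity branches into a single 3x3 kernel."""
    c_out, c_in = w3.shape[:2]
    assert c_out == c_in, "identity branch needs equal channel counts"
    w = w3.copy()
    # A 1x1 kernel is a 3x3 kernel that is zero everywhere but the centre.
    w[:, :, 1, 1] += w1[:, :, 0, 0]
    # The identity map is a 3x3 kernel with 1 at the centre of channel i -> i.
    idx = np.arange(c_out)
    w[idx, idx, 1, 1] += 1.0
    return w, b3 + b1


# Assumed toy sizes: 4 channels, 8x8 feature map, random weights.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8, 8))
w3, b3 = rng.standard_normal((4, 4, 3, 3)), rng.standard_normal(4)
w1, b1 = rng.standard_normal((4, 4, 1, 1)), rng.standard_normal(4)

# Training-time form: three parallel branches summed.
y_branches = conv2d(x, w3, b3) + conv2d(x, w1, b1) + x

# Inference-time form: one fused 3x3 convolution.
w_fused, b_fused = fuse_branches(w3, b3, w1, b1)
y_fused = conv2d(x, w_fused, b_fused)

max_err = np.abs(y_branches - y_fused).max()
print("max abs difference:", max_err)
```

Because convolution is linear, the two forms agree up to floating-point rounding. The fused form needs a single convolution per block at inference time, which is one plausible way such a replacement could raise FPS without changing the trained function.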
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Zongyi Shao, Rui Li, and Tianfu He "Application of improved YOLOv5 model on object detection across multiple datasets", Proc. SPIE 12940, Third International Conference on Control and Intelligent Robotics (ICCIR 2023), 129403A (1 December 2023); https://doi.org/10.1117/12.3010576
KEYWORDS
Data modeling
Performance modeling
Education and training
Object detection
Head
Design and modelling
Convolution