Issue |
ITM Web Conf.
Volume 69, 2024
International Conference on Mobility, Artificial Intelligence and Health (MAIH2024)
|
|
---|---|---|
Article Number | 04008 | |
Number of page(s) | 6 | |
Section | Transactions | |
DOI | https://doi.org/10.1051/itmconf/20246904008 | |
Published online | 13 December 2024 |
Optimizing Convolution Operations for YOLOv4-based Object Detection on GPU
1 Université Paris-Saclay, ENS Paris-Saclay, CNRS, SATIE, 91190, Gif-sur-Yvette, France
2 Systems Analysis, Information Processing and Industrial Management Laboratory, Higher School of Technology of Sale, Mohamed V University, Rabat, Morocco
* e-mail: fatima-zahra.guerrouj@universite-paris-saclay.fr
Real-time object detection is crucial for autonomous vehicles, and YOLO (You Only Look Once) algorithms have demonstrated their effectiveness for this purpose. This study examines the performance of YOLOv4 [3] for real-time object detection on an embedded architecture. We focus on optimizing the computationally intensive convolution operations by employing the cuDNN library to achieve efficient inference. The evaluation assesses critical performance metrics, including object detection accuracy in terms of Mean Average Precision (mAP) and inference latency on the embedded architecture. We conduct a comparative analysis using the publicly available KITTI [7] database. The reported results establish a benchmark between the parallelized YOLOv4 model and the baseline implementation, assessing the advantages of cuDNN acceleration for real-time object detection on resource-constrained devices.
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.