Paper
12 September 2024 Enhanced feline facial recognition: advancing cat face detection with YOLOv8 and TensorRT
Teng Wang
Author Affiliations +
Proceedings Volume 13256, Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024); 132560S (2024) https://doi.org/10.1117/12.3037875
Event: Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024), 2024, Anshan, China
Abstract
In the field of computer vision, the ability to accurately detect and recognise animal features in various environments is an area of growing interest and application. This study presents an advanced cat face detection system utilising the You Only Look Once Version 8 (YOLOv8) architecture, enhanced with TensorRT optimisation for real-time processing. The approach involves a comprehensive data augmentation process to improve detection accuracy across diverse cat breeds and environmental conditions. Performance evaluation is based on quantifiable metrics; the optimised model achieves a notable reduction in inference time from 50.1ms to 0.9ms and a decrease in GPU power usage from 77 watts to 63 watts, without compromising accuracy. The accelerated processing speed and reduced power requirements make the system highly suitable for real-time applications, such as pet monitoring systems or behaviour analysis tools, where rapid and accurate detection is paramount. The research highlights the potential of deep learning algorithms in precise animal feature recognition and contributes to the field of computer vision by addressing challenges in small, diverse object detection.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Teng Wang "Enhanced feline facial recognition: advancing cat face detection with YOLOv8 and TensorRT", Proc. SPIE 13256, Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024), 132560S (12 September 2024); https://doi.org/10.1117/12.3037875
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Data modeling

Performance modeling

Education and training

Facial recognition systems

Animals

Computer vision technology

Back to Top