Paper
6 May 2019 Multi-scale HOG feature used in object detection
Jin Li, Hong Zhang, Lei Zhang, Yawei Li, Qiaochu Kang, Yujie Wu
Author Affiliations +
Proceedings Volume 11069, Tenth International Conference on Graphics and Image Processing (ICGIP 2018); 110693U (2019) https://doi.org/10.1117/12.2524169
Event: Tenth International Conference on Graphic and Image Processing (ICGIP 2018), 2018, Chengdu, China
Abstract
Object detection is one of the most popular and difficult field in computer vision. Although deep learning methods have great performance on object detection. For specific application, algorithms which use hand-crafted features are still widely used. One main problem in object detection is the scale problem. Algorithms usually use image pyramid to cover as many scales as possible. But gaps still exist between scale levels in image pyramid. Our work extends some sub scale level to fill the gaps between image pyramids. To this end, we use Gaussian Scales Pyramid to generate sub-scale image and extract HOG feature on the sub-scale. We use framework offered by DPM algorithm and make modification on it. We compare the result of our method with DPM baseline on Pascal VOC database. Our work has great performance on some categories and makes an improvement on the overall performance. This work can be used in other object detection frameworks. We apply multi-scale HOG feature on pre-process procedure of our own detection framework and test it on our own dataset. Then the framework gains performance improvement on precision and recall rate of the pre-process procedure comparing to the original HOG feature architecture.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jin Li, Hong Zhang, Lei Zhang, Yawei Li, Qiaochu Kang, and Yujie Wu "Multi-scale HOG feature used in object detection", Proc. SPIE 11069, Tenth International Conference on Graphics and Image Processing (ICGIP 2018), 110693U (6 May 2019); https://doi.org/10.1117/12.2524169
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Detection and tracking algorithms

Computer vision technology

Databases

Feature extraction

Image processing

Machine vision

Neural networks

Back to Top