6 April 2021 Weighted feature fusion and attention mechanism for object detection
Yanhao Cheng, Weibin Liu, Weiwei Xing
Author Affiliations +
Abstract

Recently, anchor-free methods have brought new ideas to the field of object detection that eliminate the need for anchor boxes in object detection and provide a simpler detection structure. CenterNet is the representative anchor-free method. However, this method still has the problem of obtaining high-resolution representation from low-resolution representation using upsampling, and the predicted heatmap is not accurate enough in space and does not make full use of the shallow low-level features of the network. We introduce CenterNet-HRA to solve this problem. An attention module is proposed to calibrate the high-level semantic features of the network output using the shallow low-level features from different receptive fields; HRNet is used as the backbone to maintain high-resolution feature representation through the whole process rather than using upsampling to generate high-resolution feature representation as HourglassNet. Considering that the feature representations with different resolutions have different contributions to the network but HRNet fuses them without distinction, a novel weighted feature fusion HRNet is designed to achieve higher detection precision. Our method achieves an average precision (AP) of 42.3% at 13.5 frames-per-second (FPS) (40.3% AP at 13.3 FPS for CenterNet-HG) on the MS-COCO benchmark.

© 2021 SPIE and IS&T 1017-9909/2021/$28.00© 2021 SPIE and IS&T
Yanhao Cheng, Weibin Liu, and Weiwei Xing "Weighted feature fusion and attention mechanism for object detection," Journal of Electronic Imaging 30(2), 023015 (6 April 2021). https://doi.org/10.1117/1.JEI.30.2.023015
Received: 4 December 2020; Accepted: 11 March 2021; Published: 6 April 2021
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Sensors

Convolution

Calibration

Deconvolution

Data modeling

Detection and tracking algorithms

Feature extraction

RELATED CONTENT

Object detection based on deep learning
Proceedings of SPIE (December 16 2021)

Back to Top