12 July 2024 Scale separation: video crowd counting with different density maps
Ao Zhang, Xin Deng, Baoying Liu, Weiwei Zhang, Jun Guo, Linrui Xie
Author Affiliations +
Abstract

Most crowd counting methods rely on integrating density maps for prediction, but they encounter performance degradation in the face of density variations. Existing methods primarily employ a multi-scale architecture to mitigate this issue. However, few approaches concurrently consider both scale and timing information. We propose a scale-divided architecture for video crowd counting. Initially, density maps of different Gaussian scales are employed to retain information at various scales, accommodating scale changes in images. Subsequently, we observe that the spatiotemporal network places greater emphasis on individual locations, prompting us to aggregate temporal information at a specific scale. This design enables the temporal model to acquire more spatial information and alleviate occlusion issues. Experimental results on various public datasets demonstrate the superior performance of our proposed method.

© 2024 SPIE and IS&T
Ao Zhang, Xin Deng, Baoying Liu, Weiwei Zhang, Jun Guo, and Linrui Xie "Scale separation: video crowd counting with different density maps," Journal of Electronic Imaging 33(4), 043016 (12 July 2024). https://doi.org/10.1117/1.JEI.33.4.043016
Received: 24 January 2024; Accepted: 15 May 2024; Published: 12 July 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Education and training

Convolution

Feature extraction

Ablation

Image processing

Visualization

Back to Top