Paper
2 November 2022 Sound source localization method based on dual microarrays and deep learning
Pan Su, Qingning Zeng, Chao Long
Author Affiliations +
Proceedings Volume 12455, International Conference on Signal Processing and Communication Security (ICSPCS 2022); 1245506 (2022) https://doi.org/10.1117/12.2655183
Event: International Conference on Signal Processing and Communication Security (ICSPCS 2022), 2022, Dalian, China
Abstract
In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Pan Su, Qingning Zeng, and Chao Long "Sound source localization method based on dual microarrays and deep learning", Proc. SPIE 12455, International Conference on Signal Processing and Communication Security (ICSPCS 2022), 1245506 (2 November 2022); https://doi.org/10.1117/12.2655183
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Source localization

Convolutional neural networks

Robots

Data modeling

Ear

Feature extraction

Neurons

Back to Top