Paper
8 June 2023 Research on a high precision audio indexing feature combination recognition method
Yang Zhang, Minghui Mi
Author Affiliations +
Proceedings Volume 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023); 127073X (2023) https://doi.org/10.1117/12.2681325
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 2023, Changsha, China
Abstract
Audio type recognition based on audio feature value matters a lot to audio indexing. To attain accurate results, an effective calculation or extraction of audio features is paramount. This paper uses Discrete Wavelet Transform (DWT) on audio signals to obtain the statistical eigenvalues of audio by means of a comprehensive eigenvalue calculation method combining wavelet transform and frequency domain. Additionally, the Support Vector Machine (SVM) Model was used to construct different audio differentiation templates, thus enabling the recognition of various types of audio. The experimental results have revealed that the recognition method of the combined feature and Support Vector Machine model can obtain relatively high accuracy and demonstrate considerable practical application value.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yang Zhang and Minghui Mi "Research on a high precision audio indexing feature combination recognition method", Proc. SPIE 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 127073X (8 June 2023); https://doi.org/10.1117/12.2681325
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Wavelet transforms

Wavelets

Support vector machines

Discrete wavelet transforms

Feature extraction

Back to Top