21 December 2023 Automatic speech recognition with efficient transformer
Shuhan Luo
Proceedings Volume 12970, Fourth International Conference on Signal Processing and Computer Science (SPCS 2023); 129702U (2023) https://doi.org/10.1117/12.3012507
Event: Fourth International Conference on Signal Processing and Computer Science (SPCS 2023), 2023, Guilin, China
Abstract
Automatic Speech Recognition (ASR) is an important technology in modern society, since it acts as a great tool for humans to communicate with computers, visually impaired people, and deaf people. However, existing speech recognition methods still face many problems: some require a large model with an excessive number of parameters, while others cannot achieve reliable accuracy. Therefore, our study utilizes an Efficient Transformer Model with Convolutional Network to perform an ASR task. Our model significantly improves the accuracy of speech recognition without becoming a huge model with a large number of parameters.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Shuhan Luo "Automatic speech recognition with efficient transformer", Proc. SPIE 12970, Fourth International Conference on Signal Processing and Computer Science (SPCS 2023), 129702U (21 December 2023); https://doi.org/10.1117/12.3012507