Paper
11 October 2023
Research and implementation of key technologies of simultaneous interpreting speech translation
Wenyi Wang, Ting Yang, Shanshan Li, Bo Chen
Proceedings Volume 12800, Sixth International Conference on Computer Information Science and Application Technology (CISAT 2023); 128004F (2023) https://doi.org/10.1117/12.3003978
Event: 6th International Conference on Computer Information Science and Application Technology (CISAT 2023), 2023, Hangzhou, China
Abstract
Simultaneous interpretation speech translation is an emerging technology that has been widely used in many fields. It combines Automatic Speech Recognition (ASR), Machine Translation (MT), and Natural Language Processing (NLP). In this paper, we discuss the key technologies related to simultaneous interpretation speech translation, including ASR, MT, and NLP, together with their application scenarios. We also explore the challenges and opportunities of this technology, as well as existing solutions. Finally, we propose a position-based attention mechanism that models the interaction between two words in terms of their positions. It can improve self-attention networks by better integrating sequential relations, which is essential for modeling natural languages.
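The abstract does not specify how the position-based attention is computed. As an illustration only, one common way to make attention depend on the positions of two words is to add a bias, indexed by their relative offset, to the scaled dot-product score before the softmax. The Python sketch below assumes that formulation. The function name `position_biased_attention`, the `rel_bias` table, and the clipping distance `max_dist` are hypothetical and are not taken from the paper.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def position_biased_attention(q, k, v, rel_bias, max_dist):
    """Self-attention with an additive relative-position bias (assumed form).

    q, k, v  : lists of n vectors (lists of floats)
    rel_bias : list of 2 * max_dist + 1 scalars; entry (j - i) + max_dist
               is the bias for key position j seen from query position i
    max_dist : offsets beyond +/- max_dist are clipped to the boundary
    """
    n, d = len(q), len(q[0])
    weights = []
    for i in range(n):
        scores = []
        for j in range(n):
            # Content term: scaled dot product between query i and key j.
            content = sum(a * b for a, b in zip(q[i], k[j])) / math.sqrt(d)
            # Position term: depends only on the clipped offset j - i.
            offset = max(-max_dist, min(max_dist, j - i))
            scores.append(content + rel_bias[offset + max_dist])
        weights.append(softmax(scores))
    # Each output vector is the attention-weighted sum of the value vectors.
    dv = len(v[0])
    out = [
        [sum(weights[i][j] * v[j][c] for j in range(n)) for c in range(dv)]
        for i in range(n)
    ]
    return out, weights


# Toy input: three identical token vectors, so the content term is the same
# for every pair and any difference in the weights comes from position alone.
x = [[1.0, 0.0] for _ in range(3)]
rel_bias = [-1.0, 0.0, 2.0, 0.0, -1.0]  # offsets -2, -1, 0, +1, +2
out, w = position_biased_attention(x, x, x, rel_bias, max_dist=2)
```

With identical token vectors, plain dot-product attention would weight all positions equally. Here the weights differ by offset, which is the sense in which such a bias injects sequential information into self-attention.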
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Wenyi Wang, Ting Yang, Shanshan Li, and Bo Chen "Research and implementation of key technologies of simultaneous interpreting speech translation", Proc. SPIE 12800, Sixth International Conference on Computer Information Science and Application Technology (CISAT 2023), 128004F (11 October 2023); https://doi.org/10.1117/12.3003978
KEYWORDS
Speech recognition

Modeling

Telecommunications

Speaker recognition

Acoustics

Education and training

Mathematical optimization
