Paper
13 November 2003 Estimating speaker scale factors from vowels
Author Affiliations +
Abstract
In previous works, Umesh et al, demonstrated that phonetically similar vowels spoken by different individuals are related by a simple translation in a universal warped spectral representation. They experimentally derived this function and called it the “speech-scale”. We present further experimental evidence, based on a large data set, validating the speech-scale. We also estimate speaker-specific scale factors based on the speech-scale, and we present a vowel classification experiment, which demonstrates a significant performance improvement through a normalization based on the speech-scale. The results we present are based on formant estimates of vowels in a Western Michigan vowel database.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Douglas J. Nelson, David C. Smith, Srinivasan Umesh, and Leon Cohen "Estimating speaker scale factors from vowels", Proc. SPIE 5207, Wavelets: Applications in Signal and Image Processing X, (13 November 2003); https://doi.org/10.1117/12.507416
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Factor analysis

Databases

Composites

Ear

Defense and security

Defense technologies

Fourier transforms

Back to Top