KEYWORDS: Digital watermarking, Linear filtering, Digital filtering, Computer programming, Electronic filtering, Neodymium, Analog electronics, Detection and tracking algorithms, Signal to noise ratio, Forward error correction
A speech production procedure can be divided into three parts, namely the glottal source, articulation and
radiation, respectively. We propose a watermarking method for speech by manipulating the articulation in the
process of speech production. We apply our method to CS-ACELP(G.729 standard), which is the ITU-T
approved recommendation. It provides a low bit rate 8 kb/s speech coding algorithm with wire/line quality. The
watermarked vocal tract model is expressed by codebooks made by LSP(Line Spectrum Pair) parameters. The
codebook vectors replace some of the extracted LSP. Speech is synthesized using replaced LSP. We generate a
couple of codebooks using a unique method to modify the LSP of the spectrum envelope. Shortening the width
of the LSPs creates one watermarked codebook, and the second codebook is created by stretching the LSP of
both sides of each formant. There are ten LSP dimensions in each voice frame of the CS-ACELP decoder. In the
detecting process, the weighted Euclidean distance(WED) between the watermarked codebooks and the
extracted LSP will be calculated. Whether the watermark is embedded will be judged by utilizing the calculated
WED. Evaluation tests on detection accuracy will be discussed with simulation results.
KEYWORDS: Digital watermarking, Signal to noise ratio, Wavelets, Quantization, Distance measurement, Databases, Quality testing methods, Internet, Multimedia, Process modeling
A speech production model can be ivided into three parts, namely the glottal source, articulation and radiation, respectively. Some digital watermarking methods for speech that have been proposed are based on modifying quantized values or parameters of a coding scheme. In this paper, we propose a new watermarking method for speech by manipulating the articulaton in the process of speech production. The proposed method is performed by modeling a quasi vocal tract model equivalent to the speech production process. The watermarked vocal tract model is expressed by codebooks made by LSP(Line Spectrum Pair) parameters. The procedure of watermark for speech is as follows; 1) LSPs are extracted from the speech. 2) Some of the extracted LSPs are replaced by the codebook vectors. 3) Speech is synthesized using replaced LSPs. In the process above, watermarks are embedded indirectly into the speech. Evaluation tests on speech quality and accuracy of the proposed method will be discussed with simulation results.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.