Automatic speech segmentation based on alignment with a text-to-speech system
Tvůrce(i)
Horák, Petr (URE-Y)
Vyd. údaje
Chichester: J.Wiley, 2002
ISBN
0-471-49985-4
Zdroj.dok.
Improvements in Speech Synthesis / Keller E. ; Bailly G. ; Monaghan A. ; Terken J. ; Huckvale M.
Rozsah stran
s. 328-338
Poč.str.
11 s.
Jazyk dok.
eng - angličtina
Země vyd.
GB - Velká Británie
Klíč. slova
speech processing ; speech synthesis
Vědní obor RIV
BD - Teorie informace
CEP
GV102/96/K087 GA ČR - Grantová agentura ČR
OC 258.10 GA MŠMT - Ministerstvo školství, mládeže a tělovýchovy
CEZ
AV0Z2067918 - URE-Y
Anotace
Automatic phonetic speech segmentation, or the alignment of a known phonetic transcription to a speech signal, is an important tool for many fields of speech reasearch. Most systems for automatic segmentation are based on a trained recognition system. Such systems are typically trained on hidden Markov models of phoneme realizations. An alternative strategy is to use a text-to-speech system to generate a prototype realization of the transcription and to align the synthetic signal with the real one.