Article (Scientific journals)
HMM-based speech synthesis with various degrees of articulation: A perceptual study
Picart, Benjamin; Drugman, Thomas; Dutoit, Thierry
2013, in Neurocomputing, Volume 132, p. 142 - 147
Peer Reviewed verified by ORBi
 

Files


Full Text
neurocomputing_nolisp11_bptdtd.pdf
Author postprint (367.78 kB)
Details



Keywords :
Expressive Speech; Voice Quality; Speech Synthesis; Perceptual Effects; Speaking Style Adaptation
Abstract :
HMM-based speech synthesis is very convenient for creating a synthesizer whose speaker characteristics and speaking styles can be easily modified. This can be obtained by adapting a source speaker's model to a target speaker's model, using intra-speaker voice adaptation techniques. In this paper, we focus on high-quality HMM-based speech synthesis integrating various degrees of articulation, and more specifically on the internal mechanisms leading to the perception of the degrees of articulation by listeners. Therefore, the process of adapting a neutral speech synthesizer to generate hypo- and hyperarticulated speech is broken down into four factors: cepstrum adaptation, prosody adaptation, phonetic transcription adaptation, and the complete adaptation. The impact of these factors on the perceived degree of articulation is studied. Moreover, this study is complemented with an Absolute Category Rating (ACR) evaluation, allowing the subjective assessment of hypo- and hyperarticulated speech through various dimensions: comprehension, non-monotony, fluidity and pronunciation. This paper quantifies the importance of prosody and cepstrum adaptation, as well as of using a Natural Language Processor able to generate realistic hypo- and hyperarticulated phonetic transcriptions.
Disciplines :
Electrical & electronics engineering
Author, co-author :
Picart, Benjamin ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Drugman, Thomas ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dutoit, Thierry ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Language :
English
Title :
HMM-based speech synthesis with various degrees of articulation: A perceptual study
Publication date :
17 October 2013
Journal title :
Neurocomputing
ISSN :
0925-2312
Publisher :
Elsevier, Netherlands
Volume :
132
Pages :
142 - 147
Peer reviewed :
Peer Reviewed verified by ORBi
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 22 January 2014
