Paper published in a journal (Scientific congresses and symposiums)
Speech synthesis in various communicative situations: Impact of pronunciation variations
Brognaux, Sandrine; Picart, Benjamin; Drugman, Thomas
2014
 

Files


Full Text
interspeech2014_sbbptd.pdf
Author postprint (252.95 kB)
Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Keywords :
[en] Sports commentaries; [en] Phonetic variations; [en] HMM-based speech synthesis; [en] Communicative situation
Abstract :
[en] While current research in speech synthesis focuses on the generation of various speaking styles or emotions, very few studies have addressed the possibility of including phonetic variations according to the communicative situation of the target speech (sports commentaries, TV news, etc.). However, significant phonetic variations have been observed, depending on various communicative factors (e.g. spontaneous/read and media broadcast or not). This study analyzes whether these alternative pronunciations contribute to the plausibility of the message and should therefore be considered in synthesis. To this end, subjective tests are performed on synthesized French sports commentaries. They aim at comparing HMM-based speech synthesis with genuine pronunciation and with neutral NLP-produced phonetization. Results show that the integration of the phonetic variations significantly improves the perceived naturalness of the generated speech. They also highlight the relative impor tance of the various types of variations and show that schwa elisions, in particular, play a crucial role in that respect.
Disciplines :
Electrical & electronics engineering
Author, co-author :
Brognaux, Sandrine 
Picart, Benjamin ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Drugman, Thomas ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Language :
English
Title :
Speech synthesis in various communicative situations: Impact of pronunciation variations
Publication date :
10 June 2014
Event name :
Interspeech 2014
Event place :
Singapore, Singapore
Event date :
2014
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 05 January 2015

Statistics


Number of views
15 (0 by UMONS)
Number of downloads
0 (0 by UMONS)

Scopus citations®
 
5
Scopus citations®
without self-citations
4

Bibliography


Similar publications



Contact ORBi UMONS