Article (Scientific journals)
pHTS for Max/MSP: A Streaming Architecture for Statistical Parametric Speech Synthesis
Astrinaki, Maria; Babacan, Onur; D'alessandro, Nicolas et al.
2011, in Quarterly Progress Scientific Report of the Numediart Research Program, 4 (1), p. 7-11
 

Files


Full Text
numediart_2011_s13_p2_report.pdf
Author postprint (753.85 kB)

Details



Keywords :
[en] HMM, speech synthesis, statistical parametric speech synthesis, real-time, performative, streaming, HTS, sHTS, pHTS, Max/MSP
Abstract :
[en] In this report, we present a Max/MSP external for real-time speech synthesis. Statistical parametric speech synthesis based on Hidden Markov Models has been demonstrated to be very effective in synthesizing high-quality, natural and expressive speech. This technique also offers high flexibility as a speech production model and a small database footprint. In this work, we modify the existing HTS engine to establish a streaming architecture, called performative-HTS or pHTS. pHTS is implemented as a Max/MSP external, which provides a basis for further research in gesturally controlled speech synthesis. Quantitative evaluations of the system show that the degradation of speech quality in pHTS is small relative to HTS. These results are supported by a subjective evaluation, which confirms that the speech waveforms produced by HTS and pHTS can hardly be distinguished.
Disciplines :
Mathematics
Author, co-author :
Astrinaki, Maria ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Babacan, Onur ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
D'alessandro, Nicolas 
Picart, Benjamin ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dutoit, Thierry ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Language :
English
Title :
pHTS for Max/MSP: A Streaming Architecture for Statistical Parametric Speech Synthesis
Publication date :
01 March 2011
Journal title :
Quarterly Progress Scientific Report of the Numediart Research Program
ISSN :
2032-5398
Publisher :
numediart Institute for Creative Technologies, Mons, Belgium
Volume :
4
Issue :
1
Pages :
7-11
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 07 January 2013
