pHTS for Max/MSP: A Stream­ing Archi­tec­ture for Sta­tis­ti­cal Para­met­ric Speech Syn­the­sis

Astrinaki, Maria; Babacan, Onur; D'alessandro, Nicolas; Picart, Benjamin; Dutoit, Thierry

Request a copy

Article (Scientific journals)

pHTS for Max/MSP: A Streaming Architecture for Statistical Parametric Speech Synthesis

Astrinaki, Maria; Babacan, Onur; D'alessandro, Nicolas et al.

2011 • In Quarterly Progress Scientific Report of the Numediart Research Program, 4 (1), p. 7-11

Permalink
https://hdl.handle.net/20.500.12907/41445

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

numediart_2011_s13_p2_report.pdf

Author postprint (753.85 kB)

Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

[en] HMM, speech synthesis, statistical parametric speech synthesis, real-time, performative, streaming, HTS, sHTS, pHTS, Max/MSP

Abstract :

[en] In this report, we present a Max/MSP external for real-time speech synthesis. Statistical parametric speech synthesis, based on Hid- den Markov Models has been demonstrated to be very effective in synthesizing high-quality, natural and expressive speech. This technique is also able to provide high flexibility as a speech production model and a small database footprint. In this work, we modify the existing HTS engine in order to establish a streaming architecture, called performative-HTS or pHTS. pHTS is implemented as a Max/MSP external which provides a basis for further research in gesturally-controlled speech synthesis. Quantitative evaluations of the system show that the degradation of speech quality in pHTS is small with reference to HTS. These results are supported by a subjective evaluation, which confirms that HTS and pHTS resulting speech waveforms can hardly be distinguished.

Disciplines :

Mathematics

Author, co-author :

Astrinaki, Maria ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle

Babacan, Onur ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle

D'alessandro, Nicolas

Picart, Benjamin ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle

Dutoit, Thierry ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle

Language :

English

Title :

pHTS for Max/MSP: A Streaming Architecture for Statistical Parametric Speech Synthesis

Publication date :

01 March 2011

Journal title :

Quarterly Progress Scientific Report of the Numediart Research Program

ISSN :

2032-5398

Publisher :

numediart Institute for Creative Technologies, Mons, Belgium

Volume :

Issue :

Pages :

7-11

Research unit :

F105 - Information, Signal et Intelligence artificielle

Research institute :

R450 - Institut NUMEDIART pour les Technologies des Arts Numériques

Available on ORBi UMONS :

since 07 January 2013

Statistics

Number of views

66 (0 by UMONS)

Number of downloads

0 (0 by UMONS)

More statistics