CRTI - Centre de Recherche en Technologie de l'Information
Disciplines :
Library & information sciences
Author, co-author :
Urbain, Jérôme ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Cakmak, Huseyin ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dutoit, Thierry ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Language :
English
Title :
Automatic Phonetic Transcription of Laughter and its Application to Laughter Synthesis
Publication date :
03 September 2013
Event name :
Fifth biennial Humaine Association Conference on Affective Computing and Intelligent Interaction
Event place :
Geneva, Switzerland
Event date :
2013
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Commentary :
This publication received the best student paper award