
Drugman Thomas

Main Referenced Co-authors
Dutoit, Thierry (50)
Dubuisson, Thomas (12)
Picart, Benjamin (12)
Brognaux, Sandrine (7)
Kane, John (7)
Main Referenced Keywords
Expressive Speech (8); Speaking Style Adaptation (7); Speech Synthesis (7); Voice Quality (7); HTS (6)
Main Referenced Unit & Research Centers
BIOSYS - Biosys (12)
CRTI - Centre de Recherche en Technologie de l'Information (2)
Main Referenced Disciplines
Electrical & electronics engineering (50)
Library & information sciences (41)
Languages & linguistics (1)
Computer science (1)
Laboratory medicine & medical technology (1)

Publications (total 83)

The most downloaded
Mahmoudi, S., Da Cunha Possa, P., Ravet, T., Drugman, T., Chessini Bose, R., Dutoit, T., & Valderrama, C. (01 October 2015). Sensor-based Framework for Automatic Cough Detection and Classification. Advances in Intelligent Systems and Computing, 7 (2015).

The most cited

202 citations (Scopus®)

Drugman, T., & Alwan, A. (2011). Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics [Paper presentation]. Interspeech 2011, Firenze, Italy.

Drugman, T. (2022). Advances in Glottal Analysis and its Applications [Doctoral thesis, Université de Mons]. ORBi UMONS-University of Mons.

Brognaux, S., & Drugman, T. (01 January 2016). HMM-based Speech Segmentation: Improvements of Fully Automatic Approaches. IEEE Transactions on Audio, Speech and Language Processing, 24 (1), 5-15.
Peer Reviewed verified by ORBi

Mahmoudi, S., Da Cunha Possa, P., Ravet, T., Drugman, T., Chessini Bose, R., Dutoit, T., & Valderrama, C. (01 October 2015). Sensor-based Framework for Automatic Cough Detection and Classification. Advances in Intelligent Systems and Computing, 7 (2015).

Brognaux, S., Picart, B., & Drugman, T. (2014). Speech synthesis in various communicative situations: Impact of pronunciation variations [Paper presentation]. Interspeech 2014, Singapore, Singapore.

Picart, B., Drugman, T., & Dutoit, T. (2014). Automatic Variation of the Degree of Articulation in New HMM-based Voices. IEEE Journal of Selected Topics in Signal Processing, 307-322. doi:10.1109/JSTSP.2014.2302742
Peer Reviewed verified by ORBi

Drugman, T. (01 February 2014). Maximum Phase Modeling for Sparse Linear Prediction of Speech. IEEE Signal Processing Letters, 21, 185-189.
Peer Reviewed verified by ORBi

Drugman, T., & Dutoit, T. (18 December 2013). Detecting Speech Polarity with High-Order Statistics. Cognitive Computation, 5 (4), 442-447.
Peer Reviewed verified by ORBi

Picart, B., Drugman, T., & Dutoit, T. (17 October 2013). HMM-based speech synthesis with various degrees of articulation: A perceptual study. Neurocomputing, 132, 142-147. doi:10.1016/j.neucom.2012.10.040
Peer Reviewed verified by ORBi

Drugman, T., & Dutoit, T. (2013). Speech Polarity Determination: A Comparative Evaluation. Neurocomputing.
Peer Reviewed verified by ORBi

Raitio, T., Kane, J., Drugman, T., & Gobl, C. (2013). HMM-based synthesis of creaky voice [Paper presentation]. Interspeech, Lyon, France.

Babacan, O., Drugman, T., D'alessandro, N., Henrich, N., & Dutoit, T. (2013). A Quantitative Comparison of Glottal Closure Instant Estimation Algorithms on a Large Variety of Singing Sounds [Paper presentation]. Interspeech, Lyon, France.

Drugman, T., Urbain, J., Bauwens, N., Chessini Bose, R., Valderrama, C., Lebecque, P., & Dutoit, T. (09 September 2013). Objective Study of Sensor Relevance for Automatic Cough Detection. IEEE Transactions on Information Technology in Biomedicine, 17 (3), 699-707.
Peer Reviewed verified by ORBi

Cullen, A., Kane, J., Drugman, T., & Harte, N. (2013). Creaky Voice and the Classification of Affect [Paper presentation]. Workshop on Affective Social Speech Signals, Grenoble, France.

Picart, B., Brognaux, S., & Drugman, T. (2013). HMM-based Speech Synthesis of Live Sports Commentaries: Integration of a Two-Layer Prosody Annotation [Paper presentation]. 8th Speech Synthesis Workshop (SSW8), Barcelona, Spain.

Kane, J., Drugman, T., & Gobl, C. (20 June 2013). Improved Automatic Detection of Creak. Computer Speech and Language, 27 (4), 1028-1047.
Peer Reviewed verified by ORBi

Loweimi, E., Ahadi, S., Drugman, T., & Loveymi, S. (2013). On the Importance of Pre-emphasis and Window Shape in Phase-based Speech Recognition [Paper presentation]. Non-Linear Speech Processing, Mons, Belgium.

Drugman, T., Rijckaert, M., Lawson, G., & Remacle, M. (2013). Analysis and Quantification of Acoustic Artefacts in Tracheoesophageal Speech [Paper presentation]. Non-Linear Speech Processing, Mons, Belgium.

Brognaux, S., Picart, B., & Drugman, T. (2013). A New Prosody Annotation Protocol for Live Sports Commentaries [Paper presentation]. Interspeech 2013, Lyon, France.

Drugman, T., Kane, J., Raitio, T., & Gobl, C. (2013). Prediction of Creaky Voice from Contextual Factors [Paper presentation]. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada.

Loweimi, E., Ahadi, S., & Drugman, T. (2013). A New Phase-based Feature Representation for Robust Speech Recognition [Paper presentation]. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada.

Babacan, O., Drugman, T., D'alessandro, N., Henrich, N., & Dutoit, T. (2013). A comparative study of pitch extraction algorithms on a large variety of singing sounds [Paper presentation]. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada.

Picart, B., Drugman, T., & Dutoit, T. (29 April 2013). Analysis and HMM-based synthesis of hypo and hyperarticulated speech. Computer Speech and Language, 28 (2), 687-707. doi:10.1016/j.csl.2013.04.008
Peer Reviewed verified by ORBi

Drugman, T. (14 April 2013). Residual Excitation Skewness for Automatic Speech Polarity Detection. IEEE Signal Processing Letters, 20 (4), 387-390.
Peer Reviewed verified by ORBi

Brognaux, S., Roekhaut, S., Drugman, T., & Beaufort, R. (2012). Train&Align: A New Online Tool for Automatic Phonetic Alignment [Paper presentation]. IEEE Workshop on Spoken Language Technology (SLT), Miami, United States - Florida.

Brognaux, S., Drugman, T., & Beaufort, R. (2012). Automatic Detection and Correction of Syntax-based Prosody Annotation Errors [Paper presentation]. IEEE Workshop on Spoken Language Technology (SLT), Miami, United States - Florida.

Astrinaki, M., D'alessandro, N., Picart, B., Drugman, T., & Dutoit, T. (2012). Reactive and Continuous Control of HMM-based Speech Synthesis [Paper presentation]. IEEE Workshop on Spoken Language Technology (SLT), Miami, United States - Florida.

Picart, B., Drugman, T., & Dutoit, T. (2012). Statistical Methods for Varying the Degree of Articulation in New HMM-based Voices [Paper presentation]. IEEE Workshop on Spoken Language Technology (SLT), Miami, United States - Florida.

Brognaux, S., Roekhaut, S., Drugman, T., & Beaufort, R. (2012). Automatic Phone Alignment: A Comparison between Speaker-Independent Models and Models Trained on the Corpus to Align [Paper presentation]. 8th International Conference on Natural Language Processing (JapTAL 2012), Kanazawa, Japan.

Drugman, T., Urbain, J., Bauwens, N., Chessini Bose, R., Aubriot, A.-S., Lebecque, P., & Dutoit, T. (2012). Audio and Contact Microphones for Cough Detection [Paper presentation]. 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), Portland, United States - Oregon.

Drugman, T., Kane, J., & Gobl, C. (2012). Modeling the Creaky Excitation for Parametric Speech Synthesis [Paper presentation]. 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), Portland, United States - Oregon.

Drugman, T., Kane, J., & Gobl, C. (2012). Resonator-based creaky voice detection [Paper presentation]. 13th Annual Conference of the International Speech Communication Association (Interspeech 2012), Portland, United States - Oregon.

Bauwens, N., Aubriot, A.-S., Drugman, T., Dutoit, T., Leal, T., & Lebecque, P. (06 June 2012). Assessment of a commercially available cough counter in healthy subjects [Paper presentation]. 35th European Cystic Fibrosis Conference, Dublin, Ireland.

Reboursière, L., Lähdeoja, O., Drugman, T., Dupont, S., Picard-Limpens, C., & Riche, N. (2012). Left and right-hand guitar playing techniques detection [Paper presentation]. 12th Conference on New Interfaces for Musical Expression (NIME'12), Ann Arbor, United States - Michigan.

Lähdeoja, O., Reboursière, L., Drugman, T., Dupont, S., Picard-Limpens, C., & Riche, N. (2012). Detection des Techniques de Jeu de la Guitare [Paper presentation]. Journées d'Informatique Musicale (JIM 2012), Mons, Belgium.

Picart, B., Drugman, T., & Dutoit, T. (2012). Assessing the Intelligibility and Quality of HMM-based Speech Synthesis with a Variable Degree of Articulation [Paper presentation]. The Listening Talker (LISTA) workshop, Edinburgh, United Kingdom.

Drugman, T. (2012). Using the Glottal Source in Voice Technology Applications [Paper presentation]. Workshop on Innovation and Applications in Speech Technology (IAST), Dublin, Ireland.

Kane, J., Drugman, T., & Gobl, C. (2012). Creaky voice - detection and synthesis [Paper presentation]. Workshop on Innovation and Applications in Speech Technology (IAST), Dublin, Ireland.

Drugman, T., & Dutoit, T. (2012). The Deterministic plus Stochastic Model of the Residual Signal and its Applications. IEEE Transactions on Acoustics, Speech, and Signal Processing.
Peer reviewed

Drugman, T., Mark, T., Gudnason, J., Naylor, P., & Dutoit, T. (2012). Detection of Glottal Closure Instants from Speech Signals: a Quantitative Review. IEEE Transactions on Acoustics, Speech, and Signal Processing.
Peer reviewed

Drugman, T., Bozkurt, B., & Dutoit, T. (14 January 2012). A comparative study of glottal source estimation techniques. Computer Speech and Language, 26 (1), 20-34.
Peer Reviewed verified by ORBi

Picart, B., Drugman, T., & Dutoit, T. (07 November 2011). Perceptual Effects of the Degree of Articulation in HMM-based Speech Synthesis. Lecture Notes in Computer Science, 7015, 177-182. doi:10.1007/978-3-642-25020-0_23
Peer reviewed

Drugman, T., & Dutoit, T. (2011). Oscillating Statistical Moments for Speech Polarity Detection [Paper presentation]. Non-Linear Speech Processing International Workshop (NOLISP), Las Palmas, Spain.

Reboursière, L., Lähdeoja, O., Chessini Bose, R., Drugman, T., Dupont, S., Picard-Limpens, C., & Riche, N. (2011). Guitar As Controller. Quarterly Progress Scientific Report of the Numediart Research Program.

Drugman, T., Urbain, J., & Dutoit, T. (01 September 2011). Assessment of Audio Features for Automatic Cough Detection [Poster presentation]. Eusipco 2011, Barcelona, Spain.

Drugman, T., Dubuisson, T., & Dutoit, T. (31 August 2011). On the Use of Glottal Source for Expressive Speech Analysis [Paper presentation]. 9th Pan European Conference (PEVOC 9), Marseille, France.

Dubuisson, T., Drugman, T., & Dutoit, T. (31 August 2011). On the Use of Grey Zones in Automatic Voice Pathology Detection [Paper presentation]. 9th Pan European Conference (PEVOC 9), Marseille, France.

Drugman, T., & Alwan, A. (2011). Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics [Paper presentation]. Interspeech 2011, Firenze, Italy.

Picart, B., Drugman, T., & Dutoit, T. (2011). Continuous Control of the Degree of Articulation in HMM-based Speech Synthesis [Paper presentation]. Interspeech 2011, Firenze, Italy.

Drugman, T., Bozkurt, B., & Dutoit, T. (01 July 2011). Causal-anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation. Speech Communication, 53 (6), 855-866.
Peer Reviewed verified by ORBi

Drugman, T., Dubuisson, T., & Dutoit, T. (2011). Phase-based Information for Voice Pathology Detection [Paper presentation]. ICASSP11.

Drugman, T., Wilfart, G., & Dutoit, T. (25 January 2011). Speech Synthesis and Coding Methods.

Drugman, T., & Dutoit, T. (2010). Chirp Complex Cepstrum-based Decomposition for Asynchronous Glottal Analysis [Paper presentation]. Interspeech 2010, Makuhari, Japan.

Drugman, T., & Dutoit, T. (2010). Glottal-based Analysis of the Lombard Effect [Paper presentation]. Interspeech 2010, Makuhari, Japan.

Drugman, T., & Dutoit, T. (2010). On the Potential of Glottal Signatures for Speaker Recognition [Paper presentation]. Interspeech 2010, Makuhari, Japan.

Picart, B., Drugman, T., & Dutoit, T. (2010). Analysis and Synthesis of Hypo and Hyperarticulated Speech [Paper presentation]. 7th ISCA Tutorial and Research Workshop on Speech Synthesis, Kyoto, Japan.

Drugman, T., & Dutoit, T. (2010). A Comparative Evaluation of Pitch Modification Techniques [Paper presentation]. Interspeech 2010, Makuhari, Japan.

Drugman, T., & Dutoit, T. (2010). Reconnaissance du Locuteur basée sur des Signatures Glottiques [Paper presentation]. 28èmes Journées d'Etude sur la Parole (JEP 2010), Mons, Belgium.

Drugman, T., Bozkurt, B., & Dutoit, T. (2010). Analyse et Modification de la Qualité Vocale basée sur l'Excitation [Paper presentation]. 28èmes Journées d'Etude sur la Parole (JEP 2010), Mons, Belgium.

Dubuisson, T., Drugman, T., & Dutoit, T. (17 March 2010). Détection des pathologies vocales [Poster presentation]. Forum des Industries de la Langue, Louvain-la-Neuve, Belgium.

Drugman, T., Bozkurt, B., & Dutoit, T. (2010). Glottal Source Estimation Using an Automatic Chirp Decomposition. Lecture Notes in Computer Science.
Peer reviewed

Drugman, T. (2010). On the Glottal Flow Estimation and its Usefulness in Speech Processing [Paper presentation]. EuroDocInfo'2010, UVHC, Valenciennes, France.

Dubuisson, T., Drugman, T., & Dutoit, T. (2009). On the mutual information of glottal source estimation techniques for the automatic detection of speech pathologies [Paper presentation]. International workshop on models and analysis of vocal emissions for biomedical applications, Florence, Italy.

Drugman, T., & Dutoit, T. (2009). Glottal closure and opening instant detection from speech signals [Paper presentation]. Interspeech 2009, Brighton, United Kingdom.

Drugman, T., Dubuisson, T., & Dutoit, T. (2009). On the mutual information between source and filter contributions for voice pathology detection [Paper presentation]. Interspeech 2009, Brighton, United Kingdom.

Drugman, T., Wilfart, G., & Dutoit, T. (2009). A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis [Paper presentation]. Interspeech 2009, Brighton, United Kingdom.

Drugman, T., Bozkurt, B., & Dutoit, T. (2009). Complex cepstrum-based decomposition of speech for glottal source estimation [Paper presentation]. Interspeech 2009, Brighton, United Kingdom.

Drugman, T., Wilfart, G., & Dutoit, T. (2009). Eigenresiduals for improved parametric speech synthesis [Paper presentation]. 17th European Signal Processing Conference, Glasgow, United Kingdom.

Drugman, T., Bozkurt, B., & Dutoit, T. (2009). Chirp decomposition of speech signals for glottal source estimation [Paper presentation]. ISCA Workshop on Non-Linear Speech Processing, Vic, Spain.

Drugman, T., Wilfart, G., Moinet, A., & Dutoit, T. (2009). Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis [Paper presentation]. ICASSP 2009 - International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan. doi:10.1109/ICASSP.2009.4960453

Drugman, T., & Dutoit, T. (22 January 2009). Hidden Markov Models-based speech synthesis [Paper presentation]. Rencontre Internationale de doctorants en informatique, Mons, Belgium.

Gurban, M., Drugman, T., Dutoit, T., & Thiran, J.-P. (2008). Dynamic modality weighting for multi-stream HMMs in audio-visual speech recognition [Paper presentation]. IEEE International Conference on Multimodal Interfaces, Chania, Greece.

Drugman, T., Dubuisson, T., Moinet, A., D'alessandro, N., & Dutoit, T. (2008). Voice Source Parameters Estimation by Fitting the Glottal Formant and the Inverse Filtering Open Phase [Paper presentation]. 16th European Signal Processing Conference, Lausanne, Switzerland.

Drugman, T., Dubuisson, T., Moinet, A., D'alessandro, N., & Dutoit, T. (2008). Glottal Source Estimation Robustness [Paper presentation]. Conference on Signal Processing and Multimedia Applications, Porto, Portugal.

Couvreur, L., Bettens, F., Drugman, T., Dubuisson, T., Dupont, S., Frisson, C., Jottrand, M., & Mancas, M. (01 June 2008). Project # 2.3 : audio thumbnailing. Quarterly Progress Scientific Report of the Numediart Research Program, 1 (2), 67-85.

Drugman, T., Moinet, A., & Dutoit, T. (2008). On the Use of Machine Learning in Statistical Parametric Speech Synthesis [Paper presentation]. Benelearn 2008 - Annual Machine Learning Conference, Spa, Belgium.

D'alessandro, N., Drugman, T., & Dubuisson, T. (01 March 2008). Project # 3 : transvoice table. Quarterly Progress Scientific Report of the Numediart Research Program, 1 (1).

Couvreur, L., Bettens, F., Drugman, T., Frisson, C., Jottrand, M., Mancas, M., & Moinet, A. (01 March 2008). Project # 1.1 : audio skimming. Quarterly Progress Scientific Report of the Numediart Research Program, 1 (1), 1-16.

Drugman, T., Gurban, M., & Thiran, J.-P. (01 October 2007). Relevant Feature Selection for Audio-Visual Speech Recognition [Paper presentation]. 9th International Workshop on Multimedia Signal Processing, Chania, Crete, Greece.

Thiran, J.-P., Valles, A., Drugman, T., & Gurban, M. (2007). Définition et sélection d'attributs visuels pour la reconnaissance audio-visuelle de la parole [Paper presentation]. 5è atelier de travail sur le Traitement et l'Analyse de l'Information : Méthodes et Applications, Hammamet, Tunisia.

Drugman, T., Gurban, M., & Valles, A. (22 May 2007). Définition et sélection d'attributs visuels pour la reconnaissance audio-visuelle de la parole [Paper presentation]. 5è atelier de travail sur le Traitement et l'Analyse de l'Information : Méthodes et Applications, Hammamet, Tunisia.
