Paper published in a journal (Scientific congresses and symposiums)
Noise and Speech Estimation As Auxiliary Tasks for Robust Speech Recognition
Pironkov, Gueorgui; Dupont, Stéphane; Wood, S. U. N. et al.
2017
 

Files


Full Text
Pironkov2017noiseEstimation.pdf
Publisher postprint (277.28 kB)


Details



Abstract :
[en] Dealing with noise deteriorating the speech is still a major problem for automatic speech recognition. An interesting approach to tackle this problem consists of using multi-task learning. In this case, an efficient auxiliary task is clean-speech generation. This auxiliary task is trained in addition to the main speech recognition task and its goal is to help improve the results of the main task. In this paper, we investigate this idea further by generating features extracted directly from the audio file containing only the noise, instead of the clean-speech. After demonstrating that an improvement can be obtained through this multi-task learning auxiliary task, we also show that using both noise and clean-speech estimation auxiliary tasks leads to a 4% relative word error rate improvement in comparison to the classic single-task learning on the CHiME4 dataset.
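
The setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the loss functions or task weights, so the cross-entropy main loss, the mean-squared-error auxiliary losses, the weight values, and all function names below are assumptions chosen for illustration. The last function shows how a relative word error rate improvement such as the reported 4% is computed; the WER values used in the usage note are hypothetical, not results from the paper.

```python
import math


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the correct class for one frame (assumed main loss)."""
    return -math.log(probs[target_index])


def mean_squared_error(predicted, reference):
    """Mean squared error between two equal-length feature vectors (assumed auxiliary loss)."""
    return sum((p - r) ** 2 for p, r in zip(predicted, reference)) / len(reference)


def multitask_loss(main_probs, main_target,
                   noise_pred, noise_ref,
                   speech_pred, speech_ref,
                   noise_weight=0.1, speech_weight=0.1):
    """Main recognition loss plus weighted noise and clean-speech estimation losses.

    The weights are hypothetical; the abstract does not give their values.
    """
    main = cross_entropy(main_probs, main_target)
    aux_noise = mean_squared_error(noise_pred, noise_ref)
    aux_speech = mean_squared_error(speech_pred, speech_ref)
    return main + noise_weight * aux_noise + speech_weight * aux_speech


def relative_wer_improvement(baseline_wer, new_wer):
    """Relative WER improvement in percent: 100 * (baseline - new) / baseline."""
    return 100.0 * (baseline_wer - new_wer) / baseline_wer
```

For example, with hypothetical WERs, `relative_wer_improvement(25.0, 24.0)` gives a 4% relative improvement, which corresponds to a 1-point absolute reduction. The auxiliary outputs are only used during training; at recognition time only the main task output is needed.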
Disciplines :
Electrical & electronics engineering
Library & information sciences
Author, co-author :
Pironkov, Gueorgui ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dupont, Stéphane ; Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Wood, S. U. N.
Dutoit, Thierry ; Université de Mons > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
Language :
English
Title :
Noise and Speech Estimation As Auxiliary Tasks for Robust Speech Recognition
Publication date :
23 October 2017
Event name :
International Conference on Statistical Language and Speech Processing
Event place :
Le Mans, France
Event date :
2017
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 28 September 2017

Statistics


Number of views
1 (0 by UMONS)
Number of downloads
0 (0 by UMONS)

Scopus citations®
1
Scopus citations® without self-citations
0
