Paper published in a journal (Scientific congresses and symposiums)
Speaker-Aware Long Short-Term Memory Multi-Task Learning for Speech Recognition
Pironkov, Gueorgui; Dupont, Stéphane; Dutoit, Thierry
2016
 

Files


Full Text
pironkov2016speakerMTL.pdf
Publisher postprint (420.56 kB)
Request a copy

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Abstract :
[en] In order to address the commonly met issue of overfitting in speech recognition, this article investigates Multi- Task Learning, when the auxiliary task focuses on speaker clas- sification. Overfitting occurs when the amount of training data is limited, leading to an over-sensible acoustic model. Multi-Task Learning is a method, among many other regularization methods, which decreases the overfitting impact by forcing the acoustic model to train jointly for multiple different, but related, tasks. In this paper, we consider speaker classification as an auxiliary task in order to improve the generalization abilities of the acoustic model, by training the model to recognize the speaker, or find the closest one inside the training set. We investigate this Multi- Task Learning setup on the TIMIT database, while the acoustic modeling is performed using a Recurrent Neural Network with Long Short-Term Memory cells.
Disciplines :
Electrical & electronics engineering
Library & information sciences
Author, co-author :
Pironkov, Gueorgui ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dupont, Stéphane  ;  Université de Mons > Faculté Polytechnique > Information, Signal et Intelligence artificielle
Dutoit, Thierry ;  Université de Mons > Faculté Polytechnique > Service Information, Signal et Intelligence artificielle
Language :
English
Title :
Speaker-Aware Long Short-Term Memory Multi-Task Learning for Speech Recognition
Publication date :
31 August 2016
Event name :
European Signal Processing Conference
Event place :
Budapest, Hungary
Event date :
2016
Research unit :
F105 - Information, Signal et Intelligence artificielle
Research institute :
R300 - Institut de Recherche en Technologies de l'Information et Sciences de l'Informatique
R450 - Institut NUMEDIART pour les Technologies des Arts Numériques
Available on ORBi UMONS :
since 05 September 2016

Statistics


Number of views
1 (0 by UMONS)
Number of downloads
0 (0 by UMONS)

Scopus citations®
 
23
Scopus citations®
without self-citations
19

Bibliography


Similar publications



Contact ORBi UMONS