- -
UPV
 
29/01/20
Notícia
Charla invitada: A Story of Two Ideas in Automatic Speech Recognition: One Elegant, and One Very Useful!

Charla impartida por el profesor Sanjeev Khudanpur, Center for Language and Speech Processing, Johns Hopkins University, US.

3 de febrero, 12:00, Sala de juntas del DSIC (1F)


Abstract:  The Kaldi tools for automatic speech recognition (ASR) are being widely used both for research and for industry-scale deployments.  Many innovations keep Kaldi up-to-date in this fact-moving field.  This presentation will briefly overview the history of ASR and Kaldi, then describe how two recent innovations grew from germination to implementation and evaluation.  One is to use adversarial examples to improve training of deep neural network (DNN) based acoustic models.  The other is GPU acceleration of the inference engine (the so-called Viterbi decoder).  They illustrate two different metrics of research success.  The adversarial training solution turns out to be very elegant: it can be viewed as either correcting a biased estimate of the true gradient in SGD, or as a form of classical leave-one-out estimation.  The GPU acceleration solution turns out to be of immense practical significance, and demonstrates a successful academic-industry collaboration in applied areas of Computer Science.

Bio: https://www.clsp.jhu.edu/faculty-pages/sanjeev


EMAS upv