Gakuto Kurata

research

∙ 09/07/2023

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Transferring the knowledge of large language models (LLMs) is a promisin...

Takuma Udagawa, et al.

research

∙ 04/01/2022

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Large-scale language models (LLMs) such as GPT-2, BERT and RoBERTa have ...

Takuma Udagawa, et al.

research

∙ 03/29/2022

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

We introduce two techniques, length perturbation and n-best based label ...

Xiaodong Cui, et al.

research

∙ 12/16/2021

Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data

This paper describes a novel knowledge distillation framework that lever...

Tohru Nagano, et al.

research

∙ 04/08/2021

RNN Transducer Models For Spoken Language Understanding

We present a comprehensive study on building and adapting RNN transducer...

Samuel Thomas, et al.

research

∙ 09/30/2020

End-to-End Spoken Language Understanding Without Full Transcripts

An essential component of spoken language understanding (SLU) is slot fi...

Hong-Kwang J. Kuo, et al.

research

∙ 04/30/2019

English Broadcast News Speech Recognition by Humans and Machines

With recent advances in deep learning, considerable attention has been g...

Samuel Thomas, et al.

research

∙ 04/17/2019

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

Conventional automatic speech recognition (ASR) systems trained from fra...

Gakuto Kurata, et al.

research

∙ 09/19/2017

Language Modeling with Highway LSTM

Language models (LMs) based on Long Short Term Memory (LSTM) have shown ...

Gakuto Kurata, et al.

research

∙ 03/06/2017

English Conversational Telephone Speech Recognition by Humans and Machines

One of the most difficult speech recognition tasks is accurate recogniti...

George Saon, et al.

research

∙ 01/07/2016

Leveraging Sentence-level Information with Encoder LSTM for Semantic Slot Filling

Recurrent Neural Network (RNN) and one of its specific architectures, Lo...

Gakuto Kurata, et al.
