Hayato Futami

research

∙ 09/16/2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation

Collecting audio-text pairs is expensive; however, it is much easier to ...

0 Emiru Tsunoo, et al. ∙

research

∙ 07/24/2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition

Although frame-based models, such as CTC and transducers, have an affini...

0 Emiru Tsunoo, et al. ∙

research

∙ 07/20/2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding

There has been an increased interest in the integration of pretrained sp...

0 Siddhant Arora, et al. ∙

research

∙ 05/02/2023

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

Recently there have been efforts to introduce new benchmark tasks for sp...

0 Siddhant Arora, et al. ∙

research

∙ 05/02/2023

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

This paper describes our system for the low-resource domain adaptation t...

0 Hayato Futami, et al. ∙

research

∙ 05/01/2023

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History

Most human interactions occur in the form of spoken conversations where ...

0 Siddhant Arora, et al. ∙

research

∙ 11/16/2022

Streaming Joint Speech Recognition and Disfluency Detection

Disfluency detection has mainly been solved in a pipeline approach, as p...

0 Hayato Futami, et al. ∙

research

∙ 09/08/2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Connectionist temporal classification (CTC) -based models are attractive...

0 Hayato Futami, et al. ∙

research

∙ 09/05/2022

Distilling the Knowledge of BERT for CTC-based ASR

Connectionist temporal classification (CTC) -based models are attractive...

0 Hayato Futami, et al. ∙

research

∙ 10/05/2021

ASR Rescoring and Confidence Estimation with ELECTRA

In automatic speech recognition (ASR) rescoring, the hypothesis with the...

0 Hayato Futami, et al. ∙

research

∙ 08/09/2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Attention-based sequence-to-sequence (seq2seq) models have achieved prom...

0 Hayato Futami, et al. ∙

Hayato Futami

Featured Co-authors

Sign in with Google

Consider DeepAI Pro