Knowledge Distillation for Improved Accuracy in Spoken Question Answering

10/21/2020
by Chenyu You, et al.

Spoken question answering (SQA) is a challenging task that requires a machine to fully understand complex spoken documents. Automatic speech recognition (ASR) plays a significant role in the development of QA systems. However, recent work shows that ASR systems generate highly noisy transcripts, which critically limits machine comprehension on the SQA task. To address this issue, we present a novel distillation framework. Specifically, we devise a training strategy that performs knowledge distillation (KD) from spoken documents and their written counterparts. Our work takes a step towards distilling knowledge from the language model as a supervision signal, improving student accuracy by reducing the misalignment between automatic and manual transcriptions. Experiments demonstrate that our approach outperforms several state-of-the-art language models on the Spoken-SQuAD dataset.
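
The abstract does not state the training objective, so the following is only a minimal sketch of how a distillation signal of this kind is commonly built. It assumes the widely used temperature-softened formulation of knowledge distillation, which may differ from the authors' actual loss. The names `kd_loss`, `temperature`, and `alpha` are hypothetical, and the logits stand in for answer scores from a teacher trained on manual transcriptions and a student trained on ASR transcriptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kd_loss(student_logits, teacher_logits, gold_index, temperature=2.0, alpha=0.5):
    """Illustrative distillation loss (assumed form, not taken from the paper).

    Combines a hard-label cross-entropy term with a KL term that pulls the
    student's softened distribution towards the teacher's.
    """
    # Hard-label term: cross-entropy of the student against the gold answer.
    student_probs = softmax(student_logits)
    cross_entropy = -math.log(student_probs[gold_index])

    # Soft-label term: KL(teacher || student) at the chosen temperature.
    teacher_soft = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    kl = sum(
        t * (math.log(t) - math.log(s))
        for t, s in zip(teacher_soft, student_soft)
        if t > 0.0
    )

    # The temperature**2 factor keeps the soft term's gradient scale comparable
    # to the hard term's, as is conventional for this formulation.
    return alpha * cross_entropy + (1.0 - alpha) * temperature ** 2 * kl


if __name__ == "__main__":
    # Hypothetical scores over three candidate answers.
    teacher = [4.0, 1.0, 0.5]   # teacher saw the clean, manual transcript
    student = [2.0, 1.8, 0.3]   # student saw the noisy ASR transcript
    print(kd_loss(student, teacher, gold_index=0))
```

In this sketch `alpha` trades off the gold-label term against the teacher term: with `alpha=1.0` the student is trained on gold labels alone, and with `alpha=0.0` it only imitates the teacher.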

research
10/21/2020

Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering

Spoken conversational question answering (SCQA) requires machines to mod...
research
10/18/2020

Towards Data Distillation for End-to-end Spoken Conversational Question Answering

In spoken question answering, QA systems are designed to answer question...
research
04/16/2019

Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation

Spoken question answering (SQA) is challenging due to complex reasoning ...
research
07/20/2021

Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken Document Segmentation

Transcripts generated by automatic speech recognition (ASR) systems for ...
research
09/26/2022

On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering

Interacting with a speech interface to query a Question Answering (QA) s...
research
07/09/2021

An Initial Investigation of Non-Native Spoken Question-Answering

Text-based machine comprehension (MC) systems have a wide-range of appli...
research
08/20/2023

LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework

While Large Language Models (LLMs) have demonstrated commendable perform...
