ODSQA: Open-domain Spoken Question Answering Dataset

08/07/2018
by Chia-Hsuan Lee, et al.

Machine reading comprehension has been widely studied, but machine comprehension of spoken content remains less investigated. In this paper, we release the Open-Domain Spoken Question Answering Dataset (ODSQA), which contains more than three thousand questions. To the best of our knowledge, this is the largest real spoken question answering (SQA) dataset. On this dataset, we found that automatic speech recognition (ASR) errors have a catastrophic impact on SQA. To mitigate the effect of ASR errors, we incorporate subword units, which bring consistent improvements across all the models. We further found that data augmentation on text-based QA training examples can improve SQA.

research
04/01/2018

Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension

Reading comprehension has been widely studied. One of the most represent...
research
09/26/2022

On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering

Interacting with a speech interface to query a Question Answering (QA) s...
research
04/16/2019

Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation

Spoken question answering (SQA) is challenging due to complex reasoning ...
research
07/09/2021

An Initial Investigation of Non-Native Spoken Question-Answering

Text-based machine comprehension (MC) systems have a wide-range of appli...
research
04/21/2020

Logic-Guided Data Augmentation and Regularization for Consistent Question Answering

Many natural language questions require qualitative, quantitative or log...
research
03/09/2022

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

Spoken Question Answering (SQA) is to find the answer from a spoken docu...
research
08/20/2023

LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework

While Large Language Models (LLMs) have demonstrated commendable perform...
