Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine

08/23/2016
by Bo-Hsiang Tseng, et al.

Multimedia or spoken content presents more attractive information than plain text, but it is harder to display on a screen and harder for a user to select. As a result, accessing large collections of spoken content is much more difficult and time-consuming for humans than accessing text. It is therefore highly attractive to develop a machine that can automatically understand spoken content and summarize the key information for humans to browse. To this end, we propose a new task: machine comprehension of spoken content. We define the initial goal as the TOEFL listening comprehension test, a challenging academic English examination for learners whose native language is not English. We further propose an Attention-based Multi-hop Recurrent Neural Network (AMRNN) architecture for this task, which achieves encouraging results in initial tests. Initial results also show that word-level attention is probably more robust than sentence-level attention for this task in the presence of ASR errors.

