Improving Audio Caption Fluency with Automatic Error Correction

06/16/2023
by   Hanxue Zhang, et al.
0

Automated audio captioning (AAC) is an important cross-modality translation task, aiming at generating descriptions for audio clips. However, captions generated by previous AAC models have faced “false-repetition” errors due to the training objective. In such scenarios, we propose a new task of AAC error correction and hope to reduce such errors by post-processing AAC outputs. To tackle this problem, we use observation-based rules to corrupt captions without errors, for pseudo grammatically-erroneous sentence generation. One pair of corrupted and clean sentences can thus be used for training. We train a neural network-based model on the synthetic error dataset and apply the model to correct real errors in AAC outputs. Results on two benchmark datasets indicate that our approach significantly improves fluency while maintaining semantic information.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2021

Improving Translation Robustness with Visual Cues and Error Correction

Neural Machine Translation models are brittle to input noise. Current ro...
research
08/20/2022

Judge a Sentence by Its Content to Generate Grammatical Errors

Data sparsity is a well-known problem for grammatical error correction (...
research
06/04/2022

Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning

In this paper, we propose an algorithm, Epochal Difficult Captions, to s...
research
04/16/2021

Comparison of Grammatical Error Correction Using Back-Translation Models

Grammatical error correction (GEC) suffers from a lack of sufficient par...
research
11/01/2021

VSEC: Transformer-based Model for Vietnamese Spelling Correction

Spelling error correction is one of topics which have a long history in ...
research
03/16/2023

Visual Information Matters for ASR Error Correction

Aiming to improve the Automatic Speech Recognition (ASR) outputs with a ...
research
06/06/2021

Do Grammatical Error Correction Models Realize Grammatical Generalization?

There has been an increased interest in data generation approaches to gr...

Please sign up or login with your details

Forgot password? Click here to reset