Visual Text Correction

01/06/2018
by Amir Mazaheri, et al.

This paper tackles the Text Correction (TC) problem, i.e., finding and replacing an inaccurate word in a sentence. We introduce a novel deep network that detects the inaccuracy in a sentence and selects the most appropriate word to substitute for it. Our pipeline can be trained end-to-end. Moreover, our method leverages visual features and extends simple text correction to Visual Text Correction (VTC). We present a method to fuse visual and textual data for the VTC problem. In our formulation, every word dynamically selects part of a visual feature vector through a gating process. Furthermore, to train and evaluate our model, we propose an approach to automatically construct a large dataset for the VTC problem. Our experiments and performance analysis demonstrate that the proposed method yields the best results and also highlight the challenges in solving the VTC problem. To the best of our knowledge, this work is the first of its kind for the Visual Text Correction task.
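The abstract does not spell out the gating formula, so the following is only a minimal sketch of one plausible reading: each word produces a sigmoid gate that scales the visual feature vector element-wise, and the gated result is concatenated with the word vector. The names `gate_visual_features`, `fuse`, `weights`, and `bias`, the dimensions, and the concatenation step are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random


def sigmoid(x):
    """Logistic function, mapping any real score to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gate_visual_features(word_vec, visual_vec, weights, bias):
    """Scale each visual dimension by a gate computed from one word.

    Hypothetical parameterization: `weights` holds one row per visual
    dimension, and each row has one entry per word dimension.
    """
    gated = []
    for row, b, v in zip(weights, bias, visual_vec):
        score = sum(w * x for w, x in zip(row, word_vec)) + b
        gated.append(sigmoid(score) * v)  # gate in (0, 1) selects part of v
    return gated


def fuse(word_vec, visual_vec, weights, bias):
    """Concatenate the word vector with its word-specific gated visual vector."""
    return list(word_vec) + gate_visual_features(word_vec, visual_vec, weights, bias)


# Toy data: 3 words of dimension 4, one shared visual vector of dimension 6.
rng = random.Random(0)
word_dim, visual_dim = 4, 6
weights = [[rng.uniform(-1, 1) for _ in range(word_dim)] for _ in range(visual_dim)]
bias = [0.0] * visual_dim
visual = [rng.uniform(0, 1) for _ in range(visual_dim)]
sentence = [[rng.uniform(-1, 1) for _ in range(word_dim)] for _ in range(3)]

# Every word gets its own view of the same visual features.
fused = [fuse(word, visual, weights, bias) for word in sentence]
```

In this sketch the visual vector is shared across the sentence, while the gate differs per word, which is what lets each word attend to a different part of the visual features. In a trained model, `weights` and `bias` would be learned jointly with the rest of the network rather than sampled at random.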


