R-PHOC: Segmentation-Free Word Spotting using CNN

07/05/2017
by   Suman Ghosh, et al.
0

This paper proposes a region based convolutional neural network for segmentation-free word spotting. Our net- work takes as input an image and a set of word candidate bound- ing boxes and embeds all bounding boxes into an embedding space, where word spotting can be casted as a simple nearest neighbour search between the query representation and each of the candidate bounding boxes. We make use of PHOC embedding as it has previously achieved significant success in segmentation- based word spotting. Word candidates are generated using a simple procedure based on grouping connected components using some spatial constraints. Experiments show that R-PHOC which operates on images directly can improve the current state-of- the-art in the standard GW dataset and performs as good as PHOCNET in some cases designed for segmentation based word spotting.

READ FULL TEXT
research
10/19/2021

1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021

We describe our two-stage instance segmentation framework we use to comp...
research
11/15/2020

BanglaWriting: A multi-purpose offline Bangla handwriting dataset

This article presents a Bangla handwriting dataset named BanglaWriting t...
research
02/04/2019

Realistic Image Generation using Region-phrase Attention

The Generative Adversarial Network (GAN) has recently been applied to ge...
research
08/05/2018

A Multi-task Framework for Skin Lesion Detection and Segmentation

Early detection and segmentation of skin lesions is crucial for timely d...
research
04/10/2023

Split, Merge, and Refine: Fitting Tight Bounding Boxes via Learned Over-Segmentation and Iterative Search

We present a novel framework for finding a set of tight bounding boxes o...
research
04/21/2020

A CNN Framenwork Based on Line Annotations for Detecting Nematodes in Microscopic Images

Plant parasitic nematodes cause damage to crop plants on a global scale....
research
10/21/2022

TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer

Sanskrit Word Segmentation (SWS) is essential in making digitized texts ...

Please sign up or login with your details

Forgot password? Click here to reset