Supervised mid-level features for word image representation

10/20/2014
by   Albert Gordo, et al.
0

This paper addresses the problem of learning word image representations: given the cropped image of a word, we are interested in finding a descriptive, robust, and compact fixed-length representation. Machine learning techniques can then be supplied with these representations to produce models useful for word retrieval or recognition tasks. Although many works have focused on the machine learning aspect once a global representation has been produced, little work has been devoted to the construction of those base image representations: most works use standard coding and aggregation techniques directly on top of standard computer vision features such as SIFT or HOG. We propose to learn local mid-level features suitable for building word image representations. These features are learnt by leveraging character bounding box annotations on a small set of training images. However, contrary to other approaches that use character bounding box information, our approach does not rely on detecting the individual characters explicitly at testing time. Our local mid-level features can then be aggregated to produce a global word image signature. When pairing these features with the recent word attributes framework of Almazán et al., we obtain results comparable with or better than the state-of-the-art on matching and recognition tasks using global descriptors of only 96 dimensions.

READ FULL TEXT
research
10/15/2020

Does Chinese BERT Encode Word Structure?

Contextualized representations give significantly improved results for a...
research
04/05/2016

Deep Image Retrieval: Learning global representations for image search

We propose a novel approach for instance-level image retrieval. It produ...
research
08/07/2023

Keyword Spotting Simplified: A Segmentation-Free Approach using Character Counting and CTC re-scoring

Recent advances in segmentation-free keyword spotting treat this problem...
research
03/19/2019

3DCarRecog: Car Recognition Using 3D Bounding Box

We present a novel learning framework for vehicle recognition from a sin...
research
03/19/2019

Geometry-constrained Car Recognition Using a 3D Perspective Network

We present a novel learning framework for vehicle recognition from a sin...
research
06/11/2019

Weakly-supervised Compositional FeatureAggregation for Few-shot Recognition

Learning from a few examples is a challenging task for machine learning....
research
08/28/2018

All You Need is "Love": Evading Hate-speech Detection

With the spread of social networks and their unfortunate use for hate sp...

Please sign up or login with your details

Forgot password? Click here to reset