Tiny CNN for feature point description for document analysis: approach and dataset

09/09/2021
by   A. Sheshkus, et al.
0

In this paper, we study the problem of feature points description in the context of document analysis and template matching. Our study shows that the specific training data is required for the task especially if we are to train a lightweight neural network that will be usable on devices with limited computational resources. In this paper, we construct and provide a dataset with a method of training patches retrieval. We prove the effectiveness of this data by training a lightweight neural network and show how it performs in both documents and general patches matching. The training was done on the provided dataset in comparison with HPatches training dataset and for the testing we use HPatches testing framework and two publicly available datasets with various documents pictured on complex backgrounds: MIDV-500 and MIDV-2019.

READ FULL TEXT

page 2

page 4

page 5

research
07/16/2018

MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

A lot of research has been devoted to identity documents analysis and re...
research
10/22/2019

One-Shot Template Matching for Automatic Document Data Capture

In this paper, we propose a novel one-shot template-matching algorithm t...
research
01/29/2020

Comparison of scanned administrative document images

In this work the methods of comparison of digitized copies of administra...
research
06/15/2023

Document Entity Retrieval with Massive and Noisy Pre-training

Visually-Rich Document Entity Retrieval (VDER) is a type of machine lear...
research
01/08/2022

Coherence-Based Distributed Document Representation Learning for Scientific Documents

Distributed document representation is one of the basic problems in natu...
research
07/03/2022

Training Patch Analysis and Mining Skills for Image Restoration Deep Neural Networks

There have been numerous image restoration methods based on deep convolu...
research
05/23/2022

Domain Adaptation for Memory-Efficient Dense Retrieval

Dense retrievers encode documents into fixed dimensional embeddings. How...

Please sign up or login with your details

Forgot password? Click here to reset