A Fast Fully Octave Convolutional Neural Network for Document Image Segmentation

The Know Your Customer (KYC) and Anti Money Laundering (AML) are worldwide practices to online customer identification based on personal identification documents, similarity and liveness checking, and proof of address. To answer the basic regulation question: are you whom you say you are? The customer needs to upload valid identification documents (ID). This task imposes some computational challenges since these documents are diverse, may present different and complex backgrounds, some occlusion, partial rotation, poor quality, or damage. Advanced text and document segmentation algorithms were used to process the ID images. In this context, we investigated a method based on U-Net to detect the document edges and text regions in ID images. Besides the promising results on image segmentation, the U-Net based approach is computationally expensive for a real application, since the image segmentation is a customer device task. We propose a model optimization based on Octave Convolutions to qualify the method to situations where storage, processing, and time resources are limited, such as in mobile and robotic applications. We conducted the evaluation experiments in two new datasets CDPhotoDataset and DTDDataset, which are composed of real ID images of Brazilian documents. Our results showed that the proposed models are efficient to document segmentation tasks and portable.

READ FULL TEXT

page 1

page 2

page 4

page 5

research
06/16/2021

ICDAR 2021 Competition on Components Segmentation Task of Document Photos

This paper describes the short-term competition on Components Segmentati...
research
05/03/2022

Attention U-Net for Glaucoma Identification Using Fundus Image Segmentation

Glaucoma is a fatal and worldwide ocular disease that can result in irre...
research
01/16/2023

Post-Train Adaptive U-Net for Image Segmentation

Typical neural network architectures used for image segmentation cannot ...
research
01/29/2020

Comparison of scanned administrative document images

In this work the methods of comparison of digitized copies of administra...
research
03/30/2022

L^3U-net: Low-Latency Lightweight U-net Based Image Segmentation Model for Parallel CNN Processors

In this research, we propose a tiny image segmentation model, L^3U-net, ...
research
05/09/2023

Child Palm-ID: Contactless Palmprint Recognition for Children

Effective distribution of nutritional and healthcare aid for children, p...
research
10/07/2018

A Fast Text Similarity Measure for Large Document Collections using Multi-reference Cosine and Genetic Algorithm

One of the important factors that make a search engine fast and accurate...

Please sign up or login with your details

Forgot password? Click here to reset