CNN based Extraction of Panels/Characters from Bengali Comic Book Page Images

10/21/2019
by   Arpita Dutta, et al.
0

Peoples nowadays prefer to use digital gadgets like cameras or mobile phones for capturing documents. Automatic extraction of panels/characters from the images of a comic document is challenging due to the wide variety of drawing styles adopted by writers, beneficial for readers to read them on mobile devices at any time and useful for automatic digitization. Most of the methods for localization of panel/character rely on the connected component analysis or page background mask and are applicable only for a limited comic dataset. This work proposes a panel/character localization architecture based on the features of YOLO and CNN for extraction of both panels and characters from comic book images. The method achieved remarkable results on Bengali Comic Book Image dataset (BCBId) consisting of total 4130 images, developed by us as well as on a variety of publicly available comic datasets in other languages, i.e. eBDtheque, Manga 109 and DCM dataset.

READ FULL TEXT

page 5

page 6

research
08/30/2017

Experimental Evaluation of Book Drawing Algorithms

A k-page book drawing of a graph G=(V,E) consists of a linear ordering o...
research
12/11/2022

Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images

Digitization of scanned receipts aims to extract text from receipt image...
research
07/13/2022

A new database of Houma Alliance Book ancient handwritten characters and its baseline algorithm

The Houma Alliance Book is one of the national treasures of the Museum i...
research
01/03/2011

Segmentation of Camera Captured Business Card Images for Mobile Devices

Due to huge deformation in the camera captured images, variety in nature...
research
01/12/2012

Autonomous Cleaning of Corrupted Scanned Documents - A Generative Modeling Approach

We study the task of cleaning scanned text documents that are strongly c...
research
03/18/2020

ScanSSD: Scanning Single Shot Detector for Mathematical Formulas in PDF Document Images

We introduce the Scanning Single Shot Detector (ScanSSD) for locating ma...
research
09/22/2009

A Method for Extraction and Recognition of Isolated License Plate Characters

A method to extract and recognize isolated characters in license plates ...

Please sign up or login with your details

Forgot password? Click here to reset