A Post-Processing Based Bengali Document Layout Analysis with YOLOV8

09/02/2023
by   Nazmus Sakib Ahmed, et al.
0

This paper focuses on enhancing Bengali Document Layout Analysis (DLA) using the YOLOv8 model and innovative post-processing techniques. We tackle challenges unique to the complex Bengali script by employing data augmentation for model robustness. After meticulous validation set evaluation, we fine-tune our approach on the complete dataset, leading to a two-stage prediction strategy for accurate element segmentation. Our ensemble model, combined with post-processing, outperforms individual base architectures, addressing issues identified in the BaDLAD dataset. By leveraging this approach, we aim to advance Bengali document analysis, contributing to improved OCR and document comprehension and BaDLAD serves as a foundational resource for this endeavor, aiding future research in the field. Furthermore, our experiments provided key insights to incorporate new strategies into the established solution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2022

DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts

Digitization of newspapers is of interest for many reasons including pre...
research
12/03/2021

The Influence of Data Pre-processing and Post-processing on Long Document Summarization

Long document summarization is an important and hard task in the field o...
research
09/24/2020

Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning

The system we used for Task 6 (Automated Audio Captioning)of the Detecti...
research
08/28/2023

Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation

In this research paper, we introduce a novel approach designed for the p...
research
04/27/2018

dhSegment: A generic deep-learning approach for document segmentation

In recent years there have been multiple successful attempts tackling do...
research
12/16/2011

Ensemble Models with Trees and Rules

In this article, we have proposed several approaches for post processing...

Please sign up or login with your details

Forgot password? Click here to reset