Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

10/19/2017
by Michele Alberti, et al.

In this paper, we present a novel approach to layer-wise weight initialization of deep neural networks using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with random values, with greedy layer-wise pre-training (usually as a Deep Belief Network or as an auto-encoder), or by re-using the layers of another network (transfer learning). Hence, either many training epochs are needed before meaningful weights are learned, or a rather similar dataset is required to seed the fine-tuning in transfer learning. In this paper, we describe how to turn an LDA into either a neural layer or a classification layer. We analyze the initialization technique on historical documents. First, we show that an LDA-based initialization is quick and leads to a very stable initialization. Furthermore, for the task of layout analysis at pixel level, we investigate the effectiveness of LDA-based initialization and show that it outperforms state-of-the-art random weight initialization methods.
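To make the idea of turning an LDA into a neural layer more concrete, the following is a minimal sketch in NumPy and not the authors' implementation. It assumes the layer is a plain linear map z = X W + b, that the columns of W are the leading discriminant directions of a textbook Fisher LDA, and that the bias centers the projected data. The function name `lda_layer_init`, the regularization constant `eps`, and the toy data are hypothetical choices made for illustration only.

```python
import numpy as np

def lda_layer_init(X, y, n_out, eps=1e-6):
    """Sketch: derive weights W and bias b of a linear layer from Fisher LDA.

    X has shape (n_samples, n_features), y holds integer class labels.
    Returns W of shape (n_features, n_out) and b of shape (n_out,).
    """
    d = X.shape[1]
    mean = X.mean(axis=0)
    Sw = np.zeros((d, d))  # within-class scatter
    Sb = np.zeros((d, d))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        diff = (mc - mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    Sw += eps * np.eye(d)  # assumption: small ridge term keeps Sw invertible

    # Whiten with Sw^(-1/2), then diagonalize the (symmetric) whitened Sb.
    evals, evecs = np.linalg.eigh(Sw)
    Sw_inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    vals, vecs = np.linalg.eigh(Sw_inv_sqrt @ Sb @ Sw_inv_sqrt)

    # Keep the n_out directions with the largest discriminative power.
    order = np.argsort(vals)[::-1][:n_out]
    W = Sw_inv_sqrt @ vecs[:, order]
    b = -mean @ W  # center the layer output on the training data
    return W, b

# Toy usage: two Gaussian classes in five dimensions (synthetic data).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (100, 5)),
               rng.normal(2.0, 1.0, (100, 5))])
y = np.repeat([0, 1], 100)
W, b = lda_layer_init(X, y, n_out=1)
z = X @ W + b  # output of the LDA-initialized layer before any non-linearity
```

Note that LDA yields at most (number of classes - 1) informative directions, so any further columns requested through `n_out` carry no discriminative signal in this sketch. How layers wider than that are filled is not covered here.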

