Neural Networks for Latent Budget Analysis of Compositional Data

by Zhenwei Yang, et al.
Utrecht University

Compositional data are non-negative data collected in a rectangular matrix with a constant row sum. Because the data are non-negative, the focus is on conditional proportions that add up to 1 for each row. A row of conditional proportions is called an observed budget. Latent budget analysis (LBA) assumes that a mixture of latent budgets explains the observed budgets. LBA is usually fitted to a contingency table, where the rows are the levels of one or more explanatory variables and the columns are the levels of a response variable. In prospective studies, only the explanatory variables of individuals are known, and the interest lies in predicting the response variable. Thus, a form of LBA is needed that supports prediction. Previous studies proposed a constrained neural network (NN) extension of LBA, but it was hampered by unsatisfactory predictive ability. Here we propose LBA-NN, a feedforward NN model that yields an interpretation similar to that of LBA but equips LBA with better predictive ability. A stable and plausible interpretation of LBA-NN is obtained through importance plots and an importance table, which show the relative importance of all explanatory variables for the response variable. An LBA-NN-K-means approach, which applies K-means clustering to the importance table, is used to produce K clusters that are comparable to the K latent budgets in LBA. We report several experiments in which LBA-NN is implemented and compared with LBA. In our analysis, LBA-NN outperforms LBA in prediction in terms of accuracy, specificity, recall and mean square error. We provide open-source software on GitHub.
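As a rough illustration of the terms used above, the sketch below computes observed budgets from a small contingency table and builds expected budgets as a mixture of latent budgets, using the standard LBA form in which each row's budget is a weighted sum of K latent budgets. The counts, the mixing weights and the latent budgets are invented for illustration, and the function names are hypothetical; this is not the authors' software and no model is fitted here.

```python
import numpy as np

def observed_budgets(counts):
    """Turn a contingency table into row-wise conditional proportions."""
    counts = np.asarray(counts, dtype=float)
    return counts / counts.sum(axis=1, keepdims=True)

def expected_budgets(mixing, latent):
    """Mix K latent budgets into one expected budget per row.

    mixing: (I, K) array, each row sums to 1 (weights of the latent budgets)
    latent: (K, J) array, each row sums to 1 (one latent budget per row)
    """
    return np.asarray(mixing, dtype=float) @ np.asarray(latent, dtype=float)

# Hypothetical 3 x 3 contingency table:
# rows = levels of an explanatory variable, columns = levels of the response.
counts = np.array([[30, 10, 10],
                   [ 5, 25, 20],
                   [10, 10, 30]])
P = observed_budgets(counts)

# Hypothetical parameters for K = 2 latent budgets (assumed, not estimated).
mixing = np.array([[0.8, 0.2],
                   [0.1, 0.9],
                   [0.3, 0.7]])
latent = np.array([[0.7, 0.2, 0.1],
                   [0.1, 0.4, 0.5]])
Pi = expected_budgets(mixing, latent)

# Mean squared difference between observed and expected budgets.
mse = float(np.mean((P - Pi) ** 2))
```

Because every row of the mixing weights and every latent budget sums to 1, each expected budget also sums to 1, so the mixture stays a valid composition.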







The α-k-NN regression for compositional data

Compositional data arise in many real-life applications and versatile me...

Time series forecasting using neural networks

Recent studies have shown the classification and prediction power of the...

Identifying Table Structure in Documents using Conditional Generative Adversarial Networks

In many industries, as well as in academic research, information is prim...

A tractable Multi-Partitions Clustering

In the framework of model-based clustering, a model allowing several lat...

nn-dependability-kit: Engineering Neural Networks for Safety-Critical Systems

nn-dependability-kit is an open-source toolbox to support safety enginee...

Implementation of a neural network for non-linearities estimation in a tail-sitter aircraft

The control of a tail-sitter aircraft is a challenging task, especially ...

Comparing Machine Learning Approaches for Table Recognition in Historical Register Books

We present in this paper experiments on Table Recognition in hand-writte...