Minimum description length as an objective function for non-negative matrix factorization

02/05/2019
by   Steven Squires, et al.
0

Non-negative matrix factorization (NMF) is a dimensionality reduction technique which tends to produce a sparse representation of data. Commonly, the error between the actual and recreated matrices is used as an objective function, but this method may not produce the type of representation we desire as it allows for the complexity of the model to grow, constrained only by the size of the subspace and the non-negativity requirement. If additional constraints, such as sparsity, are imposed the question of parameter selection becomes critical. Instead of adding sparsity constraints in an ad-hoc manner we propose a novel objective function created by using the principle of minimum description length (MDL). Our formulation, MDL-NMF, automatically trades off between the complexity and accuracy of the model using a principled approach with little parameter selection or the need for domain expertise. We demonstrate our model works effectively on three heterogeneous data-sets and on a range of semi-synthetic data showing the broad applicability of our method.

READ FULL TEXT
research
10/31/2019

Solving NMF with smoothness and sparsity constraints using PALM

Non-negative matrix factorization is a problem of dimensionality reducti...
research
04/27/2021

Structured Sparse Non-negative Matrix Factorization with L20-Norm for scRNA-seq Data Analysis

Non-negative matrix factorization (NMF) is a powerful tool for dimension...
research
04/07/2016

A Unified Framework for Sparse Non-Negative Least Squares using Multiplicative Updates and the Non-Negative Matrix Factorization Problem

We study the sparse non-negative least squares (S-NNLS) problem. S-NNLS ...
research
12/08/2020

Sparse encoding for more-interpretable feature-selecting representations in probabilistic matrix factorization

Dimensionality reduction methods for count data are critical to a wide r...
research
10/03/2022

Process Modeling, Hidden Markov Models, and Non-negative Tensor Factorization with Model Selection

Monitoring of industrial processes is a critical capability in industry ...
research
10/30/2019

Constrained Polynomial Likelihood

Starting from a distribution z, we develop a non-negative polynomial min...
research
06/21/2021

Objective discovery of dominant dynamical processes with intelligible machine learning

The advent of big data has vast potential for discovery in natural pheno...

Please sign up or login with your details

Forgot password? Click here to reset