Binary Matrix Factorization via Dictionary Learning

04/16/2018
by   Ignacio Ramírez, et al.
0

Matrix factorization is a key tool in data analysis; its applications include recommender systems, correlation analysis, signal processing, among others. Binary matrices are a particular case which has received significant attention for over thirty years, especially within the field of data mining. Dictionary learning refers to a family of methods for learning overcomplete basis (also called frames) in order to efficiently encode samples of a given type; this area, now also about twenty years old, was mostly developed within the signal processing field. In this work we propose two binary matrix factorization methods based on a binary adaptation of the dictionary learning paradigm to binary matrices. The proposed algorithms focus on speed and scalability; they work with binary factors combined with bit-wise operations and a few auxiliary integer ones. Furthermore, the methods are readily applicable to online binary matrix factorization. Another important issue in matrix factorization is the choice of rank for the factors; we address this model selection problem with an efficient method based on the Minimum Description Length principle. Our preliminary results show that the proposed methods are effective at producing interpretable factorizations of various data types of different nature.

READ FULL TEXT

page 8

page 9

research
11/10/2020

Applications of Online Nonnegative Matrix Factorization to Image and Time-Series Data

Online nonnegative matrix factorization (ONMF) is a matrix factorization...
research
12/05/2022

Matrix factorization with neural networks

Matrix factorization is an important mathematical problem encountered in...
research
05/03/2016

Dictionary Learning for Massive Matrix Factorization

Sparse matrix factorization is a popular tool to obtain interpretable da...
research
05/09/2016

Identification of refugee influx patterns in Greece via model-theoretic analysis of daily arrivals

The refugee crisis is perhaps the single most challenging problem for Eu...
research
11/05/2019

Online matrix factorization for Markovian data and applications to Network Dictionary Learning

Online Matrix Factorization (OMF) is a fundamental tool for dictionary l...
research
02/08/2014

An Inequality with Applications to Structured Sparsity and Multitask Dictionary Learning

From concentration inequalities for the suprema of Gaussian or Rademache...
research
04/10/2020

The Permuted Striped Block Model and its Factorization – Algorithms with Recovery Guarantees

We introduce a novel class of matrices which are defined by the factoriz...

Please sign up or login with your details

Forgot password? Click here to reset