Boolean and 𝔽_p-Matrix Factorization: From Theory to Practice

07/25/2022
by   Fedor Fomin, et al.
0

Boolean Matrix Factorization (BMF) aims to find an approximation of a given binary matrix as the Boolean product of two low-rank binary matrices. Binary data is ubiquitous in many fields, and representing data by binary matrices is common in medicine, natural language processing, bioinformatics, computer graphics, among many others. Unfortunately, BMF is computationally hard and heuristic algorithms are used to compute Boolean factorizations. Very recently, the theoretical breakthrough was obtained independently by two research groups. Ban et al. (SODA 2019) and Fomin et al. (Trans. Algorithms 2020) show that BMF admits an efficient polynomial-time approximation scheme (EPTAS). However, despite the theoretical importance, the high double-exponential dependence of the running times from the rank makes these algorithms unimplementable in practice. The primary research question motivating our work is whether the theoretical advances on BMF could lead to practical algorithms. The main conceptional contribution of our work is the following. While EPTAS for BMF is a purely theoretical advance, the general approach behind these algorithms could serve as the basis in designing better heuristics. We also use this strategy to develop new algorithms for related 𝔽_p-Matrix Factorization. Here, given a matrix A over a finite field GF(p) where p is a prime, and an integer r, our objective is to find a matrix B over the same field with GF(p)-rank at most r minimizing some norm of A-B. Our empirical research on synthetic and real-world data demonstrates the advantage of the new algorithms over previous works on BMF and 𝔽_p-Matrix Factorization.

READ FULL TEXT
research
12/05/2020

Recent Developments in Boolean Matrix Factorization

The goal of Boolean Matrix Factorization (BMF) is to approximate a given...
research
10/27/2020

C-SALT: Mining Class-Specific ALTerations in Boolean Matrix Factorization

Given labeled data represented by a binary matrix, we consider the task ...
research
07/14/2023

Efficiently Factorizing Boolean Matrices using Proximal Gradient Descent

Addressing the interpretability problem of NMF on Boolean data, Boolean ...
research
09/09/2019

MEBF: a fast and efficient Boolean matrix factorization method

Boolean matrix has been used to represent digital information in many fi...
research
09/09/2019

Fast And Efficient Boolean Matrix Factorization By Geometric Segmentation

Boolean matrix has been used to represent digital information in many fi...
research
01/17/2019

Boolean matrix factorization meets consecutive ones property

Boolean matrix factorization is a natural and a popular technique for su...
research
05/28/2019

Factorizations for a Class of Multivariate Polynomial Matrices

Following the works by Lin et al. (Circuits Syst. Signal Process. 20(6):...

Please sign up or login with your details

Forgot password? Click here to reset