topFiberM: Scalable and Efficient Boolean Matrix Factorization

03/06/2019
by   Abdelmoneim Amer Desouki, et al.
0

Matrix Factorization has many applications such as clustering. When the matrix is Boolean it is favorable to have Boolean factors too. This will save the efforts of quantizing the reconstructed data back, which usually is done using arbitrary thresholds. Here we introduce topFiberM a Boolean matrix factorization algorithm. topFiberM chooses in a greedy way the fibers (rows or columns) to represent the entire matrix. Fibers are extended to rectangles according to a threshold on precision. The search for these "top fibers" can continue beyond the required rank and according to an optional parameter that defines the limit for this search. A factor with a better gain replaces the factor with minimum gain in "top fibers". We compared topFiberM to the state-of-the-art methods, it achieved better quality for the set of datasets usually used in literature. We also applied our algorithm to linked-data to show its scalability. topFiberM was in average 128 times faster than the well known Asso method when applied to a set of matrices representing a real multigraph although Asso is implemented in C and topFiberM is implemented in R which is generally slower than C. topFiberM is publicly available from Github (https://github.com/dice-group/BMF).

READ FULL TEXT
research
06/18/2021

Boolean Matrix Factorization with SAT and MaxSAT

The Boolean matrix factorization problem consists in approximating a mat...
research
01/28/2019

From-Below Boolean Matrix Factorization Algorithm Based on MDL

During the past few years Boolean matrix factorization (BMF) has become ...
research
05/17/2023

Algorithms for Boolean Matrix Factorization using Integer Programming

Boolean matrix factorization (BMF) approximates a given binary input mat...
research
12/05/2020

Recent Developments in Boolean Matrix Factorization

The goal of Boolean Matrix Factorization (BMF) is to approximate a given...
research
12/05/2020

Biclustering and Boolean Matrix Factorization in Data Streams

We study the clustering of bipartite graphs and Boolean matrix factoriza...
research
02/02/2021

Symmetric Boolean Factor Analysis with Applications to InstaHide

In this work we examine the security of InstaHide, a recently proposed s...
research
09/09/2019

Fast And Efficient Boolean Matrix Factorization By Geometric Segmentation

Boolean matrix has been used to represent digital information in many fi...

Please sign up or login with your details

Forgot password? Click here to reset