Fast Matrix Factorization with Non-Uniform Weights on Missing Data

11/11/2018
by   Xiangnan He, et al.
0

Matrix factorization (MF) has been widely used to discover the low-rank structure and to predict the missing entries of data matrix. In many real-world learning systems, the data matrix can be very high-dimensional but sparse. This poses an imbalanced learning problem, since the scale of missing entries is usually much larger than that of observed entries, but they cannot be ignored due to the valuable negative signal. For efficiency concern, existing work typically applies a uniform weight on missing entries to allow a fast learning algorithm. However, this simplification will decrease modeling fidelity, resulting in suboptimal performance for downstream applications. In this work, we weight the missing data non-uniformly, and more generically, we allow any weighting strategy on the missing data. To address the efficiency challenge, we propose a fast learning method, for which the time complexity is determined by the number of observed entries in the data matrix, rather than the matrix size. The key idea is two-fold: 1) we apply truncated SVD on the weight matrix to get a more compact representation of the weights, and 2) we learn MF parameters with element-wise alternating least squares (eALS) and memorize the key intermediate variables to avoid repeating computations that are unnecessary. We conduct extensive experiments on two recommendation benchmarks, demonstrating the correctness, efficiency, and effectiveness of our fast eALS method.

READ FULL TEXT

page 1

page 13

research
08/16/2017

Fast Matrix Factorization for Online Recommendation with Implicit Feedback

This paper contributes improvements on both the effectiveness and effici...
research
10/25/2021

Fast Rank-1 NMF for Missing Data with KL Divergence

We propose a fast non-gradient based method of rank-1 non-negative matri...
research
09/26/2022

Bounded Simplex-Structured Matrix Factorization

In this paper, we propose a new low-rank matrix factorization model dubb...
research
10/01/2014

Generalized Low Rank Models

Principal components analysis (PCA) is a well-known technique for approx...
research
10/01/2019

Covariance Matrix Estimation with Non Uniform and Data Dependent Missing Observations

In this paper we study covariance estimation with missing data. We consi...
research
12/15/2019

Polynomial Matrix Completion for Missing Data Imputation and Transductive Learning

This paper develops new methods to recover the missing entries of a high...
research
02/28/2013

Scoup-SMT: Scalable Coupled Sparse Matrix-Tensor Factorization

How can we correlate neural activity in the human brain as it responds t...

Please sign up or login with your details

Forgot password? Click here to reset