BELT: Block-wise Missing Embedding Learning Transformer

05/21/2021
by   Doudou Zhou, et al.

Matrix completion has attracted attention in many fields, including statistics, applied mathematics, and electrical engineering. Most existing work focuses on independent sampling models, under which the observed entries are sampled independently. Motivated by the integration of multiple Electronic Health Record (EHR) datasets, we propose Block-wise missing Embedding Learning Transformer (BELT), a method for handling row-wise/column-wise missingness. Specifically, BELT recovers block-wise missing matrices efficiently when every pair of matrices has an overlap. The idea is to use the orthogonal Procrustes problem to align the eigenspaces of two sub-matrices through their overlap, then complete the missing blocks by the inner product of the two low-rank components. We also prove a statistical rate for the eigenspace of the underlying matrix that is comparable to the rate under the independent-missingness assumption. Simulation studies show that the method performs well under a variety of configurations. In the real data analysis, the method is applied to two tasks: (i) the integration of several point-wise mutual information matrices built from English EHR and Chinese medical text data, and (ii) machine translation between English and Chinese medical concepts. Our method shows an advantage over existing methods.
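
The alignment-then-completion idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a noise-free, symmetric positive semi-definite matrix of known rank and only two observed diagonal blocks, and all names (`low_rank_factor`, `procrustes_rotation`, `idx1`, `idx2`) are hypothetical. Each block is factorized by its top eigenpairs, the two factors are aligned on the overlapping rows by solving an orthogonal Procrustes problem, and the unobserved off-diagonal block is filled by the inner product of the aligned factors.

```python
import numpy as np

def low_rank_factor(block, rank):
    """Return X such that X @ X.T approximates a symmetric PSD block,
    using its top-`rank` eigenpairs (PSD is an assumption of this sketch)."""
    vals, vecs = np.linalg.eigh(block)
    top = np.argsort(vals)[::-1][:rank]
    # Clip tiny negative eigenvalues caused by round-off before the square root.
    return vecs[:, top] * np.sqrt(np.clip(vals[top], 0.0, None))

def procrustes_rotation(source, target):
    """Orthogonal Q minimizing ||source @ Q - target||_F (orthogonal Procrustes)."""
    u, _, vt = np.linalg.svd(source.T @ target)
    return u @ vt

# Synthetic ground truth: a rank-3 PSD matrix on 12 items (illustrative only).
rng = np.random.default_rng(0)
n, rank = 12, 3
truth_factor = rng.normal(size=(n, rank))
full = truth_factor @ truth_factor.T

# Two observed diagonal blocks that share items 4..7.
idx1 = np.arange(0, 8)
idx2 = np.arange(4, 12)
overlap = np.intersect1d(idx1, idx2)

x1 = low_rank_factor(full[np.ix_(idx1, idx1)], rank)
x2 = low_rank_factor(full[np.ix_(idx2, idx2)], rank)

# Positions of the overlapping items inside each block.
pos1 = np.searchsorted(idx1, overlap)
pos2 = np.searchsorted(idx2, overlap)

# Rotate the second factor into the coordinate system of the first.
q = procrustes_rotation(x2[pos2], x1[pos1])
x2_aligned = x2 @ q

# Fill the block that neither sub-matrix observes.
only1 = np.setdiff1d(idx1, idx2)
only2 = np.setdiff1d(idx2, idx1)
rows1 = np.searchsorted(idx1, only1)
rows2 = np.searchsorted(idx2, only2)
estimated_block = x1[rows1] @ x2_aligned[rows2].T

max_error = np.max(np.abs(estimated_block - full[np.ix_(only1, only2)]))
print(f"max abs error on the missing block: {max_error:.2e}")
```

In this idealized setting the overlap must contain at least `rank` items for the rotation to be identifiable. With noisy or non-PSD inputs the recovery would only be approximate, and the sketch would need to be adapted.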

