Optimal Permutation Recovery in Permuted Monotone Matrix Model

11/24/2019
by   Rong Ma, et al.
0

Motivated by recent research on quantifying bacterial growth dynamics based on genome assemblies, we consider a permuted monotone matrix model Y=ΘΠ+Z, where the rows represent different samples, the columns represent contigs in genome assemblies and the elements represent log-read counts after preprocessing steps and Guanine-Cytosine (GC) adjustment. In this model, Θ is an unknown mean matrix with monotone entries for each row, Π is a permutation matrix that permutes the columns of Θ, and Z is a noise matrix. This paper studies the problem of estimation/recovery of Π given the observed noisy matrix Y. We propose an estimator based on the best linear projection, which is shown to be minimax rate-optimal for both exact recovery, as measured by the 0-1 loss, and partial recovery, as quantified by the normalized Kendall's tau distance. Simulation studies demonstrate the superior empirical performance of the proposed estimator over alternative methods. We demonstrate the methods using a synthetic metagenomics dataset of 45 closely related bacterial species and a real metagenomic dataset to compare the bacterial growth dynamics between the responders and the non-responders of the IBD patients after 8 weeks of treatment.

READ FULL TEXT
research
11/28/2019

Optimal and Adaptive Estimation of Extreme Values in the Permuted Monotone Matrix Model

Motivated by applications in metagenomics, we consider the permuted mono...
research
07/08/2016

Optimal Rates of Statistical Seriation

Given a matrix the seriation problem consists in permuting its rows in s...
research
02/02/2018

Unlabelled Sensing: A Sparse Bayesian Learning Approach

We address the recovery of sparse vectors in an overcomplete, linear and...
research
06/25/2018

Towards Optimal Estimation of Bivariate Isotonic Matrices with Unknown Permutations

Many applications, including rank aggregation, crowd-labeling, and graph...
research
03/20/2023

Sparse Recovery with Shuffled Labels: Statistical Limits and Practical Estimators

This paper considers the sparse recovery with shuffled labels, i.e., = ...
research
06/05/2021

Learning Treatment Effects in Panels with General Intervention Patterns

The problem of causal inference with panel data is a central econometric...
research
07/12/2021

Likelihood estimation of sparse topic distributions in topic models and its applications to Wasserstein document distance calculations

This paper studies the estimation of high-dimensional, discrete, possibl...

Please sign up or login with your details

Forgot password? Click here to reset