MREC: a fast and versatile framework for aligning and matching data with applications to single cell molecular data

01/06/2020
by   Andrew J. Blumberg, et al.
0

Comparing and aligning large datasets is a pervasive problem occurring across many different knowledge domains. We introduce and study MREC, a recursive decomposition algorithm for computing matchings between data sets. The basic idea is to partition the data, match the partitions, and then recursively match the points within each pair of identified partitions. The matching itself is done using black box matching procedures that are too expensive to run on the entire data set. Using an absolute measure of the quality of a matching, the framework supports optimization over parameters including partitioning procedures and matching algorithms. By design, MREC can be applied to extremely large data sets. We analyze the procedure to describe when we can expect it to work well and demonstrate its flexibility and power by applying it to a number of alignment problems arising in the analysis of single cell molecular data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/06/2020

MREC: a fast and versatile framework for aligning and matching point clouds with applications to single cell molecular data

Comparing and aligning large datasets is a pervasive problem occurring a...
research
03/29/2020

Elastic Coupled Co-clustering for Single-Cell Genomic Data

The recent advances in single-cell technologies have enabled us to profi...
research
07/30/2019

Comparing partitions through the Matching Error

With the aim to propose a non parametric hypothesis test, this paper car...
research
03/24/2020

Notes on Equitable Partition into Matching Forests in Mixed Graphs and into b-branchings in Digraphs

An equitable partition into branchings in a digraph is a partition of th...
research
05/22/2019

AXS: A framework for fast astronomical data processing based on Apache Spark

We introduce AXS (Astronomy eXtensions for Spark), a scalable open-sourc...
research
12/21/2019

Black Box Recursive Translations for Molecular Optimization

Machine learning algorithms for generating molecular structures offer a ...
research
12/09/2017

Assessing Achievability of Queries and Constraints

Assessing and improving the quality of data in data-intensive systems ar...

Please sign up or login with your details

Forgot password? Click here to reset