Phase Transitions in the Pooled Data Problem

10/18/2017
by   Jonathan Scarlett, et al.
0

In this paper, we study the pooled data problem of identifying the labels associated with a large collection of items, based on a sequence of pooled tests revealing the counts of each label within the pool. In the noiseless setting, we identify an exact asymptotic threshold on the required number of tests with optimal decoding, and prove a phase transition between complete success and complete failure. In addition, we present a novel noisy variation of the problem, and provide an information-theoretic framework for characterizing the required number of tests for general random noise models. Our results reveal that noise can make the problem considerably more difficult, with strict increases in the scaling laws even at low noise levels. Finally, we demonstrate similar behavior in an approximate recovery setting, where a given number of errors is allowed in the decoded labels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2020

Improved Bounds and Algorithms for Sparsity-Constrained Group Testing

In group testing, the goal is to identify a subset of defective items wi...
research
01/11/2022

Performance Bounds for Group Testing With Doubly-Regular Designs

In the group testing problem, the goal is to identify a subset of defect...
research
06/15/2022

Statistical and Computational Phase Transitions in Group Testing

We study the group testing problem where the goal is to identify a set o...
research
03/14/2018

Noisy Adaptive Group Testing: Bounds and Algorithms

The group testing problem consists of determining a small set of defecti...
research
11/07/2019

An Efficient Algorithm for Capacity-Approaching Noisy Adaptive Group Testing

In this paper, we consider the group testing problem with adaptive test ...
research
11/10/2020

Group testing and local search: is there a computational-statistical gap?

In this work we study the fundamental limits of approximate recovery in ...
research
11/18/2018

Information Theoretic Bounds on Optimal Worst-case Error in Binary Mixture Identification

Identification of latent binary sequences from a pool of noisy observati...

Please sign up or login with your details

Forgot password? Click here to reset