A Coding Theory Perspective on Multiplexed Molecular Profiling of Biological Tissues

01/26/2021
by Luca D'Alessio, et al.

High-throughput, quantitative experimental technologies are advancing rapidly in the biological sciences. One important recent technique is multiplexed fluorescence in situ hybridization (mFISH), which enables the identification and localization of large numbers of individual strands of RNA within single cells. At the core of this technology is a coding problem: if each RNA sequence of interest is a codeword, how should the codebook of probes be designed, and how should the resulting noisy measurements be decoded? Published work has relied on assumptions of uniformly distributed codewords and binary symmetric channels for decoding and, to a lesser degree, for code construction. Here we establish that both of these assumptions are inappropriate in the context of mFISH experiments and that substantial gains in decoding performance can be obtained by using more appropriate, less classical assumptions. We propose an asymmetric channel model that can be readily parameterized from data and use it to develop maximum a posteriori (MAP) decoders. We show that the false discovery rate for rare RNAs, which is the key experimental metric, is vastly improved with MAP decoders even when they are employed with the existing sub-optimal codebook. Using an evolutionary optimization methodology, we further show that permuting the codebook to better align with the prior, which is an experimentally straightforward procedure, yields significant further improvements.
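To make the decoding idea concrete, the following is a minimal sketch of MAP decoding over a binary asymmetric channel with a non-uniform codeword prior. It is an illustration under stated assumptions, not the authors' implementation: the function name `map_decode`, the toy codebook, the prior, and the flip probabilities `p01` (a 0 read as 1) and `p10` (a 1 read as 0) are all hypothetical, and bit errors are assumed independent across positions.

```python
import numpy as np

def map_decode(received, codebook, prior, p01, p10):
    """Return the index of the MAP codeword for one received binary word.

    Assumptions (illustrative only): bits flip independently,
    p01 = P(read 1 | sent 0) and p10 = P(read 0 | sent 1).
    """
    codebook = np.asarray(codebook)
    received = np.asarray(received)
    # Per-position log-likelihood of the received bit, for a sent 1 and a sent 0.
    ll_sent1 = np.where(received == 1, np.log(1 - p10), np.log(p10))
    ll_sent0 = np.where(received == 1, np.log(p01), np.log(1 - p01))
    # Pick the matching term for every codeword bit and sum over positions.
    log_lik = np.where(codebook == 1, ll_sent1, ll_sent0).sum(axis=1)
    # MAP adds the log prior; a uniform prior would reduce this to ML decoding.
    log_post = log_lik + np.log(np.asarray(prior))
    return int(np.argmax(log_post))

# Toy example: three codewords, the second one far more abundant than the others.
toy_codebook = [[1, 1, 0, 0],
                [0, 1, 1, 0],
                [0, 0, 1, 1]]
toy_prior = [0.09, 0.90, 0.01]

# The word 0100 is one dropped bit away from both of the first two codewords,
# so the likelihoods tie and the prior decides in favor of the abundant one.
decoded = map_decode([0, 1, 0, 0], toy_codebook, toy_prior, p01=0.02, p10=0.2)
```

In this toy setting the rare third codeword is still recovered when its word is read without error, because its likelihood outweighs its small prior; the prior only dominates when the measurement is ambiguous.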


