Designing robust watermark barcodes for multiplex long-read sequencing

04/05/2016
by   Joaquín Ezpeleta, et al.
0

A method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing is presented. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11 is the first method to specifically address this problem without requiring upstream quality improvement. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10^-7, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2021

Permutation-Invariant Quantum Codes for Deletion Errors

This paper presents conditions for constructing permutation-invariant qu...
research
11/28/2022

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

We explore unifying a neural segmenter with two-pass cascaded encoder AS...
research
03/06/2013

Constructing Lower Probabilities

An elaboration of Dempster's method of constructing belief functions sug...
research
04/02/2020

Single Quantum Deletion Error-Correcting Codes

In this paper, we discuss a construction method of quantum deletion erro...
research
01/10/2023

HQAlign: Aligning nanopore reads for SV detection using current-level modeling

Motivation: Detection of structural variants (SV) from the alignment of ...
research
01/27/2020

diBELLA: Distributed Long Read to Long Read Alignment

We present a parallel algorithm and scalable implementation for genome a...
research
12/15/2019

Towards Building a Real Time Mobile Device Bird Counting System Through Synthetic Data Training and Model Compression

Counting the number of birds in an open sky setting has been an challeng...

Please sign up or login with your details

Forgot password? Click here to reset