Minimum Segmentation for Pan-genomic Founder Reconstruction in Optimal Time

05/09/2018
by Tuukka Norri, et al.

Given a threshold L and a set R = {R_1, ..., R_m} of m haplotype sequences, each of length n, the minimum segmentation problem for founder reconstruction is to partition the sequences into disjoint segments R[i_1+1, i_2], R[i_2+1, i_3], ..., R[i_{r-1}+1, i_r], where 0 = i_1 < ... < i_r = n and R[i_{j-1}+1, i_j] is the set {R_1[i_{j-1}+1, i_j], ..., R_m[i_{j-1}+1, i_j]}, such that the length of each segment, i_j - i_{j-1}, is at least L and K = max_j { |R[i_{j-1}+1, i_j]| } is minimized. The distinct substrings in the segments R[i_{j-1}+1, i_j] represent founder blocks that can be concatenated to form K founder sequences representing the original R such that crossovers happen only at segment boundaries. We give an optimal O(mn) time algorithm to solve the problem, improving on the earlier O(mn^2) bound. This improvement makes it possible to apply the algorithm in a pan-genomic setting where the haplotypes are complete human chromosomes, with the goal of finding a representative set of references that can be indexed for read alignment and variant calling.
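
To make the problem statement concrete, the following is a minimal Python sketch of a naive dynamic program over segment boundaries. It is not the O(mn) algorithm of the paper and is far slower; it only illustrates the objective being minimized. The function name `min_segmentation` and the 0-based boundary convention (a boundary list `[0, ..., n]`) are assumptions made for this illustration.

```python
from typing import List, Tuple


def min_segmentation(R: List[str], L: int) -> Tuple[int, List[int]]:
    """Naive DP for the minimum segmentation problem (illustrative only).

    Returns (K, boundaries), where boundaries = [0, ..., n] and each
    consecutive pair (a, b) denotes the segment covering columns a+1..b
    (1-based), i.e. the Python slice r[a:b] of every sequence r.
    """
    n = len(R[0])
    if any(len(r) != n for r in R):
        raise ValueError("all sequences must have the same length")
    if L < 1 or L > n:
        raise ValueError("need 1 <= L <= n for a valid segmentation")

    INF = float("inf")
    best = [INF] * (n + 1)   # best[j]: minimal K for the prefix of length j
    prev = [-1] * (n + 1)    # prev[j]: start boundary of the last segment
    best[0] = 0              # empty prefix needs no segments

    for j in range(L, n + 1):
        # The last segment is r[i:j]; it must have length j - i >= L.
        for i in range(0, j - L + 1):
            if best[i] == INF:
                continue  # prefix of length i cannot be segmented
            distinct = len({r[i:j] for r in R})
            cost = max(best[i], distinct)
            if cost < best[j]:
                best[j] = cost
                prev[j] = i

    # Backtrack from n to 0 to recover the segment boundaries.
    boundaries = [n]
    j = n
    while j > 0:
        j = prev[j]
        boundaries.append(j)
    boundaries.reverse()
    return int(best[n]), boundaries


if __name__ == "__main__":
    haplotypes = ["AAAA", "AABB", "BBAA", "BBBB"]
    print(min_segmentation(haplotypes, 2))
```

In the toy input above, the four haplotypes are all distinct as whole sequences, but each half contains only the blocks `AA` and `BB`, so cutting in the middle allows two founder sequences to represent the input with crossovers only at the segment boundary.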


