MONI can find k-MEMs

02/10/2022
by   Travis Gagie, et al.
0

Maximal exact matches (MEMs) have been widely used in bioinformatics at least since Li (2013) presented BWA-MEM. Building on work by Bannai, Gagie and I (2018), Rossi et al. (2022) recently built an index called MONI, based on the run-length compressed Burrows-Wheeler Transform, that can find MEMs efficiently with respect to pangenomes. In this paper we define k-MEMs to be maximal substrings of a pattern that each occur exactly at k times in a text (so a MEM is a 1-MEM) and show that, when k is given at construction time, MONI can find k-MEMs efficiently as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2022

Computing Maximal Unique Matches with the r-index

In recent years, pangenomes received increasing attention from the scien...
research
11/14/2022

Augmented Thresholds for MONI

MONI (Rossi et al., 2022) can store a pangenomic dataset T in small spac...
research
09/19/2022

MARIA: Multiple-alignment r-index with aggregation

There now exist compact indexes that can efficiently list all the occurr...
research
11/16/2018

Efficient Construction of a Complete Index for Pan-Genomics Read Alignment

While short read aligners, which predominantly use the FM-index, are abl...
research
12/08/2021

RLBWT Tricks

Experts would probably have guessed that compressed sparse bitvectors we...
research
02/14/2020

On Extensions of Maximal Repeats in Compressed Strings

This paper provides an upper bound for several subsets of maximal repeat...
research
04/27/2020

In-Place Bijective Burrows-Wheeler Transforms

One of the most well-known variants of the Burrows-Wheeler transform (BW...

Please sign up or login with your details

Forgot password? Click here to reset