LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment

02/12/2020
by Alberto Zeni, et al.

Pairwise sequence alignment is one of the most computationally intensive kernels in genomic data analysis, accounting for more than 90% of the runtime for key bioinformatics applications. This method is particularly expensive for third-generation sequences due to the high computational cost of analyzing sequences of length between 1Kb and 1Mb. Given the quadratic overhead of exact pairwise algorithms for long alignments, the community primarily relies on approximate algorithms that search only for high-quality alignments and stop early when one is not found. In this work, we present the first GPU optimization of the popular X-drop alignment algorithm, which we name LOGAN. Results show that our high-performance multi-GPU implementation achieves up to 181.6 GCUPS and speed-ups of up to 6.6x and 30.7x using 1 and 6 NVIDIA Tesla V100 GPUs, respectively, over the state-of-the-art software running on two IBM Power9 processors using 168 CPU threads, with equivalent accuracy. We also demonstrate a 2.3x LOGAN speed-up versus ksw2, a state-of-the-art vectorized algorithm for sequence alignment implemented in minimap2, a long-read mapping software. To highlight the impact of our work on a real-world application, we couple LOGAN with a many-to-many long-read alignment software called BELLA, and demonstrate that our implementation improves the overall BELLA runtime by up to 10.6x. Finally, we adapt the Roofline model for LOGAN and demonstrate that our implementation is near-optimal on the NVIDIA Tesla V100s.
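
The early-stopping idea behind X-drop can be illustrated with a small CPU sketch. The code below is not LOGAN's GPU kernel: it is a simplified row-by-row variant, and the function name `xdrop_extend`, the linear gap model, and the default scores (+1 match, -1 mismatch, -1 gap) are assumptions chosen for illustration. It extends an alignment from the start of both sequences, discards any cell whose score falls more than `x_drop` below the best score seen so far, and stops as soon as a whole row has been discarded.

```python
def xdrop_extend(query, target, x_drop, match=1, mismatch=-1, gap=-1):
    """Simplified X-drop extension from position (0, 0) of both sequences.

    Returns (best_score, (i, j)), where (i, j) is the number of query and
    target characters consumed at the best-scoring cell.
    Assumes gap < 0 and a linear gap penalty (illustrative choices).
    """
    best, best_pos = 0, (0, 0)

    # Row 0: leading gaps, kept only while within x_drop of the best score.
    prev = {0: 0}
    for j in range(1, len(target) + 1):
        if j * gap < best - x_drop:
            break
        prev[j] = j * gap

    for i in range(1, len(query) + 1):
        threshold = best - x_drop  # pruning bound for this row
        curr = {}
        last_prev = max(prev)
        for j in range(min(prev), len(target) + 1):
            # Beyond the previous row's reach, a cell is only reachable
            # horizontally; stop if its left neighbour was pruned.
            if j > last_prev + 1 and (j - 1) not in curr:
                break
            candidates = []
            if j in prev:  # gap in the target
                candidates.append(prev[j] + gap)
            if (j - 1) in prev:  # match or mismatch
                sub = match if query[i - 1] == target[j - 1] else mismatch
                candidates.append(prev[j - 1] + sub)
            if (j - 1) in curr:  # gap in the query
                candidates.append(curr[j - 1] + gap)
            if not candidates:
                continue
            score = max(candidates)
            if score >= threshold:  # keep only cells within x_drop of best
                curr[j] = score
                if score > best:
                    best, best_pos = score, (i, j)
        if not curr:  # every cell dropped too far: terminate early
            break
        prev = curr

    return best, best_pos
```

For two sequences that share a short prefix and then diverge, the extension ends shortly after the divergence point instead of filling the full quadratic matrix, which is the property that makes this family of algorithms attractive for long reads.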

