Semiglobal Sequence Alignment with Gaps Using GPU



Carroll, Thomas C ORCID: 0000-0002-3020-2661, Ojiaku, Jude-Thaddeus and Wong, Prudence WH ORCID: 0000-0001-7935-7245
(2020) Semiglobal Sequence Alignment with Gaps Using GPU. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 17 (6). pp. 2086-2097.

[img] Text
GapsMisJournal.pdf - Author Accepted Manuscript

Download (613kB) | Preview

Abstract

In this paper, we consider the pair-wise semiglobal sequence alignment problem with gaps, which is motivated by the re-sequencing problem that requires to assemble short reads sequences into a genome sequence by referring to a reference sequence. The problem has been studied before for single gap and bounded number of gaps. For single gap, there is a GPU-based algorithm proposed (Barton et al., 2015). In our work, we propose a GPU-based algorithm for the bounded number of gaps case, called GPUGapsMis. We implement the algorithm and compare the performance with the CPU-based algorithm, called CPUGapsMis. The algorithm has two distinct stages: the alignment phase, and the backtrack phase. We investigate several different approaches, in order to determine the most favorable for this problem, by means of a Hybrid model or a wholly-GPU based model, as well as the alignment of single text sequences or multiple text sequences on the GPU at a time. We show that the alignment phase of the algorithm is a good candidate for parallelization, with peak speedup of 11 times. We show that although the backtracking phase is sequential, it is more beneficial to perform it on the GPU, as opposed to returning to the CPU and performing there. When performing both phases on the GPU, GPUGapsMis achieves a peak speedup of 10.4 times against CPUGapsMis. Our data parallel GPU algorithm achieves results which are an improvement on those of an existing GPU data parallel implementation (Ojiaku, 2014).

Item Type: Article
Uncontrolled Keywords: Graphics processors, parallel programming, data communications aspects, bioinformatics
Depositing User: Symplectic Admin
Date Deposited: 02 May 2019 13:30
Last Modified: 19 Jan 2023 00:52
DOI: 10.1109/TCBB.2019.2914105
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3039311