Standard

New algorithms for efficient parallel string comparison. / Krusche, Peter; Tiskin, Alexander.

Annual ACM Symposium on Parallelism in Algorithms and Architectures. 2010. p. 209-216.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Harvard

Krusche, P & Tiskin, A 2010, New algorithms for efficient parallel string comparison. in Annual ACM Symposium on Parallelism in Algorithms and Architectures. pp. 209-216. https://doi.org/10.1145/1810479.1810521

APA

Krusche, P., & Tiskin, A. (2010). New algorithms for efficient parallel string comparison. In Annual ACM Symposium on Parallelism in Algorithms and Architectures (pp. 209-216) https://doi.org/10.1145/1810479.1810521

Vancouver

Krusche P, Tiskin A. New algorithms for efficient parallel string comparison. In Annual ACM Symposium on Parallelism in Algorithms and Architectures. 2010. p. 209-216 https://doi.org/10.1145/1810479.1810521

Author

Krusche, Peter ; Tiskin, Alexander. / New algorithms for efficient parallel string comparison. Annual ACM Symposium on Parallelism in Algorithms and Architectures. 2010. pp. 209-216

BibTeX

@inproceedings{0e85ac9004a74bfeac44917f6b035770,
title = "New algorithms for efficient parallel string comparison",
abstract = "In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min; +)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n × n on p processors using time O( n log n/p ), communication O( n log p/p ) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O( n 2/p ) and communication O( n p/√p ), requiring local memory of size O( n p/√p ) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O( n log2 n/p ), communication O( n log p/p ), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p- factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O( n/p ) per processor.",
keywords = "BSP algorithms, Longest common subsequences, Longest increasing subsequences",
author = "Peter Krusche and Alexander Tiskin",
year = "2010",
month = jul,
day = "30",
doi = "10.1145/1810479.1810521",
language = "English",
pages = "209--216",
booktitle = "Annual ACM Symposium on Parallelism in Algorithms and Architectures",

}

RIS

TY - GEN

T1 - New algorithms for efficient parallel string comparison

AU - Krusche, Peter

AU - Tiskin, Alexander

PY - 2010/7/30

Y1 - 2010/7/30

N2 - In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min; +)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n × n on p processors using time O( n log n/p ), communication O( n log p/p ) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O( n 2/p ) and communication O( n p/√p ), requiring local memory of size O( n p/√p ) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O( n log2 n/p ), communication O( n log p/p ), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p- factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O( n/p ) per processor.

AB - In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min; +)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n × n on p processors using time O( n log n/p ), communication O( n log p/p ) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O( n 2/p ) and communication O( n p/√p ), requiring local memory of size O( n p/√p ) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O( n log2 n/p ), communication O( n log p/p ), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p- factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O( n/p ) per processor.

KW - BSP algorithms

KW - Longest common subsequences

KW - Longest increasing subsequences

UR - http://www.scopus.com/inward/record.url?scp=77954938173&partnerID=8YFLogxK

U2 - 10.1145/1810479.1810521

DO - 10.1145/1810479.1810521

M3 - Conference contribution

AN - SCOPUS:77954938173

SP - 209

EP - 216

BT - Annual ACM Symposium on Parallelism in Algorithms and Architectures

ER -

ID: 127709588