Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › Рецензирование
New algorithms for efficient parallel string comparison. / Krusche, Peter; Tiskin, Alexander.
Annual ACM Symposium on Parallelism in Algorithms and Architectures. 2010. стр. 209-216.Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › Рецензирование
}
TY - GEN
T1 - New algorithms for efficient parallel string comparison
AU - Krusche, Peter
AU - Tiskin, Alexander
PY - 2010/7/30
Y1 - 2010/7/30
N2 - In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min; +)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n × n on p processors using time O( n log n/p ), communication O( n log p/p ) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O( n 2/p ) and communication O( n p/√p ), requiring local memory of size O( n p/√p ) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O( n log2 n/p ), communication O( n log p/p ), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p- factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O( n/p ) per processor.
AB - In this paper, we show new parallel algorithms for a set of classical string comparison problems: computation of string alignments, longest common subsequences (LCS) or edit distances, and longest increasing subsequence computation. These problems have a wide range of applications, in particular in computational biology and signal processing. We discuss the scalability of our new parallel algorithms in computation time, in memory, and in communication. Our new algorithms are based on an efficient parallel method for (min; +)-multiplication of distance matrices. The core result of this paper is a scalable parallel algorithm for multiplying implicit simple unit-Monge matrices of size n × n on p processors using time O( n log n/p ), communication O( n log p/p ) and O(log p) supersteps. This algorithm allows us to implement scalable LCS computation for two strings of length n using time O( n 2/p ) and communication O( n p/√p ), requiring local memory of size O( n p/√p ) on each processor. Furthermore, our algorithm can be used to obtain the first generally work-scalable algorithm for computing the longest increasing subsequence (LIS). Our algorithm for LIS computation requires computation O( n log2 n/p ), communication O( n log p/p ), and O(log2 p) supersteps for computing the LIS of a sequence of length n. This is within a log n factor of work-optimality for the LIS problem, which can be solved sequentially in time O(n log n) in the comparison-based model. Our LIS algorithm is also within a log p- factor of achieving perfectly scalable communication and furthermore has perfectly scalable memory size requirements of O( n/p ) per processor.
KW - BSP algorithms
KW - Longest common subsequences
KW - Longest increasing subsequences
UR - http://www.scopus.com/inward/record.url?scp=77954938173&partnerID=8YFLogxK
U2 - 10.1145/1810479.1810521
DO - 10.1145/1810479.1810521
M3 - Conference contribution
AN - SCOPUS:77954938173
SP - 209
EP - 216
BT - Annual ACM Symposium on Parallelism in Algorithms and Architectures
ER -
ID: 127709588