Research output: Contribution to journal › Article › peer-review
SARP : A Novel Algorithm to Assess Compositional Biases in Protein Sequences. / Antonets, K.S.; Nizhnikov, A.A.
In: Evolutionary Bioinformatics, No. 9, 2013, p. 263-273.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - SARP
T2 - A Novel Algorithm to Assess Compositional Biases in Protein Sequences
AU - Antonets, K.S.
AU - Nizhnikov, A.A.
PY - 2013
Y1 - 2013
N2 - The composition of a defined set of subunits (nucleotides, amino acids) is one of the key features of biological sequences. Compositional biases are local shifts in amino acid or nucleotide frequencies that can occur as an adaptation of an organism to an extreme ecological niche, or as the signature of a specific function or localization of the corresponding protein. The calculation of probability is a method for annotating compositional bias and providing accurate detection of biased subsequences. Here, we present a Sequence Analysis based on the Ranking of Probabilities (SARP), a novel algorithm for the annotation of compositional biases based on ranking subsequences by their probabilities. SARP provides the same accuracy as the previously published Lower Probability Subsequences (LPS) algorithm but performs at an approximately 230-fold faster rate. It can be recommended for use when working with large datasets to reduce the time and resources required.
AB - The composition of a defined set of subunits (nucleotides, amino acids) is one of the key features of biological sequences. Compositional biases are local shifts in amino acid or nucleotide frequencies that can occur as an adaptation of an organism to an extreme ecological niche, or as the signature of a specific function or localization of the corresponding protein. The calculation of probability is a method for annotating compositional bias and providing accurate detection of biased subsequences. Here, we present a Sequence Analysis based on the Ranking of Probabilities (SARP), a novel algorithm for the annotation of compositional biases based on ranking subsequences by their probabilities. SARP provides the same accuracy as the previously published Lower Probability Subsequences (LPS) algorithm but performs at an approximately 230-fold faster rate. It can be recommended for use when working with large datasets to reduce the time and resources required.
U2 - 10.4137/EBO.S12299
DO - 10.4137/EBO.S12299
M3 - Article
C2 - 23919085
SP - 263
EP - 273
JO - Evolutionary Bioinformatics
JF - Evolutionary Bioinformatics
SN - 1176-9343
IS - 9
ER -
ID: 7379537