Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
Pathset graphs : A novel approach for comprehensive utilization of paired reads in genome assembly. / Pham, Son K.; Antipov, Dmitry; Sirotkin, Alexander; Tesler, Glenn; Pevzner, Pavel A.; Alekseyev, Max A.
Research in Computational Molecular Biology - 16th Annual International Conference, RECOMB 2012, Proceedings. 2012. стр. 200-212 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Том 7262 LNBI).Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
}
TY - GEN
T1 - Pathset graphs
T2 - 16th Annual International Conference on Research in Computational Molecular Biology
AU - Pham, Son K.
AU - Antipov, Dmitry
AU - Sirotkin, Alexander
AU - Tesler, Glenn
AU - Pevzner, Pavel A.
AU - Alekseyev, Max A.
PY - 2012
Y1 - 2012
N2 - One of the key advances in genome assembly that has led to a significant improvement in contig lengths has been utilization of paired reads (mate-pairs). While in most assemblers, mate-pair information is used in a post-processing step, the recently proposed Paired de Bruijn Graph (PDBG) approach incorporates the mate-pair information directly in the assembly graph structure. However, the PDBG approach faces difficulties when the variation in the insert sizes is high. To address this problem, we first transform mate-pairs into edge-pair histograms that allow one to better estimate the distance between edges in the assembly graph that represent regions linked by multiple mate-pairs. Further, we combine the ideas of mate-pair transformation and PDBGs to construct new data structures for genome assembly: pathsets and pathset graphs.
AB - One of the key advances in genome assembly that has led to a significant improvement in contig lengths has been utilization of paired reads (mate-pairs). While in most assemblers, mate-pair information is used in a post-processing step, the recently proposed Paired de Bruijn Graph (PDBG) approach incorporates the mate-pair information directly in the assembly graph structure. However, the PDBG approach faces difficulties when the variation in the insert sizes is high. To address this problem, we first transform mate-pairs into edge-pair histograms that allow one to better estimate the distance between edges in the assembly graph that represent regions linked by multiple mate-pairs. Further, we combine the ideas of mate-pair transformation and PDBGs to construct new data structures for genome assembly: pathsets and pathset graphs.
UR - http://www.scopus.com/inward/record.url?scp=84860808270&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-29627-7_21
DO - 10.1007/978-3-642-29627-7_21
M3 - Conference contribution
AN - SCOPUS:84860808270
SN - 9783642296260
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 200
EP - 212
BT - Research in Computational Molecular Biology - 16th Annual International Conference, RECOMB 2012, Proceedings
Y2 - 21 April 2012 through 24 April 2012
ER -
ID: 100630943