Standard

Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. / Bankevich, Anton; Bzikadze, Andrey V.; Kolmogorov, Mikhail; Antipov, Dmitry; Pevzner, Pavel A.

в: Nature Biotechnology, Том 40, № 7, 07.2022, стр. 1075-1081.

Результаты исследований: Научные публикации в периодических изданияхстатьяРецензирование

Harvard

Bankevich, A, Bzikadze, AV, Kolmogorov, M, Antipov, D & Pevzner, PA 2022, 'Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads', Nature Biotechnology, Том. 40, № 7, стр. 1075-1081. https://doi.org/10.1038/s41587-022-01220-6

APA

Bankevich, A., Bzikadze, A. V., Kolmogorov, M., Antipov, D., & Pevzner, P. A. (2022). Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. Nature Biotechnology, 40(7), 1075-1081. https://doi.org/10.1038/s41587-022-01220-6

Vancouver

Bankevich A, Bzikadze AV, Kolmogorov M, Antipov D, Pevzner PA. Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. Nature Biotechnology. 2022 Июль;40(7):1075-1081. https://doi.org/10.1038/s41587-022-01220-6

Author

Bankevich, Anton ; Bzikadze, Andrey V. ; Kolmogorov, Mikhail ; Antipov, Dmitry ; Pevzner, Pavel A. / Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. в: Nature Biotechnology. 2022 ; Том 40, № 7. стр. 1075-1081.

BibTeX

@article{29422777281c4db3a725aface205c297,
title = "Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads",
abstract = "Although most existing genome assemblers are based on de Bruijn graphs, the construction of these graphs for large genomes and large k-mer sizes has remained elusive. This algorithmic challenge has become particularly pressing with the emergence of long, high-fidelity (HiFi) reads that have been recently used to generate a semi-manual telomere-to-telomere assembly of the human genome. To enable automated assemblies of long, HiFi reads, we present the La Jolla Assembler (LJA), a fast algorithm using the Bloom filter, sparse de Bruijn graphs and disjointig generation. LJA reduces the error rate in HiFi reads by three orders of magnitude, constructs the de Bruijn graph for large genomes and large k-mer sizes and transforms it into a multiplex de Bruijn graph with varying k-mer sizes. Compared to state-of-the-art assemblers, our algorithm not only achieves five-fold fewer misassemblies but also generates more contiguous assemblies. We demonstrate the utility of LJA via the automated assembly of a human genome that completely assembled six chromosomes.",
keywords = "Algorithms, Genome, Human/genetics, High-Throughput Nucleotide Sequencing, Humans, Sequence Analysis, DNA, Software",
author = "Anton Bankevich and Bzikadze, {Andrey V.} and Mikhail Kolmogorov and Dmitry Antipov and Pevzner, {Pavel A.}",
note = "Bankevich, A., Bzikadze, A.V., Kolmogorov, M. et al. Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. Nat Biotechnol 40, 1075–1081 (2022). https://doi.org/10.1038/s41587-022-01220-6",
year = "2022",
month = jul,
doi = "10.1038/s41587-022-01220-6",
language = "English",
volume = "40",
pages = "1075--1081",
journal = "Nature Biotechnology",
issn = "1087-0156",
publisher = "Nature Publishing Group",
number = "7",

}

RIS

TY - JOUR

T1 - Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads

AU - Bankevich, Anton

AU - Bzikadze, Andrey V.

AU - Kolmogorov, Mikhail

AU - Antipov, Dmitry

AU - Pevzner, Pavel A.

N1 - Bankevich, A., Bzikadze, A.V., Kolmogorov, M. et al. Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads. Nat Biotechnol 40, 1075–1081 (2022). https://doi.org/10.1038/s41587-022-01220-6

PY - 2022/7

Y1 - 2022/7

N2 - Although most existing genome assemblers are based on de Bruijn graphs, the construction of these graphs for large genomes and large k-mer sizes has remained elusive. This algorithmic challenge has become particularly pressing with the emergence of long, high-fidelity (HiFi) reads that have been recently used to generate a semi-manual telomere-to-telomere assembly of the human genome. To enable automated assemblies of long, HiFi reads, we present the La Jolla Assembler (LJA), a fast algorithm using the Bloom filter, sparse de Bruijn graphs and disjointig generation. LJA reduces the error rate in HiFi reads by three orders of magnitude, constructs the de Bruijn graph for large genomes and large k-mer sizes and transforms it into a multiplex de Bruijn graph with varying k-mer sizes. Compared to state-of-the-art assemblers, our algorithm not only achieves five-fold fewer misassemblies but also generates more contiguous assemblies. We demonstrate the utility of LJA via the automated assembly of a human genome that completely assembled six chromosomes.

AB - Although most existing genome assemblers are based on de Bruijn graphs, the construction of these graphs for large genomes and large k-mer sizes has remained elusive. This algorithmic challenge has become particularly pressing with the emergence of long, high-fidelity (HiFi) reads that have been recently used to generate a semi-manual telomere-to-telomere assembly of the human genome. To enable automated assemblies of long, HiFi reads, we present the La Jolla Assembler (LJA), a fast algorithm using the Bloom filter, sparse de Bruijn graphs and disjointig generation. LJA reduces the error rate in HiFi reads by three orders of magnitude, constructs the de Bruijn graph for large genomes and large k-mer sizes and transforms it into a multiplex de Bruijn graph with varying k-mer sizes. Compared to state-of-the-art assemblers, our algorithm not only achieves five-fold fewer misassemblies but also generates more contiguous assemblies. We demonstrate the utility of LJA via the automated assembly of a human genome that completely assembled six chromosomes.

KW - Algorithms

KW - Genome, Human/genetics

KW - High-Throughput Nucleotide Sequencing

KW - Humans

KW - Sequence Analysis, DNA

KW - Software

UR - http://www.scopus.com/inward/record.url?scp=85125338574&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/2162a4e9-8cc4-3bc0-a3f2-3945dcbe842d/

U2 - 10.1038/s41587-022-01220-6

DO - 10.1038/s41587-022-01220-6

M3 - Article

C2 - 35228706

AN - SCOPUS:85125338574

VL - 40

SP - 1075

EP - 1081

JO - Nature Biotechnology

JF - Nature Biotechnology

SN - 1087-0156

IS - 7

ER -

ID: 100863933