Standard

Joint Analysis of Long and Short Reads Enables Accurate Estimates of Microbiome Complexity. / Bankevich, Anton; Pevzner, Pavel A.

в: Cell Systems, Том 7, № 2, 22.08.2018, стр. 192-200.e3.

Результаты исследований: Научные публикации в периодических изданияхстатьяРецензирование

Harvard

APA

Vancouver

Author

Bankevich, Anton ; Pevzner, Pavel A. / Joint Analysis of Long and Short Reads Enables Accurate Estimates of Microbiome Complexity. в: Cell Systems. 2018 ; Том 7, № 2. стр. 192-200.e3.

BibTeX

@article{478061651a3641f8b64336c439153e21,
title = "Joint Analysis of Long and Short Reads Enables Accurate Estimates of Microbiome Complexity",
abstract = "Reduced microbiome diversity has been linked to several diseases. However, estimating the diversity of bacterial communities—the number and the total length of distinct genomes within a metagenome—remains an open problem in microbial ecology. Here, we describe an algorithm for estimating the microbial diversity in a metagenomic sample based on a joint analysis of short and long reads. Unlike previous approaches, the algorithm does not make any assumptions on the distribution of the frequencies of genomes within a metagenome (as in parametric methods) and does not require a large database that covers the total diversity (as in non-parametric methods). We estimate that genomes comprising a human gut metagenome have total length varying from 1.3 to 3.5 billion nucleotides, with genomes responsible for 50% of total abundance having total length varying from only 25 to 61 million nucleotides. In contrast, genomes comprising an aquifer sediment metagenome have more than two orders of magnitude larger total length (≈840 billion nucleotides). We present a method for estimating the diversity of metagenomic samples that combines short and long sequencing reads. We show that our method is capable of capturing rare species and apply it to analyze diversity of the human gut and aquifer sediment metagenomes.",
keywords = "metagenomics, microbal diversity, rare species, BACTERIAL DIVERSITY, COVERAGE, DNA, GUT MICROBIOTA, RARE BIOSPHERE, ABUNDANCE",
author = "Anton Bankevich and Pevzner, {Pavel A.}",
year = "2018",
month = aug,
day = "22",
doi = "10.1016/j.cels.2018.06.009",
language = "English",
volume = "7",
pages = "192--200.e3",
journal = "Cell Systems",
issn = "2405-4712",
publisher = "Cell Press",
number = "2",

}

RIS

TY - JOUR

T1 - Joint Analysis of Long and Short Reads Enables Accurate Estimates of Microbiome Complexity

AU - Bankevich, Anton

AU - Pevzner, Pavel A.

PY - 2018/8/22

Y1 - 2018/8/22

N2 - Reduced microbiome diversity has been linked to several diseases. However, estimating the diversity of bacterial communities—the number and the total length of distinct genomes within a metagenome—remains an open problem in microbial ecology. Here, we describe an algorithm for estimating the microbial diversity in a metagenomic sample based on a joint analysis of short and long reads. Unlike previous approaches, the algorithm does not make any assumptions on the distribution of the frequencies of genomes within a metagenome (as in parametric methods) and does not require a large database that covers the total diversity (as in non-parametric methods). We estimate that genomes comprising a human gut metagenome have total length varying from 1.3 to 3.5 billion nucleotides, with genomes responsible for 50% of total abundance having total length varying from only 25 to 61 million nucleotides. In contrast, genomes comprising an aquifer sediment metagenome have more than two orders of magnitude larger total length (≈840 billion nucleotides). We present a method for estimating the diversity of metagenomic samples that combines short and long sequencing reads. We show that our method is capable of capturing rare species and apply it to analyze diversity of the human gut and aquifer sediment metagenomes.

AB - Reduced microbiome diversity has been linked to several diseases. However, estimating the diversity of bacterial communities—the number and the total length of distinct genomes within a metagenome—remains an open problem in microbial ecology. Here, we describe an algorithm for estimating the microbial diversity in a metagenomic sample based on a joint analysis of short and long reads. Unlike previous approaches, the algorithm does not make any assumptions on the distribution of the frequencies of genomes within a metagenome (as in parametric methods) and does not require a large database that covers the total diversity (as in non-parametric methods). We estimate that genomes comprising a human gut metagenome have total length varying from 1.3 to 3.5 billion nucleotides, with genomes responsible for 50% of total abundance having total length varying from only 25 to 61 million nucleotides. In contrast, genomes comprising an aquifer sediment metagenome have more than two orders of magnitude larger total length (≈840 billion nucleotides). We present a method for estimating the diversity of metagenomic samples that combines short and long sequencing reads. We show that our method is capable of capturing rare species and apply it to analyze diversity of the human gut and aquifer sediment metagenomes.

KW - metagenomics

KW - microbal diversity

KW - rare species

KW - BACTERIAL DIVERSITY

KW - COVERAGE

KW - DNA

KW - GUT MICROBIOTA

KW - RARE BIOSPHERE

KW - ABUNDANCE

UR - http://www.scopus.com/inward/record.url?scp=85053854176&partnerID=8YFLogxK

U2 - 10.1016/j.cels.2018.06.009

DO - 10.1016/j.cels.2018.06.009

M3 - Article

AN - SCOPUS:85053854176

VL - 7

SP - 192-200.e3

JO - Cell Systems

JF - Cell Systems

SN - 2405-4712

IS - 2

ER -

ID: 36146981