Critical Assessment of Metagenome Interpretation --- a benchmark of metagenomics software

Алексей Александрович Гуревич

Результат исследований: Научные публикации в периодических изданияхстатья

113 Цитирования (Scopus)

Выдержка

Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.
Язык оригиналаанглийский
Страницы (с-по)1063–1071
Число страниц9
ЖурналNature Methods
Том14
DOI
СостояниеОпубликовано - 2 окт 2017

Отпечаток

Metagenome
Benchmarking
Metagenomics
Software
Genes
Viruses
Genome
Microorganisms
Plasmids
Research

Цитировать

Гуревич, Алексей Александрович. / Critical Assessment of Metagenome Interpretation --- a benchmark of metagenomics software. В: Nature Methods. 2017 ; Том 14. стр. 1063–1071.
@article{0e909d72188c42399cf3ba987f4fb5d6,
title = "Critical Assessment of Metagenome Interpretation --- a benchmark of metagenomics software",
abstract = "Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.",
author = "Гуревич, {Алексей Александрович}",
year = "2017",
month = "10",
day = "2",
doi = "10.1038/nmeth.4458",
language = "English",
volume = "14",
pages = "1063–1071",
journal = "Nature Methods",
issn = "1548-7091",
publisher = "Nature Publishing Group",

}

Critical Assessment of Metagenome Interpretation --- a benchmark of metagenomics software. / Гуревич, Алексей Александрович.

В: Nature Methods, Том 14, 02.10.2017, стр. 1063–1071.

Результат исследований: Научные публикации в периодических изданияхстатья

TY - JOUR

T1 - Critical Assessment of Metagenome Interpretation --- a benchmark of metagenomics software

AU - Гуревич, Алексей Александрович

PY - 2017/10/2

Y1 - 2017/10/2

N2 - Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.

AB - Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions.

U2 - 10.1038/nmeth.4458

DO - 10.1038/nmeth.4458

M3 - Article

VL - 14

SP - 1063

EP - 1071

JO - Nature Methods

JF - Nature Methods

SN - 1548-7091

ER -