Standard

MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data. / Шафранская, Дарья Дмитриевна; Kale, Varsha ; Finn, Rob; Лапидус, Алла Львовна; Коробейников, Антон Иванович; Пржибельский, Андрей Дмитриевич.

In: Frontiers in Microbiology, Vol. 13, 981458, 28.10.2022.

Research output: Contribution to journalArticlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{f44bc55b6c13438dbade6f39f080e922,
title = "MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data",
abstract = "While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis.Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT.",
keywords = "metatranscriptomic, metagenomic, computational pipeline, de novo assembly, metagenomics, metatranscriptomics, RNA-Seq",
author = "Шафранская, {Дарья Дмитриевна} and Varsha Kale and Rob Finn and Лапидус, {Алла Львовна} and Коробейников, {Антон Иванович} and Пржибельский, {Андрей Дмитриевич}",
note = "Publisher Copyright: Copyright {\textcopyright} 2022 Shafranskaya, Kale, Finn, Lapidus, Korobeynikov and Prjibelski.",
year = "2022",
month = oct,
day = "28",
doi = "10.3389/fmicb.2022.981458",
language = "English",
volume = "13",
journal = "Frontiers in Microbiology",
issn = "1664-302X",
publisher = "Frontiers Media S.A.",

}

RIS

TY - JOUR

T1 - MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data

AU - Шафранская, Дарья Дмитриевна

AU - Kale, Varsha

AU - Finn, Rob

AU - Лапидус, Алла Львовна

AU - Коробейников, Антон Иванович

AU - Пржибельский, Андрей Дмитриевич

N1 - Publisher Copyright: Copyright © 2022 Shafranskaya, Kale, Finn, Lapidus, Korobeynikov and Prjibelski.

PY - 2022/10/28

Y1 - 2022/10/28

N2 - While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis.Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT.

AB - While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis.Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT.

KW - metatranscriptomic

KW - metagenomic

KW - computational pipeline

KW - de novo assembly

KW - metagenomics

KW - metatranscriptomics

KW - RNA-Seq

UR - https://www.mendeley.com/catalogue/19f9cf6a-7a65-3963-9c0a-ea1e923a95b7/

UR - http://www.scopus.com/inward/record.url?scp=85141993146&partnerID=8YFLogxK

U2 - 10.3389/fmicb.2022.981458

DO - 10.3389/fmicb.2022.981458

M3 - Article

C2 - 36386613

VL - 13

JO - Frontiers in Microbiology

JF - Frontiers in Microbiology

SN - 1664-302X

M1 - 981458

ER -

ID: 99745058