• Natalya Yutin
  • Sean Benler
  • Sergei A. Shmakov
  • Yuri I. Wolf
  • Igor Tolstoy
  • Mike Rayko
  • Dmitry Antipov
  • Pavel A. Pevzner
  • Eugene V. Koonin

CrAssphage is the most abundant human-associated virus and the founding member of a large group of bacteriophages, discovered in animal-associated and environmental metagenomes, that infect bacteria of the phylum Bacteroidetes. We analyze 4907 Circular Metagenome Assembled Genomes (cMAGs) of putative viruses from human gut microbiomes and identify nearly 600 genomes of crAss-like phages that account for nearly 87% of the DNA reads mapped to these cMAGs. Phylogenetic analysis of conserved genes demonstrates the monophyly of crAss-like phages, a putative virus order, and of 5 branches, potential families within that order, two of which have not been identified previously. The phage genomes in one of these families are almost twofold larger than the crAssphage genome (145-192 kilobases), with high density of self-splicing introns and inteins. Many crAss-like phages encode suppressor tRNAs that enable read-through of UGA or UAG stop-codons, mostly, in late phage genes. A distinct feature of the crAss-like phages is the recurrent switch of the phage DNA polymerase type between A and B families. Thus, comparative genomic analysis of the expanded assemblage of crAss-like phages reveals aspects of genome architecture and expression as well as phage biology that were not apparent from the previous work on phage genomics.

Original languageEnglish
Article number1044
Number of pages11
JournalNature Communications
Volume12
Issue number1
DOIs
StatePublished - Dec 2021

    Scopus subject areas

  • Physics and Astronomy(all)
  • Chemistry(all)
  • Biochemistry, Genetics and Molecular Biology(all)

    Research areas

  • Bacteriophages/genetics, Codon/genetics, Conserved Sequence, DNA-Directed DNA Polymerase/metabolism, Gastrointestinal Microbiome/genetics, Genome, Viral, Humans, Inteins, Introns/genetics, Metagenome, Open Reading Frames/genetics, Phylogeny, RNA Splicing/genetics, Transcription, Genetic, Virome/genetics, ALIGNMENT, SEQUENCE

ID: 75307668