Research output: Contribution to journal › Article › peer-review
Chromosome-level genome assembly and structural variant analysis of two laboratory yeast strains from the Peterhof Genetic Collection lineage. / Barbitoff, Yury A.; Matveenko, Andrew G.; Matiiv, Anton B.; Maksiutenko, Evgeniia M.; Moskalenko, Svetlana E.; Drozdova, Polina B.; Polev, Dmitrii E.; Beliavskaia, Alexandra Y.; Danilov, Lavrentii G.; Predeus, Alexander V.; Zhouravleva, Galina A.
In: G3: Genes, Genomes, Genetics, Vol. 11, No. 4, jkab029, 04.2021.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - Chromosome-level genome assembly and structural variant analysis of two laboratory yeast strains from the Peterhof Genetic Collection lineage
AU - Barbitoff, Yury A.
AU - Matveenko, Andrew G.
AU - Matiiv, Anton B.
AU - Maksiutenko, Evgeniia M.
AU - Moskalenko, Svetlana E.
AU - Drozdova, Polina B.
AU - Polev, Dmitrii E.
AU - Beliavskaia, Alexandra Y.
AU - Danilov, Lavrentii G.
AU - Predeus, Alexander V.
AU - Zhouravleva, Galina A.
N1 - Publisher Copyright: © The Author(s) 2021.
PY - 2021/4
Y1 - 2021/4
N2 - Thousands of yeast genomes have been sequenced with both traditional and long-read technologies, and multiple observations about modes of genome evolution for both wild and laboratory strains have been drawn from these sequences. In our study, we applied Oxford Nanopore and Illumina technologies to assemble complete genomes of two widely used members of a distinct laboratory yeast lineage, the Peterhof Genetic Collection (PGC), and investigate the structural features of these genomes including transposable element content, copy number alterations, and structural rearrangements. We identified numerous notable structural differences between genomes of PGC strains and the reference S288C strain. We discovered a substantial enrichment of mid-length insertions and deletions within repetitive coding sequences, such as in the SCH9 gene or the NUP100 gene, with possible impact of these variants on protein amyloidogenicity. High contiguity of the final assemblies allowed us to trace back the history of reciprocal unbalanced translocations between chromosomes I, VIII, IX, XI, and XVI of the PGC strains. We show that formation of hybrid alleles of the FLO genes during such chromosomal rearrangements is likely responsible for the lack of invasive growth of yeast strains. Taken together, our results highlight important features of laboratory yeast strain evolution using the power of long-read sequencing.
AB - Thousands of yeast genomes have been sequenced with both traditional and long-read technologies, and multiple observations about modes of genome evolution for both wild and laboratory strains have been drawn from these sequences. In our study, we applied Oxford Nanopore and Illumina technologies to assemble complete genomes of two widely used members of a distinct laboratory yeast lineage, the Peterhof Genetic Collection (PGC), and investigate the structural features of these genomes including transposable element content, copy number alterations, and structural rearrangements. We identified numerous notable structural differences between genomes of PGC strains and the reference S288C strain. We discovered a substantial enrichment of mid-length insertions and deletions within repetitive coding sequences, such as in the SCH9 gene or the NUP100 gene, with possible impact of these variants on protein amyloidogenicity. High contiguity of the final assemblies allowed us to trace back the history of reciprocal unbalanced translocations between chromosomes I, VIII, IX, XI, and XVI of the PGC strains. We show that formation of hybrid alleles of the FLO genes during such chromosomal rearrangements is likely responsible for the lack of invasive growth of yeast strains. Taken together, our results highlight important features of laboratory yeast strain evolution using the power of long-read sequencing.
KW - 74-d694
KW - Flo genes
KW - Long reads
KW - Structural variant
KW - Yeast genome
KW - Laboratories
KW - Saccharomyces cerevisiae/genetics
KW - Sequence Analysis, DNA
KW - DNA Transposable Elements
KW - Protein Serine-Threonine Kinases
KW - Saccharomyces cerevisiae Proteins
KW - High-Throughput Nucleotide Sequencing
KW - Chromosomes
KW - long reads
KW - HSP104
KW - MULTIPLE SEQUENCE ALIGNMENT
KW - SACCHAROMYCES-CEREVISIAE
KW - PREDICTION
KW - 74-D694
KW - MUTANTS
KW - FLO genes
KW - DNA
KW - structural variant
KW - PCR
UR - http://www.scopus.com/inward/record.url?scp=85104899488&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/4c291dcd-257f-3ed3-a3ad-f0478f6cbd26/
U2 - 10.1093/g3journal/jkab029
DO - 10.1093/g3journal/jkab029
M3 - Article
C2 - 33677552
AN - SCOPUS:85104899488
VL - 11
JO - G3: Genes, Genomes, Genetics
JF - G3: Genes, Genomes, Genetics
SN - 2160-1836
IS - 4
M1 - jkab029
ER -
ID: 84957906