Standard

Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning. / Пьянков, Иван Алексеевич; Кокорина, Марина; Рычков, Георгий Николаевич; Костарева, А. А.; Успенская, Майя Валерьевна; Каява, Андрей Вилхович.

In: Journal of Structural Biology, Vol. 218, No. 2, 108307, 02.06.2026.

Research output: Contribution to journalArticlepeer-review

Harvard

Пьянков, ИА, Кокорина, М, Рычков, ГН, Костарева, АА, Успенская, МВ & Каява, АВ 2026, 'Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning', Journal of Structural Biology, vol. 218, no. 2, 108307. https://doi.org/10.1016/j.jsb.2026.108307

APA

Пьянков, И. А., Кокорина, М., Рычков, Г. Н., Костарева, А. А., Успенская, М. В., & Каява, А. В. (2026). Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning. Journal of Structural Biology, 218(2), [108307]. https://doi.org/10.1016/j.jsb.2026.108307

Vancouver

Пьянков ИА, Кокорина М, Рычков ГН, Костарева АА, Успенская МВ, Каява АВ. Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning. Journal of Structural Biology. 2026 Jun 2;218(2). 108307. https://doi.org/10.1016/j.jsb.2026.108307

Author

Пьянков, Иван Алексеевич ; Кокорина, Марина ; Рычков, Георгий Николаевич ; Костарева, А. А. ; Успенская, Майя Валерьевна ; Каява, Андрей Вилхович. / Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning. In: Journal of Structural Biology. 2026 ; Vol. 218, No. 2.

BibTeX

@article{4b94faad91474ffda60e92ec8ac2b126,
title = "Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning",
abstract = "As genome and gene sequencing rapidly expand, data increasingly outpace studies linking genetic variants to specific diseases, making computational methods for associating potential mutations with pathology both essential and feasible. We found that disease-causing variants associated with Myosin Storage Myopathy (MSM) generally destabilize the MYH7 α-helical coiled-coil domain more than non-disease-associated variants, and structural mapping revealed that pathogenic variants cluster in locally unwound regions of the coiled-coil dimer, suggesting that changes in these strained sites may promote dimer destabilization and aggregation. However, these features alone are insufficient to reliably predict hereditary Myosin Storage Myopathy. By integrating protein aggregation, structural stability, and additional informative features, we developed RDSM-MYH7, a machine learning-based predictor for assessing the pathogenicity of missense mutations in the MYH7 rod domain. RDSM-MYH7 achieved superior performance (F1 = 0.869, accuracy = 0.875), compared to existing tools, and can be applied to individual gene sequencing data to identify pathogenic MYH7-variants associated with storage myopathy. Its implementation in clinical screening could facilitate early diagnosis of myopathies and other hereditary protein storage diseases, in which protein unfolding precedes pathological aggregation.",
author = "Пьянков, {Иван Алексеевич} and Марина Кокорина and Рычков, {Георгий Николаевич} and Костарева, {А. А.} and Успенская, {Майя Валерьевна} and Каява, {Андрей Вилхович}",
year = "2026",
month = mar,
day = "2",
doi = "10.1016/j.jsb.2026.108307",
language = "English",
volume = "218",
journal = "Journal of Structural Biology",
issn = "1047-8477",
publisher = "Elsevier",
number = "2",

}

RIS

TY - JOUR

T1 - Linking protein aggregation and structural stability to predict pathogenic MYH7 variants via machine learning

AU - Пьянков, Иван Алексеевич

AU - Кокорина, Марина

AU - Рычков, Георгий Николаевич

AU - Костарева, А. А.

AU - Успенская, Майя Валерьевна

AU - Каява, Андрей Вилхович

PY - 2026/3/2

Y1 - 2026/3/2

N2 - As genome and gene sequencing rapidly expand, data increasingly outpace studies linking genetic variants to specific diseases, making computational methods for associating potential mutations with pathology both essential and feasible. We found that disease-causing variants associated with Myosin Storage Myopathy (MSM) generally destabilize the MYH7 α-helical coiled-coil domain more than non-disease-associated variants, and structural mapping revealed that pathogenic variants cluster in locally unwound regions of the coiled-coil dimer, suggesting that changes in these strained sites may promote dimer destabilization and aggregation. However, these features alone are insufficient to reliably predict hereditary Myosin Storage Myopathy. By integrating protein aggregation, structural stability, and additional informative features, we developed RDSM-MYH7, a machine learning-based predictor for assessing the pathogenicity of missense mutations in the MYH7 rod domain. RDSM-MYH7 achieved superior performance (F1 = 0.869, accuracy = 0.875), compared to existing tools, and can be applied to individual gene sequencing data to identify pathogenic MYH7-variants associated with storage myopathy. Its implementation in clinical screening could facilitate early diagnosis of myopathies and other hereditary protein storage diseases, in which protein unfolding precedes pathological aggregation.

AB - As genome and gene sequencing rapidly expand, data increasingly outpace studies linking genetic variants to specific diseases, making computational methods for associating potential mutations with pathology both essential and feasible. We found that disease-causing variants associated with Myosin Storage Myopathy (MSM) generally destabilize the MYH7 α-helical coiled-coil domain more than non-disease-associated variants, and structural mapping revealed that pathogenic variants cluster in locally unwound regions of the coiled-coil dimer, suggesting that changes in these strained sites may promote dimer destabilization and aggregation. However, these features alone are insufficient to reliably predict hereditary Myosin Storage Myopathy. By integrating protein aggregation, structural stability, and additional informative features, we developed RDSM-MYH7, a machine learning-based predictor for assessing the pathogenicity of missense mutations in the MYH7 rod domain. RDSM-MYH7 achieved superior performance (F1 = 0.869, accuracy = 0.875), compared to existing tools, and can be applied to individual gene sequencing data to identify pathogenic MYH7-variants associated with storage myopathy. Its implementation in clinical screening could facilitate early diagnosis of myopathies and other hereditary protein storage diseases, in which protein unfolding precedes pathological aggregation.

UR - https://linkinghub.elsevier.com/retrieve/pii/S1047847726000237

UR - https://www.mendeley.com/catalogue/f7a25961-4457-3108-be3d-8ad8442cefd4/

U2 - 10.1016/j.jsb.2026.108307

DO - 10.1016/j.jsb.2026.108307

M3 - Article

C2 - 41780808

VL - 218

JO - Journal of Structural Biology

JF - Journal of Structural Biology

SN - 1047-8477

IS - 2

M1 - 108307

ER -

ID: 150622171