Research output: Contribution to journal › Article › peer-review
Implementation of machine learning protocols to predict the hydrolysis reaction properties of organophosphorus substrates using descriptors of electron density topology. / Петрова, Влада Витальевна (Author and editor); Домнин, Антон Владимирович; Порозов, Юрий Борисович; Куляев, Павел Олегович; Соловьев, Ярослав.
In: Journal of Computational Chemistry, Vol. 45, No. 3, 3, 30.01.2024, p. 170-182.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - Implementation of machine learning protocols to predict the hydrolysis reaction properties of organophosphorus substrates using descriptors of electron density topology
AU - Домнин, Антон Владимирович
AU - Порозов, Юрий Борисович
AU - Куляев, Павел Олегович
AU - Соловьев, Ярослав
A2 - Петрова, Влада Витальевна
PY - 2024/1/30
Y1 - 2024/1/30
N2 - Prediction of catalytic reaction efficiency is one of the most intriguing and challenging applications of machine learning (ML) algorithms in chemistry. In this study, we demonstrated a strategy for utilizing ML protocols applied to Quantum Theory of Atoms In Molecules (QTAIM) parameters to predict the ability of the A17 L47K catalytic antibody to covalently capture organophosphate pesticides. We found that the novel “composite” DFT functional B97-3c could be effectively employed for fast and accurate initial geometry optimization, aligning well with the input dataset creation. QTAIM descriptors proved to be well-established in describing the examined dataset using density-based and hierarchical clustering algorithms. The obtained clusters exhibited correlations with the chemical classes of the input compounds. The precise physical interpretation of the QTAIM properties simplifies the explanation of feature impact for both supervised and unsupervised ML protocols. It also enables acceleration in the search for entries with desired properties within large databases. Furthermore, our findings indicated that Ridge Regression with Laplacian kernel and CatBoost Regressor algorithms demonstrated suitable performance in handling small datasets with non-trivial dependencies. They were able to predict the actual reaction barrier values with a high level of accuracy. Additionally, the CatBoost Classifier proved reliable in discriminating between “active” and “inactive” compounds.
AB - Prediction of catalytic reaction efficiency is one of the most intriguing and challenging applications of machine learning (ML) algorithms in chemistry. In this study, we demonstrated a strategy for utilizing ML protocols applied to Quantum Theory of Atoms In Molecules (QTAIM) parameters to predict the ability of the A17 L47K catalytic antibody to covalently capture organophosphate pesticides. We found that the novel “composite” DFT functional B97-3c could be effectively employed for fast and accurate initial geometry optimization, aligning well with the input dataset creation. QTAIM descriptors proved to be well-established in describing the examined dataset using density-based and hierarchical clustering algorithms. The obtained clusters exhibited correlations with the chemical classes of the input compounds. The precise physical interpretation of the QTAIM properties simplifies the explanation of feature impact for both supervised and unsupervised ML protocols. It also enables acceleration in the search for entries with desired properties within large databases. Furthermore, our findings indicated that Ridge Regression with Laplacian kernel and CatBoost Regressor algorithms demonstrated suitable performance in handling small datasets with non-trivial dependencies. They were able to predict the actual reaction barrier values with a high level of accuracy. Additionally, the CatBoost Classifier proved reliable in discriminating between “active” and “inactive” compounds.
KW - Quantum Theory of Atoms In Molecules (QTAIM)
KW - catalysis
KW - catalytic antibodies
KW - computational chemistry
KW - machine learning
UR - https://www.mendeley.com/catalogue/d0b7d3f8-cc00-3e74-b21a-e3b002a985e8/
U2 - 10.1002/jcc.27227
DO - 10.1002/jcc.27227
M3 - Article
VL - 45
SP - 170
EP - 182
JO - Journal of Computational Chemistry
JF - Journal of Computational Chemistry
SN - 0192-8651
IS - 3
M1 - 3
ER -
ID: 108691502