Research output: Contribution to journal › Article › peer-review
Machine Learning-Based Virtual Screening and Molecular Simulation Approaches Identified Novel Potential Inhibitors for Cancer Therapy. / Shahab, Muhammad; Zheng, Guojun; Khan, Abbas; Wei, Dongqing; Novikov, Alexander S.
In: Biomedicines, Vol. 11, No. 8, 2251, 11.08.2023.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - Machine Learning-Based Virtual Screening and Molecular Simulation Approaches Identified Novel Potential Inhibitors for Cancer Therapy
AU - Shahab, Muhammad
AU - Zheng, Guojun
AU - Khan, Abbas
AU - Wei, Dongqing
AU - Novikov, Alexander S.
PY - 2023/8/11
Y1 - 2023/8/11
N2 - Cyclin-dependent kinase 2 (CDK2) is a promising target for cancer treatment, developing new effective CDK2 inhibitors is of great significance in anticancer therapy. The involvement of CDK2 in tumorigenesis has been debated, but recent evidence suggests that specifically inhibiting CDK2 could be beneficial in treating certain tumors. This approach remains attractive in the development of anticancer drugs. Several small-molecule inhibitors targeting CDK2 have reached clinical trials, but a selective inhibitor for CDK2 is yet to be discovered. In this study, we conducted machine learning-based drug designing to search for a drug candidate for CDK2. Machine learning models, including k-NN, SVM, RF, and GNB, were created to detect active and inactive inhibitors for a CDK2 drug target. The models were assessed using 10-fold cross-validation to ensure their accuracy and reliability. These methods are highly suitable for classifying compounds as either active or inactive through the virtual screening of extensive compound libraries. Subsequently, machine learning techniques were employed to analyze the test dataset obtained from the zinc database. A total of 25 compounds with 98% accuracy were predicted as active against CDK2. These compounds were docked into CDK2's active site. Finally, three compounds were selected based on good docking score, and, along with a reference compound, underwent MD simulation. The Gaussian naïve Bayes model yielded superior results compared to other models. The top three hits exhibited enhanced stability and compactness compared to the reference compound. In conclusion, our study provides valuable insights for identifying and refining lead compounds as CDK2 inhibitors.
AB - Cyclin-dependent kinase 2 (CDK2) is a promising target for cancer treatment, developing new effective CDK2 inhibitors is of great significance in anticancer therapy. The involvement of CDK2 in tumorigenesis has been debated, but recent evidence suggests that specifically inhibiting CDK2 could be beneficial in treating certain tumors. This approach remains attractive in the development of anticancer drugs. Several small-molecule inhibitors targeting CDK2 have reached clinical trials, but a selective inhibitor for CDK2 is yet to be discovered. In this study, we conducted machine learning-based drug designing to search for a drug candidate for CDK2. Machine learning models, including k-NN, SVM, RF, and GNB, were created to detect active and inactive inhibitors for a CDK2 drug target. The models were assessed using 10-fold cross-validation to ensure their accuracy and reliability. These methods are highly suitable for classifying compounds as either active or inactive through the virtual screening of extensive compound libraries. Subsequently, machine learning techniques were employed to analyze the test dataset obtained from the zinc database. A total of 25 compounds with 98% accuracy were predicted as active against CDK2. These compounds were docked into CDK2's active site. Finally, three compounds were selected based on good docking score, and, along with a reference compound, underwent MD simulation. The Gaussian naïve Bayes model yielded superior results compared to other models. The top three hits exhibited enhanced stability and compactness compared to the reference compound. In conclusion, our study provides valuable insights for identifying and refining lead compounds as CDK2 inhibitors.
KW - CDK2
KW - MD simulation
KW - machine learning
KW - molecular docking
KW - virtual screening
UR - https://www.mendeley.com/catalogue/b23b07f6-33ea-3a3f-bb0c-b0f0d8f7298e/
U2 - 10.3390/biomedicines11082251
DO - 10.3390/biomedicines11082251
M3 - Article
C2 - 37626747
VL - 11
JO - Biomedicines
JF - Biomedicines
SN - 2227-9059
IS - 8
M1 - 2251
ER -
ID: 107983178