Computational approaches for lead compound discovery in dipeptidyl peptidase-4 inhibition using machine learning and molecular dynamics techniques

Sandra De La Torre, Sebastián A. Cuesta, Luis Calle, José R. Mora*, Jose L. Paz, Patricio J. Espinoza-Montero, Máryury Flores-Sumoza, Edgar A. Márquez

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The prediction of possible lead compounds from already-known drugs that may present DPP-4 inhibition activity imply a advantage in the drug development in terms of time and cost to find alternative medicines for the treatment of Type 2 Diabetes Mellitus (T2DM). The inhibition of dipeptidyl peptidase-4 (DPP-4) has been one of the most explored strategies to develop potential drugs against this condition. A diverse dataset of molecules with known experimental inhibitory activity against DPP-4 was constructed and used to develop predictive models using different machine-learning algorithms. Model M36 is the most promising one based on the internal and external performance showing values of Q2 CV = 0.813, and Q2 EXT = 0.803. The applicability domain evaluation and Tropsha's analysis were conducted to validate M36, indicating its robustness and accuracy in predicting pIC50 values for organic molecules within the established domain. The physicochemical properties of the ligands, including electronegativity, polarizability, and van der Waals volume were relevant to predict the inhibition process. The model was then employed in the virtual screening of potential DPP4 inhibitors, finding 448 compounds from the DrugBank and 9 from DiaNat with potential inhibitory activity. Molecular docking and molecular dynamics simulations were used to get insight into the ligand-protein interaction. From the screening and the favorable molecular dynamic results, several compounds including Skimmin (pIC50 = 3.54, Binding energy = −8.86 kcal/mol), bergenin (pIC50 = 2.69, Binding energy = −13.90 kcal/mol), and DB07272 (pIC50 = 3.97, Binding energy = −25.28 kcal/mol) seem to be promising hits to be tested and optimized in the treatment of T2DM. This results imply a important reduction in cost and time on the application of this drugs because all the information about the its metabolism is already available.

Original languageEnglish
Article number108145
JournalComputational Biology and Chemistry
Volume112
DOIs
StatePublished - Oct 2024

Bibliographical note

Publisher Copyright:
© 2024 Elsevier Ltd

Funding

The authors are grateful to the USFQ-POLI grants 2023-2024 for the financial support. The authors have used the high-performance computing (HPC) system available in the USFQ and Uninorte, for the development of this project.

FundersFunder number
Universidad San Francisco de Quito
USFQ-POLI2023-2024

    Keywords

    • Diabetes mellitus
    • Dipeptidyl peptidase-4
    • Drug discovery
    • Drugbank
    • Molecular dynamics

    Fingerprint

    Dive into the research topics of 'Computational approaches for lead compound discovery in dipeptidyl peptidase-4 inhibition using machine learning and molecular dynamics techniques'. Together they form a unique fingerprint.

    Cite this