Comparative Data Analysis of Virtual Screening Methodologies for Predicting Urease Inhibitory Activity

Elizabeth Valdés-Muñoz, Gabriel J. Olguín-Orellana, Sofía E. Ríos-Rozas, Melissa Alegría-Arcos, Natalia Morales, Vicente Rojas-Santander, Javier Farías-Abarca, Jonathan M. Palma, Erix W. Hernández-Rodríguez, Reynier Suardíaz, Daniel Bustos

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

Resumen

Structure-based virtual screening (SBVS) is a fundamental approach in drug discovery, yet its predictive accuracy is highly dependent on methodological choices, scoring functions, and data processing strategies. This study systematically evaluates five protocol variants integrating molecular docking, induced-fit docking (IFD), quantum-polarized ligand docking (QPLD), ensemble docking (ED), and molecular mechanics/generalized Born surface area (MM-GBSA) in Helicobacter pylori urease employing four distinct crystallographic structures obtained from the protein data bank (PDB). We assess their predictive performance using statistical correlation metrics (Spearman and Pearson) and error-based measures (mean absolute error, root-mean-squared error, and inlier ratio metric). Additionally, we investigate the influence of data fusion techniques─minimum, median, arithmetic, geometric, harmonic, and Euclidean means─and varying numbers of docking poses (ranging from 1 to 100) on ligand ranking accuracy. Results indicate that MM-GBSA and ED consistently outperform other methods in compound ranking, although MM-GBSA exhibits higher errors in absolute binding energy predictions. While increasing the number of poses generally reduces predictive accuracy, the minimum fusion approach remains robust across all conditions. Comparisons between IC50and pIC50as experimental reference values reveal that pIC50provides higher Pearson correlations, reinforcing its suitability for affinity prediction, while both metrics perform similarly in Spearman rankings. These findings refine SBVS workflows by optimizing scoring and pose aggregation strategies, highlighting the importance of method selection and data fusion techniques. The proposed framework enhances ligand prioritization in virtual screening campaigns and can be adapted to other therapeutic targets. Future research should explore adaptive scoring frameworks and machine-learning approaches to further improve the SBVS predictive reliability.

Idioma originalInglés
Páginas (desde-hasta)49641-49658
Número de páginas18
PublicaciónACS Omega
Volumen10
N.º42
DOI
EstadoPublicada - 28 oct. 2025

Huella

Profundice en los temas de investigación de 'Comparative Data Analysis of Virtual Screening Methodologies for Predicting Urease Inhibitory Activity'. En conjunto forman una huella única.

Citar esto