Universitat de Barcelona. Facultat de Biologia
The incremental application of supercomputers to offer solutions to complex problems has motivated the usage of computational modeling tools in drug design pipelines. Specifically, small low-affinity compounds are modified by including multiple decorators in the hit-to-lead phase to obtain more potent compounds. Techniques such as docking provide quick answers on classifying millions of candidates, differentiating active from inactive, but their accuracies tend to drop when ranking ligand’s potencies. Expensive methods such as FEP are more precise; however, the time consumption limitate their application in hit-to-lead campaigns. This thesis aims to implement and test novel methodologies in a mid computation and accuracy term, focused on facing hit-to-lead stages. We developed FragPELE, a novel ligand growing method integrated with PELE, an unconventional Monte Carlo sampling algorithm. FragPELE introduces a new concept of progressively expanding a small atom-sized moiety of atoms (fragment) within PELE simulations, adapting the protein binding site to the newly grown R-group. Structural and scoring benchmarks remarked accurate geometrical predictions and correlation with relative free energies, with a reasonable consumption of time and resources. Besides, we combined FragPELE with the recently developed aquaPELE algorithm to expand fragments on hydrated binding sites. Results stressed improved accuracies when introducing the mixed implicit/explicit solvent models integrated within aquaPELE. Additionally, we participated in a collaborative project with Almirall. We assessed our FragPELE tool in two prospective hit-to-lead studies. One of them ended with synthesizing an improved version of the initial hit. On the other, the method showed good predictive power in classifying non-terminal R-groups on 27 new compounds (not reported in the literature). Finally, we optimized virtual screening pipelines by integrating machine learning analysis with simulated data, training, testing, and validating the designed classification models with external experimental sets. From 785 compounds, Almirall purchased 23 based on our results. Two of them showed inhibition of the target, one in the nM range of activity.
Disseny de medicaments; Diseño de medicamentos; Drug design; Bioinformàtica; Bioinformática; Bioinformatics; Aprenentatge automàtic; Aprendizaje automático; Machine learning
577 - Bioquímica. Biologia molecular. Biofísica
Ciències Experimentals i Matemàtiques
Programa de Doctorat en Biomedicina / Tesi realitzada al Centre de Supercomputació de Barcelona (BSC)
Facultat de Biologia [236]