Adapting learning and search algorithms to handle protein structural data with the goal of aiding drug discovery

dc.contributor.advisorKavraki, Lydia Een_US
dc.creatorConev, Anjaen_US
dc.date.accessioned2025-01-16T20:16:07Zen_US
dc.date.created2024-12en_US
dc.date.issued2024-09-16en_US
dc.date.submittedDecember 2024en_US
dc.date.updated2025-01-16T20:16:07Zen_US
dc.description.abstractExperimental methods for protein structure determination (e.g., x-ray crystallography, NMR, cryoEM) require access to expensive equipment and are not scalable. Computational methods assist protein structure prediction and analysis on a far larger scale. Recent deep learning advances, the most notable being DeepMind’s AlphaFold2.0 release in 2021, have provided a wealth of structural data for further analysis and open new opportunities for algorithmic development. In my work, I address three different tasks that make use of the available protein structure data: (1) system-specific binding-affinity prediction (in the context of the immune-related peptide-HLA system); (2) generation of representative ensembles from generic protein structure datasets; (3) protein-ligand ensemble docking. To this end, I examine and adapt a range of algorithms including random forest regression models, unsupervised learning methods and stochastic global optimization techniques. I validate the resulting pipelines on available experimental data and apply them to different macromolecular contexts such as the immune-related formation of the peptide-HLA complex; flexibility of the signal transducer PI3K lipid kinase; CDK2 protein kinase and estrogen receptor α. Developed pipelines are open source and freely available and can help guide the search for novel therapeutics.en_US
dc.embargo.lift2025-06-01en_US
dc.embargo.terms2025-06-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://hdl.handle.net/1911/118187en_US
dc.language.isoenen_US
dc.subjectProtein structureen_US
dc.subjectmachine learningen_US
dc.subjectunsupervised learningen_US
dc.subjectpeptide-HLAen_US
dc.subjectmolecular dockingen_US
dc.titleAdapting learning and search algorithms to handle protein structural data with the goal of aiding drug discoveryen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineComputer Scienceen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.98 KB
Format:
Plain Text
Description: