Sequence-based and structure-based methods for studying the adaptive immune response

dc.contributor.advisorKavraki, Lydia E.en_US
dc.creatorFasoulis, Romanosen_US
dc.date.accessioned2025-01-16T19:33:29Zen_US
dc.date.created2024-12en_US
dc.date.issued2024-08-23en_US
dc.date.submittedDecember 2024en_US
dc.date.updated2025-01-16T19:33:29Zen_US
dc.description.abstractThe adaptive immune system comprises various biological mechanisms that, in unison, protect an organism against various threats, such as pathogens, viral infections, and tumor cells. One of such mechanisms involves the binding of intracellular protein fragments called peptides to class-I Major Histocompability Complexes (MHCs). The formed peptide-MHC (pMHC) complex is presented to the surface of the cell, where it interacts with the T-cell receptor, an interaction that can elicit an immune response. Knowing which peptides bind to MHCs, which peptides are presented to the surface of the cell, and which peptides elicit an immune response is crucial for successful clinical applications and therapies. Due to the advent of mass spectrometry resulting in high-throughput generation of amino acid sequence-based pMHC binding data, amino acid sequence-based Machine Learning (ML) approaches have dominated the field, showing immense potential. At the same time however, it is known that the pMHC interaction is characterized by a strong structural component that is shown to be extremely important in fully explaining pMHC binding and peptide immunogenicity. This thesis presents methodologies that attend to both the amino acid sequence component and the structural component of the pMHC interaction. Focusing on the sequence component first, we present TLStab and TLImm, two ML-based tools that predict peptide kinetic stability and peptide immunogenicity respectively. Developed through adopting transfer learning methodologies, TLStab and TLImm outperform state-of-the-art approaches in their respective tasks. Next, focusing on the structural component, we present APE-Gen2.0, a new pMHC structural modeling tool. APE-Gen2.0 outperforms other approaches in the literature in regard to modeling accuracy. It also expands the pMHC structural modeling repertoire to peptides exhibiting post-translational modifications, as well as peptides that assume non-canonical geometries in the MHC binding cleft. Finally, we present RankMHC, a novel, Learning to Rank-based pMHC binding mode identification tool. RankMHC outperforms both classical protein-ligand scoring functions and pMHC-specific scoring functions in predicting the most representative peptide conformation among an ensemble of conformations. Overall, acknowledging the potential of both pMHC sequence and pMHC structure information, our work expands on both areas, through novel and effective computational contributions.en_US
dc.embargo.lift2025-12-01en_US
dc.embargo.terms2025-12-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://hdl.handle.net/1911/118168en_US
dc.language.isoenen_US
dc.subjectpeptide-MHCen_US
dc.subjectMachine learningen_US
dc.subjectpeptide-MHC binding affinityen_US
dc.subjectpeptide-MHC structural modelingen_US
dc.titleSequence-based and structure-based methods for studying the adaptive immune responseen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineComputer Scienceen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.85 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.98 KB
Format:
Plain Text
Description: