Joint Estimation and Selection of Multiple Graphical Models for Microbiome Data

dc.contributor.advisorEnsor, Katherine Ben_US
dc.contributor.advisorPeterson, Christine Ben_US
dc.creatorRobinson, Sarahen_US
dc.date.accessioned2022-09-23T16:16:26Zen_US
dc.date.available2023-08-01T05:01:11Zen_US
dc.date.created2022-08en_US
dc.date.issued2022-08-12en_US
dc.date.submittedAugust 2022en_US
dc.date.updated2022-09-23T16:16:26Zen_US
dc.description.abstractThe human microbiome, which plays a key role in health and disease, consists of a dynamic community of microorganisms. There is a keen interest in understanding interactions among these microbes, and how these relations change over time. However, current methods for microbiome network inference exist only for a single time point. We propose a novel method to jointly estimate time-varying network associations for microbiome data, which encourages edge similarity across a neighborhood of time points. To account for the compositional constraint and zero-inflation that typify microbiome data sets, we utilize a modified centered-log ratio transformation, then use a truncated Gaussian copula model to estimate the covariance matrices at each time point. We also propose an extension of this method to analyze multi-site or multi-domain microbiome data. We compare the performance of our method to existing alternative approaches on simulated data and apply the proposed method to learn cross-site dynamic networks based on oral and stool microbiome samples collected from leukemic patients during the course of cancer treatment. In the first project, we encountered challenges in selecting reasonably sparse models using traditional model selection criteria. While AIC and BIC, two of the most popular model selection information criteria, attempt to balance model fit and sparsity, selected models still tend to be very dense. Other existing approaches were not well suited to handle the selection of multiple hyperparameters to satisfy multiple objectives. We therefore propose multi-objective optimization to allow the user to filter through the model trade-offs to achieve a more desirable model. In this method, we allow for simultaneous hyperparameter tuning, rather than performing a more traditional grid search. In this project, we focus on its use in the selection of both single and joint graphical models, but we note that this method can be generalized to a wide variety of statistical models where competing objectives, such as sparsity or smoothness and fit, need to be optimized. We demonstrate its use for model selection for both the graphical lasso and the joint graphical lasso.en_US
dc.embargo.terms2023-08-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationRobinson, Sarah. "Joint Estimation and Selection of Multiple Graphical Models for Microbiome Data." (2022) Diss., Rice University. <a href="https://hdl.handle.net/1911/113239">https://hdl.handle.net/1911/113239</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/113239en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectGraphical Modelsen_US
dc.subjectNetwork Inferenceen_US
dc.subjectModel Selectionen_US
dc.subjectMicrobiomeen_US
dc.titleJoint Estimation and Selection of Multiple Graphical Models for Microbiome Dataen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentStatisticsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ROBINSON-DOCUMENT-2022.pdf
Size:
1.01 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: