Joint Estimation and Selection of Multiple Graphical Models for Microbiome Data

dc.contributor.advisorEnsor, Katherine B
dc.contributor.advisorPeterson, Christine B
dc.creatorRobinson, Sarah
dc.date.accessioned2022-09-23T16:16:26Z
dc.date.available2023-08-01T05:01:11Z
dc.date.created2022-08
dc.date.issued2022-08-12
dc.date.submittedAugust 2022
dc.date.updated2022-09-23T16:16:26Z
dc.description.abstractThe human microbiome, which plays a key role in health and disease, consists of a dynamic community of microorganisms. There is a keen interest in understanding interactions among these microbes, and how these relations change over time. However, current methods for microbiome network inference exist only for a single time point. We propose a novel method to jointly estimate time-varying network associations for microbiome data, which encourages edge similarity across a neighborhood of time points. To account for the compositional constraint and zero-inflation that typify microbiome data sets, we utilize a modified centered-log ratio transformation, then use a truncated Gaussian copula model to estimate the covariance matrices at each time point. We also propose an extension of this method to analyze multi-site or multi-domain microbiome data. We compare the performance of our method to existing alternative approaches on simulated data and apply the proposed method to learn cross-site dynamic networks based on oral and stool microbiome samples collected from leukemic patients during the course of cancer treatment. In the first project, we encountered challenges in selecting reasonably sparse models using traditional model selection criteria. While AIC and BIC, two of the most popular model selection information criteria, attempt to balance model fit and sparsity, selected models still tend to be very dense. Other existing approaches were not well suited to handle the selection of multiple hyperparameters to satisfy multiple objectives. We therefore propose multi-objective optimization to allow the user to filter through the model trade-offs to achieve a more desirable model. In this method, we allow for simultaneous hyperparameter tuning, rather than performing a more traditional grid search. In this project, we focus on its use in the selection of both single and joint graphical models, but we note that this method can be generalized to a wide variety of statistical models where competing objectives, such as sparsity or smoothness and fit, need to be optimized. We demonstrate its use for model selection for both the graphical lasso and the joint graphical lasso.
dc.embargo.terms2023-08-01
dc.format.mimetypeapplication/pdf
dc.identifier.citationRobinson, Sarah. "Joint Estimation and Selection of Multiple Graphical Models for Microbiome Data." (2022) Diss., Rice University. <a href="https://hdl.handle.net/1911/113239">https://hdl.handle.net/1911/113239</a>.
dc.identifier.urihttps://hdl.handle.net/1911/113239
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectGraphical Models
dc.subjectNetwork Inference
dc.subjectModel Selection
dc.subjectMicrobiome
dc.titleJoint Estimation and Selection of Multiple Graphical Models for Microbiome Data
dc.typeThesis
dc.type.materialText
thesis.degree.departmentStatistics
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ROBINSON-DOCUMENT-2022.pdf
Size:
1.01 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: