Bayesian graphical models for complex biological networks

dc.contributor.advisorVannucci, Marina
dc.contributor.committeeMemberStingo, Francesco C
dc.creatorNi, Yang
dc.date.accessioned2020-02-05T15:07:19Z
dc.date.available2020-02-05T15:07:19Z
dc.date.created2015-12
dc.date.issued2015-12-04
dc.date.submittedDecember 2015
dc.date.updated2020-02-05T15:07:19Z
dc.description.abstractIn this thesis, we propose novel Bayesian methodologies in estimating graphical models from complex genomic/health data, for which traditional methods are often found to be inefficient and unsuitable. Our approaches are motivated by various applications including construction of non-linear gene regulatory networks, data integration, cancer surveillance and precision medicine. This thesis consists of three projects. First, we develop a novel semi/non-parametric directed acyclic graphical model to reconstruct gene regulatory network from cancer gene expression data. The regulatory relationship between genes is assumed to be sparse and is allowed to be nonlinear, which is modeled by penalized splines with a spike-and-slab selection prior. We impose a discrete mixture prior on the smoothing parameter of the splines so that we are able to distinguish between linear and nonlinear relationships. Simulation studies show good performance of our approach in comparison with competing methods. Application to GBM data reveals several interesting findings. Second, we propose a multi-dimensional graphical model based on Cholesky-type decomposition of precision matrices to study the conditional independences of multi-dimensional data that are constituted by measurements along multiple axes. Our proposed approach is a unified framework applicable to both directed and undirected graphs as well as arbitrary combinations of these. We develop efficient sampling algorithm based on partially collapsed Gibbs samplers. Simulation studies show that our method has favorable performance against both benchmark and state-of-the-art approaches. We apply our approach to ovarian cancer protein expression data and U.S. cancer mortality data. Third, we propose a novel class of graphical models, graphical regression, which allow graph structure to vary with additional covariates in a flexible fashion. We impose sparsity in both graph structure and covariates. Our approach produces subject-specific graph and predictive graph for new subject. We provide theoretical property and demonstrate the good performance of our method through simulation studies. Finally, we apply our approach to multiple myeloma gene expression data taking prognostic factors as covariates, which reveals several interesting findings.
dc.format.mimetypeapplication/pdf
dc.identifier.citationNi, Yang. "Bayesian graphical models for complex biological networks." (2015) Diss., Rice University. <a href="https://hdl.handle.net/1911/108000">https://hdl.handle.net/1911/108000</a>.
dc.identifier.urihttps://hdl.handle.net/1911/108000
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectDirected acyclic graph
dc.subjectDecomposable and non-decomposable graphs
dc.subjectP-splines
dc.subjectLDL decomposition
dc.subjectNon-local prior
dc.subjectPredictive network.
dc.titleBayesian graphical models for complex biological networks
dc.typeThesis
dc.type.materialText
thesis.degree.departmentStatistics
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
NI-DOCUMENT-2015.pdf
Size:
5.94 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: