Association Studies in Human Cancers: Metabolic Expression Subtypes and Somatic Mutations/Germline Variations

dc.contributor.advisorScott, David W
dc.contributor.advisorLiang, Han
dc.contributor.advisorWei, Peng
dc.creatorChen, Zhongyuan
dc.date.accessioned2019-08-01T20:35:56Z
dc.date.available2020-08-01T05:01:08Z
dc.date.created2019-08
dc.date.issued2019-07-29
dc.date.submittedAugust 2019
dc.date.updated2019-08-01T20:35:56Z
dc.description.abstractCancer is a highly complex genetic disease caused by certain gene mutations. This thesis focuses on two critical categories of association studies for human cancers: associations between tumor metabolic subtypes and various other cancer aspects, and associations between somatic mutations and germline variations. In the first category, we classify metabolic expression subtypes in multiple TCGA (the Cancer Genome Atlas) cancer types, identify consistent prognostic patterns, and analyze master regulators of metabolic subtypes. We apply various statistical methods to study the associations between the metabolic expression subtypes and patients' survival, somatic mutations, copy number variations, and hallmark pathways. The results show that the metabolic expression subtypes are extensively correlated with patients' survival. The work gives a systematic view of metabolic heterogeneity and indicates the values of metabolic expression subtypes as predictive, prognostic, and therapeutic markers. In the second category, we design data-adaptive and pathway-based large-sample score test methods for association studies between somatic mutations and germline variations. A combination of multiple statistical techniques is used. Extensive information aggregation at both SNP and gene levels is involved. p-values from different parameters are combined to yield data-adaptive tests for somatic mutations and germline variations. To avoid using too many parameters so as to reduce costs, a randomized low-rank parameter preselection strategy is proposed to predict parameters that are likely more effective. In comparison with some commonly used methods, our data-adaptive somatic mutations/germline variations test methods are much more flexible, can apply to multiple germline SNPs/genes/pathways, and generally have much higher statistical powers. The test models are applied to both simulations and real-world ICGC (International Cancer Genome Consortium) datasets. For the ICGC data, a sequence of filtering, screening, and processing techniques is applied, followed by extensive association studies with our models. Our studies systematically identify the associations between various germline variations and somatic mutations across different cancer types. Our research provides valuable statistical tools for cancer risk prediction. The work leads to deeper understanding of molecular mechanisms of specific cancer genes and brings new insights into the development of novel cancer therapy.
dc.embargo.terms2020-08-01
dc.format.mimetypeapplication/pdf
dc.identifier.citationChen, Zhongyuan. "Association Studies in Human Cancers: Metabolic Expression Subtypes and Somatic Mutations/Germline Variations." (2019) Diss., Rice University. <a href="https://hdl.handle.net/1911/106172">https://hdl.handle.net/1911/106172</a>.
dc.identifier.urihttps://hdl.handle.net/1911/106172
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectAssociation studies
dc.subjectstatistical tests
dc.subjecthuman cancers, metabolic expression subtypes
dc.subjectsomatic mutations
dc.subjectgermline variations
dc.titleAssociation Studies in Human Cancers: Metabolic Expression Subtypes and Somatic Mutations/Germline Variations
dc.typeThesis
dc.type.materialText
thesis.degree.departmentStatistics
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CHEN-DOCUMENT-2019.pdf
Size:
2.81 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: