A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum

dc.contributor.advisorKimmel, Mareken_US
dc.contributor.committeeMemberLeal, Suzanneen_US
dc.contributor.committeeMemberThompson, James R.en_US
dc.contributor.committeeMemberNakhleh, Luay K.en_US
dc.creatorBanuelos, Rosaen_US
dc.date.accessioned2013-09-16T14:30:07Zen_US
dc.date.accessioned2013-09-16T14:30:10Zen_US
dc.date.available2013-09-16T14:30:07Zen_US
dc.date.available2013-09-16T14:30:10Zen_US
dc.date.created2013-05en_US
dc.date.issued2013-09-16en_US
dc.date.submittedMay 2013en_US
dc.date.updated2013-09-16T14:30:10Zen_US
dc.description.abstractAlthough complete understanding of the mechanisms of rare genetic variants in disease continues to elude us, Next Generation Sequencing (NGS) has facilitated significant gene discoveries across the disease spectrum. However, the cost of NGS hinders its use for identifying rare variants in common diseases that require large samples. To circumvent the need for larger samples, designing efficient sampling studies is crucial in order to detect potential associations. This research therefore evaluates sampling designs for rare variant - quantitative trait association studies and assesses the effect on power that freely available public cohort data can have in the design. Performing simulations and evaluating common and unconventional sampling schemes results in several noteworthy findings. Specifically, the extreme-trait design is the most powerful design for analyzing quantitative traits. This research also shows that sampling more individuals from the extreme of clinical interest does not increase power. Variant filtering has served as a "proof-of-concept" approach for the discovery of disease-causing genes in Mendelian traits and formal statistical methods have been lacking in this area. However, combining variant filtering schemes with existing rare variant association tests is a practical alternative. Thus, this thesis also compares the robustness of six burden-based rare variant association tests for Mendelian traits after a variant filtering step in the presence of genetic heterogeneity and genotyping errors. This research shows that with low locus heterogeneity, these tests are powerful for testing association. With the exception of the weighted sum statistic (WSS), the remaining tests were very conservative in preserving the type I error when the number of affected and unaffected individuals was unequal. The WSS, on the other hand, had inflated type I error as the number of unaffected individuals increased. The framework presented can serve as a catalyst to improve sampling design and to develop robust statistical methods for association testing.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationBanuelos, Rosa. "A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum." (2013) Diss., Rice University. <a href="https://hdl.handle.net/1911/71920">https://hdl.handle.net/1911/71920</a>.en_US
dc.identifier.slug123456789/ETD-2013-05-353en_US
dc.identifier.urihttps://hdl.handle.net/1911/71920en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectRare variantsen_US
dc.subjectCommon diseaseen_US
dc.subjectPoweren_US
dc.subjectMendelianen_US
dc.subjectSamplingen_US
dc.titleA Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrumen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentStatisticsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BANUELOS-THESIS.pdf
Size:
1.23 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: