Gene Tree Distributions under Duplication, Loss and Deep Coalescence

dc.contributor.advisorNakhleh, Luayen_US
dc.creatorYe, Danen_US
dc.date.accessioned2017-08-01T16:05:24Zen_US
dc.date.available2017-08-01T16:05:24Zen_US
dc.date.created2017-05en_US
dc.date.issued2017-01-05en_US
dc.date.submittedMay 2017en_US
dc.date.updated2017-08-01T16:05:24Zen_US
dc.description.abstractGene duplication and loss are two evolutionary processes that occur across all three domains of life. These two processes result in different loci, across a set of related genomes, having different gene trees. Inferring the phylogeny of the genomes from data sets of such gene trees is a central task in phylogenomics. Furthermore, when the evolutionary history of the genomes includes relatively close divergence events, as in cases of closely related organisms or rapid radiations, deep coalescence of gene copies could be at play, in addition to duplication and loss, further adding to the complexity of gene/genome relationships. In this work, we develop a probabilistic model of gene evolution that incorporates duplications and loss, and accounts for deep coalescence. We formulate the models in terms of Markov chains, and provide algorithms for computing gene tree distributions for the two cases of gene trees with and without branch lengths. We illustrate the use of our work on simulated and biological data by assessing the accuracy of species tree inferences under our models (topology and branch lengths) and contrasting them to inferences under cases of deep coalescence alone. It is important to highlight that our models sidestep the issue of hidden paralogy by ``integrating out" the possible orthology assignments of gene copies. Our work enables new statistical phylogenomic analyses, particularly when hidden paralogy and deep coalescence could be at play.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationYe, Dan. "Gene Tree Distributions under Duplication, Loss and Deep Coalescence." (2017) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/95995">https://hdl.handle.net/1911/95995</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/95995en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectPhylogeneticsen_US
dc.subjectMarkov Chainen_US
dc.subjectGene Tree Distributionen_US
dc.subjectGene Duplicationen_US
dc.subjectGene Lossen_US
dc.subjectDeep Coalescenceen_US
dc.titleGene Tree Distributions under Duplication, Loss and Deep Coalescenceen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
YE-DOCUMENT-2017.pdf
Size:
932.76 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: