Gene Tree Distributions under Duplication, Loss and Deep Coalescence

Date
2017-01-05
Abstract

Gene duplication and loss are two evolutionary processes that occur across all three domains of life. As a result of these processes, different loci across a set of related genomes have different gene trees. Inferring the phylogeny of the genomes from data sets of such gene trees is a central task in phylogenomics. Furthermore, when the evolutionary history of the genomes includes relatively close divergence events, as in cases of closely related organisms or rapid radiations, deep coalescence of gene copies could be at play in addition to duplication and loss, further complicating gene/genome relationships. In this work, we develop a probabilistic model of gene evolution that incorporates duplication and loss and accounts for deep coalescence. We formulate the models in terms of Markov chains and provide algorithms for computing gene tree distributions for two cases: gene trees with branch lengths and gene trees without. We illustrate the use of our work on simulated and biological data by assessing the accuracy of species tree inferences (topology and branch lengths) under our models and contrasting them with inferences that assume deep coalescence alone. Importantly, our models sidestep the issue of hidden paralogy by "integrating out" the possible orthology assignments of gene copies. Our work enables new statistical phylogenomic analyses, particularly when hidden paralogy and deep coalescence could be at play.

Degree
Master of Science
Type
Thesis
Keywords
Phylogenetics, Markov Chain, Gene Tree Distribution, Gene Duplication, Gene Loss, Deep Coalescence
Citation

Ye, Dan. "Gene Tree Distributions under Duplication, Loss and Deep Coalescence." (2017) Master’s Thesis, Rice University. https://hdl.handle.net/1911/95995.

Rights
Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.