Statistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markers

dc.contributor.advisorNakhleh, Luay
dc.creatorZhu, Jiafan
dc.date.accessioned2019-05-17T13:28:54Z
dc.date.available2019-05-17T13:28:54Z
dc.date.created2018-05
dc.date.issued2018-04-18
dc.date.submittedMay 2018
dc.date.updated2019-05-17T13:28:54Z
dc.description.abstractPhylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined both a model of evolution for the bi-allelic markers, as well as the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method has a very good performance in terms of accuracy and robustness as we demonstrate on simulated data, as well as a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.
dc.format.mimetypeapplication/pdf
dc.identifier.citationZhu, Jiafan. "Statistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markers." (2018) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105598">https://hdl.handle.net/1911/105598</a>.
dc.identifier.urihttps://hdl.handle.net/1911/105598
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectmultispecies network coalescent
dc.subjectphylogenetic networks
dc.subjectbi-allelic markers
dc.subjectreticulation
dc.subjectincomplete lineage sorting
dc.titleStatistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markers
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputer Science
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZHU-DOCUMENT-2018.pdf
Size:
865.92 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: