Statistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markers

dc.contributor.advisorNakhleh, Luayen_US
dc.creatorZhu, Jiafanen_US
dc.date.accessioned2019-05-17T13:28:54Zen_US
dc.date.available2019-05-17T13:28:54Zen_US
dc.date.created2018-05en_US
dc.date.issued2018-04-18en_US
dc.date.submittedMay 2018en_US
dc.date.updated2019-05-17T13:28:54Zen_US
dc.description.abstractPhylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined both a model of evolution for the bi-allelic markers, as well as the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method has a very good performance in terms of accuracy and robustness as we demonstrate on simulated data, as well as a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationZhu, Jiafan. "Statistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markers." (2018) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105598">https://hdl.handle.net/1911/105598</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/105598en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectmultispecies network coalescenten_US
dc.subjectphylogenetic networksen_US
dc.subjectbi-allelic markersen_US
dc.subjectreticulationen_US
dc.subjectincomplete lineage sortingen_US
dc.titleStatistical Inference of Phylogenetic Networks from Unlinked Bi-allelic Markersen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZHU-DOCUMENT-2018.pdf
Size:
865.92 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: