Inference of species phylogenies from bi-allelic markers using pseudo-likelihood

dc.citation.firstpagei376en_US
dc.citation.issueNumber13en_US
dc.citation.journalTitleBioinformaticsen_US
dc.citation.lastpagei385en_US
dc.citation.volumeNumber34en_US
dc.contributor.authorZhu, Jiafanen_US
dc.contributor.authorNakhleh, Luay K.en_US
dc.date.accessioned2018-11-12T15:39:55Zen_US
dc.date.available2018-11-12T15:39:55Zen_US
dc.date.issued2018en_US
dc.description.abstractMOTIVATION: Phylogenetic networks represent reticulate evolutionary histories. Statistical methods for their inference under the multispecies coalescent have recently been developed. A particularly powerful approach uses data that consist of bi-allelic markers (e.g. single nucleotide polymorphism data) and allows for exact likelihood computations of phylogenetic networks while numerically integrating over all possible gene trees per marker. While the approach has good accuracy in terms of estimating the network and its parameters, likelihood computations remain a major computational bottleneck and limit the method's applicability. RESULTS: In this article, we first demonstrate why likelihood computations of networks take orders of magnitude more time when compared to trees. We then propose an approach for inference of phylogenetic networks based on pseudo-likelihood using bi-allelic markers. We demonstrate the scalability and accuracy of phylogenetic network inference via pseudo-likelihood computations on simulated data. Furthermore, we demonstrate aspects of robustness of the method to violations in the underlying assumptions of the employed statistical model. Finally, we demonstrate the application of the method to biological data. The proposed method allows for analyzing larger datasets in terms of the numbers of taxa and reticulation events. While pseudo-likelihood had been proposed before for data consisting of gene trees, the work here uses sequence data directly, offering several advantages as we discuss. AVAILABILITY AND IMPLEMENTATION: The methods have been implemented in PhyloNet (http://bioinfocs.rice.edu/phylonet).en_US
dc.identifier.citationZhu, Jiafan and Nakhleh, Luay K.. "Inference of species phylogenies from bi-allelic markers using pseudo-likelihood." <i>Bioinformatics,</i> 34, no. 13 (2018) Oxford University Press: i376-i385. https://doi.org/10.1093/bioinformatics/bty295.en_US
dc.identifier.digitalbty295en_US
dc.identifier.doihttps://doi.org/10.1093/bioinformatics/bty295en_US
dc.identifier.urihttps://hdl.handle.net/1911/103318en_US
dc.language.isoengen_US
dc.publisherOxford University Pressen_US
dc.rightsThis is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/en_US
dc.titleInference of species phylogenies from bi-allelic markers using pseudo-likelihooden_US
dc.typeJournal articleen_US
dc.type.dcmiTexten_US
dc.type.publicationpublisher versionen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bty295.pdf
Size:
1.29 MB
Format:
Adobe Portable Document Format