Inference of reticulate evolutionary histories by maximum likelihood: the performance of information criteria

dc.citation.conferenceDate2012-10-17en_US
dc.citation.conferenceNameTenth Annual Research in Computational Molecular Biology (RECOMB) Satellite Workshop on Comparative Genomicsen_US
dc.citation.firstpageS12en_US
dc.citation.issueNumberSuppl 19en_US
dc.citation.journalTitleBMC Bioinformaticsen_US
dc.citation.volumeNumber13en_US
dc.contributor.authorPark, Hyun Jungen_US
dc.contributor.authorNakhleh, Luayen_US
dc.date.accessioned2013-03-19T17:51:13Zen_US
dc.date.available2013-03-19T17:51:13Zen_US
dc.date.issued2012en_US
dc.description.abstractBackground: Maximum likelihood has been widely used for over three decades to infer phylogenetic trees from molecular data. When reticulate evolutionary events occur, several genomic regions may have conflicting evolutionary histories, and a phylogenetic network may provide a more adequate model for representing the evolutionary history of the genomes or species. A maximum likelihood (ML) model has been proposed for this case and accounts for both mutation within a genomic region and reticulation across the regions. However, the performance of this model in terms of inferring information about reticulate evolution and properties that affect this performance have not been studied. Results: In this paper, we study the effect of the evolutionary diameter and height of a reticulation event on its identifiability under ML. We find both of them, particularly the diameter, have a significant effect. Further, we find that the number of genes (which can be generalized to the concept of "non-recombining genomic regions") that are transferred across a reticulation edge affects its detectability. Last but not least, a fundamental challenge with phylogenetic networks is that they allow an arbitrary level of complexity, giving rise to the model selection problem. We investigate the performance of two information criteria, the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), for addressing this problem. We find that BIC performs well in general for controlling the model complexity and preventing ML from grossly overestimating the number of reticulation events. Conclusion: Our results demonstrate that BIC provides a good framework for inferring reticulate evolutionary histories. Nevertheless, the results call for caution when interpreting the accuracy of the inference particularly for data sets with particular evolutionary features.en_US
dc.embargo.termsnoneen_US
dc.identifier.citationPark, Hyun Jung and Nakhleh, Luay. "Inference of reticulate evolutionary histories by maximum likelihood: the performance of information criteria." <i>BMC Bioinformatics,</i> 13, no. Suppl 19 (2012) BioMed Central: S12. https://doi.org/10.1186/1471-2105-13-S19-S12.en_US
dc.identifier.doihttps://doi.org/10.1186/1471-2105-13-S19-S12en_US
dc.identifier.urihttps://hdl.handle.net/1911/70724en_US
dc.language.isoengen_US
dc.publisherBioMed Centralen_US
dc.rightsThis article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.en_US
dc.rights.urihttps://creativecommons.org/licenses/by/2.0/en_US
dc.titleInference of reticulate evolutionary histories by maximum likelihood: the performance of information criteriaen_US
dc.typeJournal articleen_US
dc.type.dcmiTexten_US
dc.type.publicationpublisher versionen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ParkNakhleh-RECOMBCG12.pdf
Size:
754.52 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: