Javascript must be enabled to continue!
Inference of Species Phylogenies from Bi-allelic Markers Using Pseudo-likelihood
View through CrossRef
Abstract
Motivation
Phylogenetic networks represent reticulate evolutionary histories. Statistical methods for their inference under the multispecies coalescent have recently been developed. A particularly powerful approach uses data that consist of bi-allelic markers (e.g., single nucleotide polymorphism data) and allows for exact likelihood computations of phylogenetic networks while numerically integrating over all possible gene trees per marker. While the approach has good accuracy in terms of estimating the network and its parameters, likelihood computations remain a major computational bottleneck and limit the method’s applicability.
Results
In this paper, we first demonstrate why likelihood computations of networks take orders of magnitude more time when compared to trees. We then propose an approach for inference of phylo-genetic networks based on pseudo-likelihood using bi-allelic markers. We demonstrate the scalability and accuracy of phylogenetic network inference via pseudo-likelihood computations on simulated data. Furthermore, we demonstrate aspects of robustness of the method to violations in the underlying assumptions of the employed statistical model. Finally, we demonstrate the application of the method to biological data. The proposed method allows for analyzing larger data sets in terms of the numbers of taxa and reticulation events. While pseudo-likelihood had been proposed before for data consisting of gene trees, the work here uses sequence data directly, offering several advantages as we discuss.
Availability
The methods have been implemented in PhyloNet (
http://bioinfocs.rice.edu/phylonet
).
Contact
jiafan.zhu@rice.edu
,
nakhleh@rice.edu
Title: Inference of Species Phylogenies from Bi-allelic Markers Using Pseudo-likelihood
Description:
Abstract
Motivation
Phylogenetic networks represent reticulate evolutionary histories.
Statistical methods for their inference under the multispecies coalescent have recently been developed.
A particularly powerful approach uses data that consist of bi-allelic markers (e.
g.
, single nucleotide polymorphism data) and allows for exact likelihood computations of phylogenetic networks while numerically integrating over all possible gene trees per marker.
While the approach has good accuracy in terms of estimating the network and its parameters, likelihood computations remain a major computational bottleneck and limit the method’s applicability.
Results
In this paper, we first demonstrate why likelihood computations of networks take orders of magnitude more time when compared to trees.
We then propose an approach for inference of phylo-genetic networks based on pseudo-likelihood using bi-allelic markers.
We demonstrate the scalability and accuracy of phylogenetic network inference via pseudo-likelihood computations on simulated data.
Furthermore, we demonstrate aspects of robustness of the method to violations in the underlying assumptions of the employed statistical model.
Finally, we demonstrate the application of the method to biological data.
The proposed method allows for analyzing larger data sets in terms of the numbers of taxa and reticulation events.
While pseudo-likelihood had been proposed before for data consisting of gene trees, the work here uses sequence data directly, offering several advantages as we discuss.
Availability
The methods have been implemented in PhyloNet (
http://bioinfocs.
rice.
edu/phylonet
).
Contact
jiafan.
zhu@rice.
edu
,
nakhleh@rice.
edu.
Related Results
Eficacia, seguridad y eficiencia de la radioterapia corporal estereotáctica aplicada con marcadores de referencia en oncología
Eficacia, seguridad y eficiencia de la radioterapia corporal estereotáctica aplicada con marcadores de referencia en oncología
Introduction
Stereotactic body radiotherapy (SBRT) is a technology that involves delivering high doses of radiation, in few sessios and with high precision, to a specific tumor loc...
Deep Learning from Phylogenies for Diversification Analyses
Deep Learning from Phylogenies for Diversification Analyses
ABSTRACT
Birth-death models are widely used in combination with species phylogenies to study past diversification dynamics. Current inference approaches typically r...
Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
The availability of genome-wide sequence data from a large number of species as well as data from multiple individuals within a species has ushered in the era of phylogenomics. In ...
An Algorithmic Classification of Generalized Pseudo-Anosov Homeomorphisms via Geometric Markov Partitions
An Algorithmic Classification of Generalized Pseudo-Anosov Homeomorphisms via Geometric Markov Partitions
Une Classification Algorithmique des Homéomorphismes Pseudo-Anosov Généralisés via les Partitions Géométriques de Markov
Cette thèse vise à fournir une classificati...
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
<p>Coastal environments are exposed to anthropogenic activities such as frequent marine traffic and restructuring, i.e., addition, removal or replacing with man-made structur...
Recombination and the role of pseudo-overdominance in polyploid evolution
Recombination and the role of pseudo-overdominance in polyploid evolution
ABSTRACT
Natural selection is an imperfect force that can under some conditions fail to prevent the buildup of deleterious mutations. Small population sizes and the...
Weak pseudo-BCK algebras
Weak pseudo-BCK algebras
Abstract
In this paper we define and study the weak pseudo-BCK algebras as generalizations of weak BCK-algebras, extending some results given by Cı⃖rulis for weak BC...
MV-algebras with pseudo MV-valuations
MV-algebras with pseudo MV-valuations
The concept of pseudo MV-valuations is proposed in the paper, and some related characterizations of pseudo MV-valuations are investigated. The relationships between the pseudo MV-v...

