Javascript must be enabled to continue!
Empirical Performance of Tree-based Inference of Phylogenetic Networks
View through CrossRef
AbstractPhylogenetic networks extend the phylogenetic tree structure and allow for modeling vertical and horizontal evolution in a single framework. Statistical inference of phylogenetic networks is prohibitive and currently limited to small networks. An approach that could significantly improve phylogenetic network space exploration is based on first inferring an evolutionary tree of the species under consideration, and then augmenting the tree into a network by adding a set of “horizontal” edges to better fit the data.In this paper, we study the performance of such an approach on networks generated under a birth-hybridization model and explore its feasibility as an alternative to approaches that search the phylogenetic network space directly (without relying on a fixed underlying tree). We find that the concatenation method does poorly at obtaining a “backbone” tree that could be augmented into the correct network, whereas the popular species tree inference method ASTRAL does significantly better at such a task. We then evaluated the tree-to-network augmentation phase under the minimizing deep coalescence and pseudo-likelihood criteria. We find that even though this is a much faster approach than the direct search of the network space, the accuracy is much poorer, even when the backbone tree is a good starting tree.Our results show that tree-based inference of phylogenetic networks could yield very poor results. As exploration of the network space directly in search of maximum likelihood estimates or a representative sample of the posterior is very expensive, significant improvements to the computational complexity of phylogenetic network inference are imperative if analyses of large data sets are to be performed. We show that a recently developed divide-and-conquer approach significantly outperforms tree-based inference in terms of accuracy, albeit still at a higher computational cost.
Title: Empirical Performance of Tree-based Inference of Phylogenetic Networks
Description:
AbstractPhylogenetic networks extend the phylogenetic tree structure and allow for modeling vertical and horizontal evolution in a single framework.
Statistical inference of phylogenetic networks is prohibitive and currently limited to small networks.
An approach that could significantly improve phylogenetic network space exploration is based on first inferring an evolutionary tree of the species under consideration, and then augmenting the tree into a network by adding a set of “horizontal” edges to better fit the data.
In this paper, we study the performance of such an approach on networks generated under a birth-hybridization model and explore its feasibility as an alternative to approaches that search the phylogenetic network space directly (without relying on a fixed underlying tree).
We find that the concatenation method does poorly at obtaining a “backbone” tree that could be augmented into the correct network, whereas the popular species tree inference method ASTRAL does significantly better at such a task.
We then evaluated the tree-to-network augmentation phase under the minimizing deep coalescence and pseudo-likelihood criteria.
We find that even though this is a much faster approach than the direct search of the network space, the accuracy is much poorer, even when the backbone tree is a good starting tree.
Our results show that tree-based inference of phylogenetic networks could yield very poor results.
As exploration of the network space directly in search of maximum likelihood estimates or a representative sample of the posterior is very expensive, significant improvements to the computational complexity of phylogenetic network inference are imperative if analyses of large data sets are to be performed.
We show that a recently developed divide-and-conquer approach significantly outperforms tree-based inference in terms of accuracy, albeit still at a higher computational cost.
Related Results
PaNDA: Efficient Optimization of Phylogenetic Diversity in Networks
PaNDA: Efficient Optimization of Phylogenetic Diversity in Networks
Abstract
Phylogenetic diversity plays an important role in biodiversity, conservation, and evolutionary studies by measuring the diversity of a s...
Inferring Phylogenetic Networks Using PhyloNet
Inferring Phylogenetic Networks Using PhyloNet
AbstractPhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet c...
On the inference of complex phylogenetic networks by Markov Chain Monte-Carlo
On the inference of complex phylogenetic networks by Markov Chain Monte-Carlo
Abstract
For various species, high quality sequences and complete genomes are nowadays available for many individuals. This makes data analysis c...
Reliable estimation of tree branch lengths using deep neural networks
Reliable estimation of tree branch lengths using deep neural networks
Abstract
A phylogenetic tree represents hypothesized evolutionary history for a set of taxa. Besides the branching patterns (i.e., tree topology), phylogenies conta...
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
<p>Tropical forests are the most productive terrestrial ecosystems, global centres of biodiversity and important participants in the global carbon and water cycles. T...
Phylogenetic overdispersion of plant species in southern Brazilian savannas
Phylogenetic overdispersion of plant species in southern Brazilian savannas
Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutio...
Evolutionary Grammatical Inference
Evolutionary Grammatical Inference
Grammatical Inference (also known as grammar induction) is the problem of learning a grammar for a language from a set of examples. In a broad sense, some data is presented to the ...
Phylogenetic supertree reveals detailed evolution of SARS-CoV-2
Phylogenetic supertree reveals detailed evolution of SARS-CoV-2
Abstract
Corona Virus Disease 2019 (COVID-19) caused by the emerged coronavirus SARS-CoV-2 is spreading globally. The origin of SARS-Cov-19 and its evolutionary relationshi...

