Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Fast and Robust Inference of Phylogenetic Ornstein-Uhlenbeck Models Using Parallel Likelihood Calculation

View through CrossRef
A bstract Phylogenetic comparative methods have been used to model trait evolution, to test selection versus neutral hypotheses, to estimate optimal trait-values, and to quantify the rate of adaptation towards these optima. Several authors have proposed algorithms calculating the likelihood for trait evolution models, such as the Ornstein-Uhlenbeck (OU) process, in time proportional to the number of tips in the tree. Combined with gradient-based optimization, these algorithms enable maximum likelihood (ML) inference within seconds, even for trees exceeding 10,000 tips. Despite its useful statistical properties, ML has been criticised for being a point estimator prone to getting stuck in local optima. As an elegant alternative, Bayesian inference explores the entire information in the data and compares it to prior knowledge but, usually, runs in much longer time, even for small trees. Here, we propose an approach to use the full potential of ML and Bayesian inference, while keeping the runtime within minutes. Our approach combines (i) a new algorithm for parallel likelihood calculation; (ii) a previously published method for adaptive Metropolis sampling. In principle, the strategy of (i) and (ii) can be applied to any likelihood calculation on a tree which proceeds in a pruning-like fashion leading to enormous speed improvements. As a showcase, we implement the phylogenetic Ornstein-Uhlenbeck mixed model (POUMM) in the form of an easy-to-use and highly configurable R-package. In addition to the above-mentioned usage of comparative methods, the POUMM allows to estimate non-heritable variance and phylogenetic heritability. Using simulations and empirical data from 487 mammal species, we show that the POUMM is far more reliable in terms of unbiased estimates and false positive rate for stabilizing selection, compared to its alternative - the non-mixed Ornstein-Uhlenbeck model, which assumes a fully heritable and perfectly measurable trait. Further, our analysis reveals that the phylogenetic mixed model (PMM), which assumes neutral evolution (Brownian motion) can be a very unstable estimator of phylogenetic heritability, even if the Brownian motion assumption is only weakly violated. Our results prove the need for a simultaneous account for selection and non-heritable variance in phylogenetic evolutionary models and challenge stabilizing selection hypotheses stated in numerous macro-evolutionary studies.
Title: Fast and Robust Inference of Phylogenetic Ornstein-Uhlenbeck Models Using Parallel Likelihood Calculation
Description:
A bstract Phylogenetic comparative methods have been used to model trait evolution, to test selection versus neutral hypotheses, to estimate optimal trait-values, and to quantify the rate of adaptation towards these optima.
Several authors have proposed algorithms calculating the likelihood for trait evolution models, such as the Ornstein-Uhlenbeck (OU) process, in time proportional to the number of tips in the tree.
Combined with gradient-based optimization, these algorithms enable maximum likelihood (ML) inference within seconds, even for trees exceeding 10,000 tips.
Despite its useful statistical properties, ML has been criticised for being a point estimator prone to getting stuck in local optima.
As an elegant alternative, Bayesian inference explores the entire information in the data and compares it to prior knowledge but, usually, runs in much longer time, even for small trees.
Here, we propose an approach to use the full potential of ML and Bayesian inference, while keeping the runtime within minutes.
Our approach combines (i) a new algorithm for parallel likelihood calculation; (ii) a previously published method for adaptive Metropolis sampling.
In principle, the strategy of (i) and (ii) can be applied to any likelihood calculation on a tree which proceeds in a pruning-like fashion leading to enormous speed improvements.
As a showcase, we implement the phylogenetic Ornstein-Uhlenbeck mixed model (POUMM) in the form of an easy-to-use and highly configurable R-package.
In addition to the above-mentioned usage of comparative methods, the POUMM allows to estimate non-heritable variance and phylogenetic heritability.
Using simulations and empirical data from 487 mammal species, we show that the POUMM is far more reliable in terms of unbiased estimates and false positive rate for stabilizing selection, compared to its alternative - the non-mixed Ornstein-Uhlenbeck model, which assumes a fully heritable and perfectly measurable trait.
Further, our analysis reveals that the phylogenetic mixed model (PMM), which assumes neutral evolution (Brownian motion) can be a very unstable estimator of phylogenetic heritability, even if the Brownian motion assumption is only weakly violated.
Our results prove the need for a simultaneous account for selection and non-heritable variance in phylogenetic evolutionary models and challenge stabilizing selection hypotheses stated in numerous macro-evolutionary studies.

Related Results

phyr: An R package for phylogenetic species-distribution modelling in ecological communities
phyr: An R package for phylogenetic species-distribution modelling in ecological communities
SummaryModel-based approaches are increasingly popular in ecological studies. A good example of this trend is the use of joint species distribution models to ask questions about ec...
Inferring Phylogenetic Networks Using PhyloNet
Inferring Phylogenetic Networks Using PhyloNet
Abstract PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalit...
Epidemic Waves in a Stochastic SIRVI Epidemic Model Incorporating the Ornstein–Uhlenbeck Process
Epidemic Waves in a Stochastic SIRVI Epidemic Model Incorporating the Ornstein–Uhlenbeck Process
The worldwide data for COVID-19 for active, infected individuals in multiple waves show that traditional epidemic models with constant parameters are not able to capture this kind ...
Generalized Ornstein-Uhlenbeck Processes in Ruin Theory
Generalized Ornstein-Uhlenbeck Processes in Ruin Theory
Les processus d'Ornstein-Uhlenbeck généralisés en théorie de la ruine Cette thèse contribue à l'étude des processus d’Ornstein-Uhlenbeck généralisés (GOU) et de leu...
Empirical Performance of Tree-based Inference of Phylogenetic Networks
Empirical Performance of Tree-based Inference of Phylogenetic Networks
Abstract Phylogenetic networks extend the phylogenetic tree structure and allow for modeling vertical and horizontal evolution in a single framework. Statistical in...
Phylogenetic overdispersion of plant species in southern Brazilian savannas
Phylogenetic overdispersion of plant species in southern Brazilian savannas
Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutio...
PaNDA: Efficient Optimization of Phylogenetic Diversity in Networks
PaNDA: Efficient Optimization of Phylogenetic Diversity in Networks
Abstract Phylogenetic diversity plays an important role in biodiversity, conservation, and evolutionary studies by measuring the diversity of a s...

Back to Top