Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics

View through CrossRef
AbstractRecent advances in statistical machine learning techniques have led to the creation of probabilistic programming frameworks. These frameworks enable probabilistic models to be rapidly prototyped and fit to data using scalable approximation methods such as variational inference. In this work, we explore the use of the Stan language for probabilistic programming in application to phylogenetic models. We show that many commonly used phylogenetic models including the general time reversible (GTR) substitution model, rate heterogeneity among sites, and a range of coalescent models can be implemented using a probabilistic programming language. The posterior probability distributions obtained via the black box variational inference engine in Stan were compared to those obtained with reference implementations of Markov chain Monte Carlo (MCMC) for phylogenetic inference. We find that black box variational inference in Stan is less accurate than MCMC methods for phylogenetic models, but requires far less compute time. Finally, we evaluate a custom implementation of mean-field variational inference on the Jukes-Cantor substitution model and show that a specialized implementation of variational inference can be two orders of magnitude faster and more accurate than a general purpose probabilistic implementation.
Cold Spring Harbor Laboratory
Title: Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics
Description:
AbstractRecent advances in statistical machine learning techniques have led to the creation of probabilistic programming frameworks.
These frameworks enable probabilistic models to be rapidly prototyped and fit to data using scalable approximation methods such as variational inference.
In this work, we explore the use of the Stan language for probabilistic programming in application to phylogenetic models.
We show that many commonly used phylogenetic models including the general time reversible (GTR) substitution model, rate heterogeneity among sites, and a range of coalescent models can be implemented using a probabilistic programming language.
The posterior probability distributions obtained via the black box variational inference engine in Stan were compared to those obtained with reference implementations of Markov chain Monte Carlo (MCMC) for phylogenetic inference.
We find that black box variational inference in Stan is less accurate than MCMC methods for phylogenetic models, but requires far less compute time.
Finally, we evaluate a custom implementation of mean-field variational inference on the Jukes-Cantor substitution model and show that a specialized implementation of variational inference can be two orders of magnitude faster and more accurate than a general purpose probabilistic implementation.

Related Results

Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Inventory and pricing management in probabilistic selling
Inventory and pricing management in probabilistic selling
Context: Probabilistic selling is the strategy that the seller creates an additional probabilistic product using existing products. The exact information is unknown to customers u...
Figs S1-S9
Figs S1-S9
Fig. S1. Consensus phylogram (50 % majority rule) resulting from a Bayesian analysis of the ITS sequence alignment of sequences generated in this study and reference sequences from...
Theory of variational quantum simulation
Theory of variational quantum simulation
The variational method is a versatile tool for classical simulation of a variety of quantum systems. Great efforts have recently been devoted to its extension to quantum computing ...
Extended Bayesian inference incorporating symmetry bias
Extended Bayesian inference incorporating symmetry bias
AbstractIn this study, we start by proposing a causal induction model that incorporates symmetry bias. This model has two parameters that control the strength of symmetry bias and ...
Evolutionary Grammatical Inference
Evolutionary Grammatical Inference
Grammatical Inference (also known as grammar induction) is the problem of learning a grammar for a language from a set of examples. In a broad sense, some data is presented to the ...
Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
AbstractThe pattern of molecular evolution varies among gene sites and genes in a genome. By taking into account the complex heterogeneity of evolutionary processes among sites in ...
Bayesian Inference for the Beta-Weibull Distribution with Applications to Cancer and Under-nutrition Data
Bayesian Inference for the Beta-Weibull Distribution with Applications to Cancer and Under-nutrition Data
Abstract The Weibull probability distribution can be used to model data from many subject areas such as survival data in health, reliability data in engineering and insuran...

Back to Top