Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model

View through CrossRef
AbstractThe pattern of molecular evolution varies among gene sites and genes in a genome. By taking into account the complex heterogeneity of evolutionary processes among sites in a genome, Bayesian infinite mixture models of genomic evolution enable robust phylogenetic inference. With large modern data sets, however, the computational burden of Markov chain Monte Carlo sampling techniques becomes prohibitive. Here, we have developed a variational Bayesian procedure to speed up the widely used PhyloBayes MPI program, which deals with the heterogeneity of amino acid profiles. Rather than sampling from the posterior distribution, the procedure approximates the (unknown) posterior distribution using a manageable distribution called the variational distribution. The parameters in the variational distribution are estimated by minimizing Kullback-Leibler divergence. To examine performance, we analyzed three empirical data sets consisting of mitochondrial, plastid-encoded, and nuclear proteins. Our variational method accurately approximated the Bayesian phylogenetic tree, mixture proportions, and the amino acid propensity of each component of the mixture while using orders of magnitude less computational time.
Cold Spring Harbor Laboratory
Title: Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
Description:
AbstractThe pattern of molecular evolution varies among gene sites and genes in a genome.
By taking into account the complex heterogeneity of evolutionary processes among sites in a genome, Bayesian infinite mixture models of genomic evolution enable robust phylogenetic inference.
With large modern data sets, however, the computational burden of Markov chain Monte Carlo sampling techniques becomes prohibitive.
Here, we have developed a variational Bayesian procedure to speed up the widely used PhyloBayes MPI program, which deals with the heterogeneity of amino acid profiles.
Rather than sampling from the posterior distribution, the procedure approximates the (unknown) posterior distribution using a manageable distribution called the variational distribution.
The parameters in the variational distribution are estimated by minimizing Kullback-Leibler divergence.
To examine performance, we analyzed three empirical data sets consisting of mitochondrial, plastid-encoded, and nuclear proteins.
Our variational method accurately approximated the Bayesian phylogenetic tree, mixture proportions, and the amino acid propensity of each component of the mixture while using orders of magnitude less computational time.

Related Results

Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct Introduction Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Why can't we be friends? Exploring factors associated with cat owners' perceptions of the cat-cat relationship in two-cat households
Why can't we be friends? Exploring factors associated with cat owners' perceptions of the cat-cat relationship in two-cat households
Most research examining cat behavior in multi-cat households lacks focus on one group size. This gap in knowledge reduces generalizability of research findings to specific composit...
Clinical characteristics of cat sensitized adults, cat ownership and cat owners' attitudes
Clinical characteristics of cat sensitized adults, cat ownership and cat owners' attitudes
Background: Cat allergen sensitization is a significant risk factor for allergic rhinitis and asthma. There are insufficient data on the preferences and attitudes of cat owners who...
Present climate characterization and future changes in Clear-Air Turbulence (CAT) over the northern hemisphere
Present climate characterization and future changes in Clear-Air Turbulence (CAT) over the northern hemisphere
&lt;p&gt;Airplanes spend about 1% of cruise time in Moderate-Or-Greater (MOG) CAT (Sharman et al. 2006), which is defined as any turbulence occurring in the atmosphere away...
Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics
Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics
AbstractRecent advances in statistical machine learning techniques have led to the creation of probabilistic programming frameworks. These frameworks enable probabilistic models to...
Figs S1-S9
Figs S1-S9
Fig. S1. Consensus phylogram (50 % majority rule) resulting from a Bayesian analysis of the ITS sequence alignment of sequences generated in this study and reference sequences from...
Theory of variational quantum simulation
Theory of variational quantum simulation
The variational method is a versatile tool for classical simulation of a variety of quantum systems. Great efforts have recently been devoted to its extension to quantum computing ...

Back to Top