Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model

View through CrossRef
Abstract The pattern of molecular evolution varies among gene sites and genes in a genome. By taking into account the complex heterogeneity of evolutionary processes among sites in a genome, Bayesian infinite mixture models of genomic evolution enable robust phylogenetic inference. With large modern data sets, however, the computational burden of Markov chain Monte Carlo sampling techniques becomes prohibitive. Here, we have developed a variational Bayesian procedure to speed up the widely used PhyloBayes MPI program, which deals with the heterogeneity of amino acid profiles. Rather than sampling from the posterior distribution, the procedure approximates the (unknown) posterior distribution using a manageable distribution called the variational distribution. The parameters in the variational distribution are estimated by minimizing Kullback-Leibler divergence. To examine performance, we analyzed three empirical data sets consisting of mitochondrial, plastid-encoded, and nuclear proteins. Our variational method accurately approximated the Bayesian phylogenetic tree, mixture proportions, and the amino acid propensity of each component of the mixture while using orders of magnitude less computational time.
Title: Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model
Description:
Abstract The pattern of molecular evolution varies among gene sites and genes in a genome.
By taking into account the complex heterogeneity of evolutionary processes among sites in a genome, Bayesian infinite mixture models of genomic evolution enable robust phylogenetic inference.
With large modern data sets, however, the computational burden of Markov chain Monte Carlo sampling techniques becomes prohibitive.
Here, we have developed a variational Bayesian procedure to speed up the widely used PhyloBayes MPI program, which deals with the heterogeneity of amino acid profiles.
Rather than sampling from the posterior distribution, the procedure approximates the (unknown) posterior distribution using a manageable distribution called the variational distribution.
The parameters in the variational distribution are estimated by minimizing Kullback-Leibler divergence.
To examine performance, we analyzed three empirical data sets consisting of mitochondrial, plastid-encoded, and nuclear proteins.
Our variational method accurately approximated the Bayesian phylogenetic tree, mixture proportions, and the amino acid propensity of each component of the mixture while using orders of magnitude less computational time.

Related Results

Because Neglect Isn't Cute: Tuxedo Stan's Campaign for a Humane World
Because Neglect Isn't Cute: Tuxedo Stan's Campaign for a Humane World
On 10 September 2012, a cat named Tuxedo Stan launched his campaign for mayor of the Halifax Regional Municipality in Nova Scotia, Canada (“Tuxedo Stan for Mayor”). Backed by his h...
Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct Introduction Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Why can't we be friends? Exploring factors associated with cat owners' perceptions of the cat-cat relationship in two-cat households
Why can't we be friends? Exploring factors associated with cat owners' perceptions of the cat-cat relationship in two-cat households
Most research examining cat behavior in multi-cat households lacks focus on one group size. This gap in knowledge reduces generalizability of research findings to specific composit...
Environmental Surveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) v2
Environmental Surveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) v2
EnvironmentalSurveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) This comprehensive protocol suite enables systematic environmental surveillance for avian influenza...
Tổng quan về cắt thực quản hiện đại
Tổng quan về cắt thực quản hiện đại
Tóm tắt Mục tiêu: Trình bày tổng quan về cắt thực quản thời hiện đại. Năm 1913, Franz John Torek (1861-1938) người Đức làm việc tại bệnh viện Lenox Hill, Newyork City, là người ...
Clinical characteristics of cat sensitized adults, cat ownership and cat owners' attitudes
Clinical characteristics of cat sensitized adults, cat ownership and cat owners' attitudes
Background: Cat allergen sensitization is a significant risk factor for allergic rhinitis and asthma. There are insufficient data on the preferences and attitudes of cat owners who...
Present climate characterization and future changes in Clear-Air Turbulence (CAT) over the northern hemisphere
Present climate characterization and future changes in Clear-Air Turbulence (CAT) over the northern hemisphere
&lt;p&gt;Airplanes spend about 1% of cruise time in Moderate-Or-Greater (MOG) CAT (Sharman et al. 2006), which is defined as any turbulence occurring in the atmosphere away...

Back to Top