Javascript must be enabled to continue!
Modeling Protein Evolution via Generative Inference From Monte Carlo Chains to Population Genetics
View through CrossRef
Generative models derived from large protein sequence alignments define complex fitness landscapes, but their utility for accurately modeling non-equilibrium evolutionary dynamics remains unclear. In this work, we perform a rigorous comparative analysis of three simulation schemes, designed to mimic evolution
in silico
by local sampling of the probability distribution defined by a generative model. We compare standard independent Markov Chain Monte Carlo, Monte Carlo on a phylogenetic tree, and a population genetics dynamics, benchmarking their outputs against deep sequencing data from four distinct
in vitro
evolution experiments. We find that standard Monte Carlo fails to reproduce the correct phylogenetic structure and generates unrealistic, gradual mutational sweeps. Performing Monte Carlo on a tree inferred from data improves phylogenetic fidelity and historical accuracy. The population genetics scheme successfully captures phylogenetic correlations, mutational abundances, and selective sweeps as emergent properties, without the need to infer additional information from data. However, the latter choice come at the price of not sampling the proper generative model distribution at long times. Our findings highlight the crucial role of phylogenetic correlations and finite-population effects in shaping evolutionary trajectories on fitness landscapes. These models therefore provide powerful tools for predicting complex adaptive paths and for reliably extrapolating evolutionary dynamics beyond current experimental limitations.
Title: Modeling Protein Evolution via Generative Inference From Monte Carlo Chains to Population Genetics
Description:
Generative models derived from large protein sequence alignments define complex fitness landscapes, but their utility for accurately modeling non-equilibrium evolutionary dynamics remains unclear.
In this work, we perform a rigorous comparative analysis of three simulation schemes, designed to mimic evolution
in silico
by local sampling of the probability distribution defined by a generative model.
We compare standard independent Markov Chain Monte Carlo, Monte Carlo on a phylogenetic tree, and a population genetics dynamics, benchmarking their outputs against deep sequencing data from four distinct
in vitro
evolution experiments.
We find that standard Monte Carlo fails to reproduce the correct phylogenetic structure and generates unrealistic, gradual mutational sweeps.
Performing Monte Carlo on a tree inferred from data improves phylogenetic fidelity and historical accuracy.
The population genetics scheme successfully captures phylogenetic correlations, mutational abundances, and selective sweeps as emergent properties, without the need to infer additional information from data.
However, the latter choice come at the price of not sampling the proper generative model distribution at long times.
Our findings highlight the crucial role of phylogenetic correlations and finite-population effects in shaping evolutionary trajectories on fitness landscapes.
These models therefore provide powerful tools for predicting complex adaptive paths and for reliably extrapolating evolutionary dynamics beyond current experimental limitations.
Related Results
Monte Carlo methods: barrier option pricing with stable Greeks and multilevel Monte Carlo learning
Monte Carlo methods: barrier option pricing with stable Greeks and multilevel Monte Carlo learning
For discretely observed barrier options, there exists no closed solution under the Black-Scholes model. Thus, it is often helpful to use Monte Carlo simulations, which are easily a...
Research on Multi-Group Monte Carlo Calculations Based on Group Constants Generated by RMC
Research on Multi-Group Monte Carlo Calculations Based on Group Constants Generated by RMC
Abstract
Nowadays, deterministic two-step or Monte Carlo methods are commonly used in core physics calculations. However, with the development of reactor core design, tradi...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
Evaluating View Factors Using a Hybrid Monte-Carlo Method
Evaluating View Factors Using a Hybrid Monte-Carlo Method
AbstractThis paper demonstrates that the well-known method for calculating view factors, the Monte Carlo method, combined with ray tracing is not necessarily the most efficient str...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Automation of the Monte Carlo simulation of medical linear accelerators
Automation of the Monte Carlo simulation of medical linear accelerators
The main result of this thesis is a software system, called PRIMO, which simulates clinical linear accelerators and the subsequent dose distributions using the Monte Carlo method. ...
Perceived Gaps in Genetics Training Among Audiologists and Speech-Language Pathologists: Lessons From a National Survey
Perceived Gaps in Genetics Training Among Audiologists and Speech-Language Pathologists: Lessons From a National Survey
Purpose
The aim of this study was to assess knowledge, self-rated confidence, and perceived relevance of genetics in the clinical practice of audiologists and speech-la...
Artificial Intelligence for Monte Carlo Simulation in Medical Physics
Artificial Intelligence for Monte Carlo Simulation in Medical Physics
Monte Carlo simulation of particle tracking in matter is the reference simulation method in the field of medical physics. It is heavily used in various applications such as 1) pati...

