Evolutionary models for insertions and deletions in a probabilistic modeling framework

Abstract

Background: Probabilistic models for sequence comparison (such as hidden Markov models and pair hidden Markov models for proteins and mRNAs, or their context-free grammar counterparts for structural RNAs) often assume a fixed degree of divergence. Ideally we would like these models to be conditional on evolutionary divergence time. Probabilistic models of substitution events are well established, but there has not been a completely satisfactory theoretical framework for modeling insertion and deletion events.

Results: I have developed a method for extending standard Markov substitution models to include gap characters, and another method for the evolution of state transition probabilities in a probabilistic model. These methods use instantaneous rate matrices in a way that is more general than those used for substitution processes, and are sufficient to provide time-dependent models for standard linear and affine gap penalties, respectively. Given a probabilistic model, we can make all of its emission probabilities (including gap characters) and all its transition probabilities conditional on a chosen divergence time. To do this, we only need to know the parameters of the model at one particular divergence time instance, as well as the parameters of the model at the two extremes of zero and infinite divergence. I have implemented these methods in a new generation of the RNA genefinder QRNA (eQRNA).

Conclusion: These methods can be applied to incorporate evolutionary models of insertions and deletions into any hidden Markov model or stochastic context-free grammar, in a pair or profile form, for sequence modeling.
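The statement that a parameter can be made time-dependent from its values at zero divergence, at one reference time, and at infinite divergence can be illustrated with a deliberately simplified scalar sketch. This is not the rate-matrix construction of the paper: it assumes, purely for illustration, that a single probability relaxes exponentially from its zero-divergence value toward its infinite-divergence value, and it fits the one free rate from the reference point. The function names (`fit_rate`, `param_at`) and the example numbers are hypothetical.

```python
import math


def fit_rate(p_zero, p_star, p_inf, t_star):
    """Fit the rate r of an assumed exponential relaxation

        p(t) = p_inf + (p_zero - p_inf) * exp(-r * t)

    so that p(t_star) == p_star. This scalar form is an illustrative
    assumption, not the paper's matrix formulation.
    """
    if t_star <= 0:
        raise ValueError("reference time must be positive")
    ratio = (p_star - p_inf) / (p_zero - p_inf)
    # p_star must lie strictly between the two extremes for a finite r > 0.
    if not 0.0 < ratio < 1.0:
        raise ValueError("p_star must lie strictly between p_zero and p_inf")
    return -math.log(ratio) / t_star


def param_at(t, p_zero, p_star, p_inf, t_star):
    """Value of the parameter at divergence time t under the assumed form."""
    r = fit_rate(p_zero, p_star, p_inf, t_star)
    return p_inf + (p_zero - p_inf) * math.exp(-r * t)


# Hypothetical gap-open probability: 0 at t = 0, 0.05 at the reference
# time t* = 1, saturating at 0.5 for infinitely diverged sequences.
for t in (0.0, 0.5, 1.0, 2.0, 10.0):
    print(t, param_at(t, p_zero=0.0, p_star=0.05, p_inf=0.5, t_star=1.0))
```

The three known values pin down the curve completely in this toy setting: the two extremes fix the endpoints and the reference point fixes the rate. In the matrix setting described in the abstract, the same role is played by instantaneous rate matrices rather than a single scalar rate.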
Springer Science and Business Media LLC