Javascript must be enabled to continue!
Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
View through CrossRef
The availability of genome-wide sequence data from a large number of species as well as data from multiple individuals within a species has ushered in the era of phylogenomics. In this era, species phylogeny inference is based on models of sequence evolution on gene trees as well as models of gene tree evolution within the branches of species phylogenies. Parsimony, likelihood, Bayesian, and distance methods have been introduced for species phylogeny inference based on such models. All methods, except for the parsimony ones, assume a common mechanism across all loci as captured by a single value of each branch length of the species phylogeny. In this paper, we propose a ``no common mechanism" (NCM) model, where every gene tree evolves according to its own parameters of the species phylogeny. An analogous model was proposed and explored, both mathematically and experimentally, for sites, or characters, in a sequence alignment in the context of the classical phylogeny problem. For example, a famous equivalence between the maximum parsimony and maximum likelihood phylogeny estimates was established under certain NCM models by Tuffley and Steel. Here we derive an analytically integrated likelihood of both species trees and networks given the gene trees of multiple loci under an NCM model. We demonstrate the performance of inference under this integrated likelihood on both simulated and biological data. The model presented here will afford opportunities for exploring connections among various methods for estimating species phylogenies from multiple, independent loci.
Title: Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
Description:
The availability of genome-wide sequence data from a large number of species as well as data from multiple individuals within a species has ushered in the era of phylogenomics.
In this era, species phylogeny inference is based on models of sequence evolution on gene trees as well as models of gene tree evolution within the branches of species phylogenies.
Parsimony, likelihood, Bayesian, and distance methods have been introduced for species phylogeny inference based on such models.
All methods, except for the parsimony ones, assume a common mechanism across all loci as captured by a single value of each branch length of the species phylogeny.
In this paper, we propose a ``no common mechanism" (NCM) model, where every gene tree evolves according to its own parameters of the species phylogeny.
An analogous model was proposed and explored, both mathematically and experimentally, for sites, or characters, in a sequence alignment in the context of the classical phylogeny problem.
For example, a famous equivalence between the maximum parsimony and maximum likelihood phylogeny estimates was established under certain NCM models by Tuffley and Steel.
Here we derive an analytically integrated likelihood of both species trees and networks given the gene trees of multiple loci under an NCM model.
We demonstrate the performance of inference under this integrated likelihood on both simulated and biological data.
The model presented here will afford opportunities for exploring connections among various methods for estimating species phylogenies from multiple, independent loci.
Related Results
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
Predicting protein inter-residue contacts using composite likelihood maximization and deep learning
Predicting protein inter-residue contacts using composite likelihood maximization and deep learning
AbstractBackgroundAccurate prediction of inter-residue contacts of a protein is important to calculating its tertiary structure. Analysis of co-evolutionary events among residues h...
SEGUL: An ultrafast, memory-efficient alignment manipulation and summary tool for phylogenomics
SEGUL: An ultrafast, memory-efficient alignment manipulation and summary tool for phylogenomics
Phylogenetic studies now routinely require manipulating and summarizing
thousands of data files. For most of these tasks, currently available
software requires considerable computi...
Implementation of Spatial Error Model through Maximum Likelihood
Implementation of Spatial Error Model through Maximum Likelihood
The implementation of spatial models through maximum likelihood is a great challenge due to the presence of Jacobian term in the log likelihood function. In literature, the impleme...
Phylogenomics of primates and their ancestral populations
Phylogenomics of primates and their ancestral populations
Genome assemblies are now available for nine primate species, and large-scale sequencing projects are underway or approved for six others. An explicitly evolutionary and phylogenet...
Likelihood of social-ecological genetic model
Likelihood of social-ecological genetic model
Abstract
The genetic structure of populations depends on two parallel processes - genetic and social-ecological - providing mutual information. Models that describe...
Maximum Likelihood Estimation for Semiparametric Density Ratio Model
Maximum Likelihood Estimation for Semiparametric Density Ratio Model
Abstract
In the statistical literature, the conditional density model specification is commonly used to study regression effects. One attractive model is the semi...
Establishment and Application of the Multi-Peak Forecasting Model
Establishment and Application of the Multi-Peak Forecasting Model
Abstract
After the development of the oil field, it is an important task to predict the production and the recoverable reserve opportunely by the production data....

