Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Refined Evolutionary Trees Through an Exceptionally Compatible Alignment-Substitution Model

View through CrossRef
A phylogenetic tree commonly represents evolutionary relationships within a set of protein sequences. Various methods and strategies have been used to improve the accuracy of phylogenetic trees, but their capacity to derive a biologically credible relationship appears to be overestimated. Although the quality of the protein sequence alignment and the choice of substitution matrix are preliminary constraints to define the biological accuracy of the overlapped residues, the alignment is not iteratively optimized through the statistical testing of residue-substitution models. The exact alignment protocol and substitution model information are by default used for every sequence set by a server to construct an often-irrelevant phylogenetic tree, and no sequence-based tailoring of phylogenetic strategy is implemented by any server. Rigorously constructing 270 evolutionary trees, constructed using IQ-TREE based on 13 different alignments (Clustal-Omega, Kalign, MAFFT, MUSCLE, TCoffee, and Promals3D, as well as their HHPred-based hidden Markov model [HMM] alignments using HHPred) and nine substitution models (Dayhoff, JJT, block substitution matrix62, WAG, probability matrix from blocks [PMB], direct computation with mutability [DCMUT], JTTDCmut, LG, and variable time), the present study highlights the failure of the current methods and emphasizes the need for a more accurate scrutiny of the entire phylogenetic methodology. MUSCLE alignment and the LG and Dayhoff matrices yield more accurate phylogenetic results for sequences shorter than 500 residues for the log-likelihood measure. Moreover, Kalign 1 HMM alignment yields the top-ranked tree with the lowest tree length score with only the PMB matrix, making this substitution model more accurate in terms of total tree length score. The suggested strategy would be beneficial for understanding the potential pitfalls of phylogenetic inference and would aid us in deriving a more accurate evolutionary relationship for a sequence dataset.
Title: Refined Evolutionary Trees Through an Exceptionally Compatible Alignment-Substitution Model
Description:
A phylogenetic tree commonly represents evolutionary relationships within a set of protein sequences.
Various methods and strategies have been used to improve the accuracy of phylogenetic trees, but their capacity to derive a biologically credible relationship appears to be overestimated.
Although the quality of the protein sequence alignment and the choice of substitution matrix are preliminary constraints to define the biological accuracy of the overlapped residues, the alignment is not iteratively optimized through the statistical testing of residue-substitution models.
The exact alignment protocol and substitution model information are by default used for every sequence set by a server to construct an often-irrelevant phylogenetic tree, and no sequence-based tailoring of phylogenetic strategy is implemented by any server.
Rigorously constructing 270 evolutionary trees, constructed using IQ-TREE based on 13 different alignments (Clustal-Omega, Kalign, MAFFT, MUSCLE, TCoffee, and Promals3D, as well as their HHPred-based hidden Markov model [HMM] alignments using HHPred) and nine substitution models (Dayhoff, JJT, block substitution matrix62, WAG, probability matrix from blocks [PMB], direct computation with mutability [DCMUT], JTTDCmut, LG, and variable time), the present study highlights the failure of the current methods and emphasizes the need for a more accurate scrutiny of the entire phylogenetic methodology.
MUSCLE alignment and the LG and Dayhoff matrices yield more accurate phylogenetic results for sequences shorter than 500 residues for the log-likelihood measure.
Moreover, Kalign 1 HMM alignment yields the top-ranked tree with the lowest tree length score with only the PMB matrix, making this substitution model more accurate in terms of total tree length score.
The suggested strategy would be beneficial for understanding the potential pitfalls of phylogenetic inference and would aid us in deriving a more accurate evolutionary relationship for a sequence dataset.

Related Results

Evolution and the cell
Evolution and the cell
Genotype to phenotype, and back again Evolution is intimately linked to biology at the cellular scale- evolutionary processes act on the very genetic material that is carried and ...
Evolutionary Biomechanics
Evolutionary Biomechanics
Life has diversified on Earth in many stunning ways. Understanding how this diversity arose and has been maintained is a common interest for many evolutionary biologists. One appro...
Decision support tools for the management in a Dry Afromontane Forest in Ethiopia
Decision support tools for the management in a Dry Afromontane Forest in Ethiopia
Ethiopia is one of the tropical countries endowed with diverse forest formations. These forests provide large amounts of wood that can be used for furniture, construction, and dome...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
Evolutionary Medicine
Evolutionary Medicine
Abstract Evolutionary medicine is a fast‐growing research field providing biomedical scientists with evolutionary perspective for the comprehens...
Multiple sequence alignment accuracy and evolutionary distance estimation
Multiple sequence alignment accuracy and evolutionary distance estimation
Abstract Background Sequence alignment is a common tool in bioinformatics and comparative genomics. It is generally assumed that multiple sequence a...
Influence of alignment uncertainty on homology and phylogenetic modeling
Influence of alignment uncertainty on homology and phylogenetic modeling
Most evolutionary analyses or structure modeling are based upon pre-estimated multiple sequence alignment (MSA) models. From a computational point of view, it is too complex to est...

Back to Top