Ancestral sequence alignment under optimal conditions
Abstract
Background
Multiple genome alignment is an important problem in bioinformatics. A subproblem central to many multiple alignment approaches is that of aligning two multiple alignments. Many popular alignment algorithms for DNA use the sum-of-pairs heuristic, where the score of a multiple alignment is the sum of its induced pairwise alignment scores. However, the biological meaning of the sum-of-pairs heuristic is not obvious. Additionally, many algorithms based on the sum-of-pairs heuristic are complicated and slow compared to pairwise alignment algorithms.
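As an illustration of the sum-of-pairs score described above, the following minimal Python sketch sums the scores of all induced pairwise alignments. The function names and the match, mismatch, and linear gap values are illustrative assumptions, not the scoring scheme used in the study.

```python
from itertools import combinations


def pair_score(a, b, match=1, mismatch=-1, gap=-2):
    """Score one induced pairwise alignment (assumed linear gap cost)."""
    total = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            # A column that is a gap in both rows does not appear
            # in the induced pairwise alignment, so it is skipped.
            continue
        if x == "-" or y == "-":
            total += gap
        elif x == y:
            total += match
        else:
            total += mismatch
    return total


def sum_of_pairs(rows, **scoring):
    """Sum the induced pairwise scores over every pair of rows."""
    return sum(pair_score(a, b, **scoring) for a, b in combinations(rows, 2))


if __name__ == "__main__":
    alignment = ["AC-T", "ACGT", "A--T"]
    print(sum_of_pairs(alignment))
```

The number of row pairs grows quadratically with the number of sequences, which is one reason methods built on this score tend to cost more than a single pairwise alignment.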
An alternative approach to aligning alignments is to first infer ancestral sequences for each alignment, and then align the two ancestral sequences. In addition to being fast, this method has a clear biological basis that takes into account the evolution implied by an underlying phylogenetic tree.
In this study we explore the accuracy of aligning alignments by ancestral sequence alignment. We examine the use of both maximum likelihood and parsimony to infer ancestral sequences. Additionally, we investigate the effect on accuracy of allowing ambiguity in our ancestral sequences.
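To make the idea of ambiguous versus unambiguous ancestral sequences concrete, here is a minimal sketch of the bottom-up pass of Fitch parsimony applied column by column. The tree encoding (nested tuples of leaf names), the function names, and the tie-breaking rule for the unambiguous case are assumptions made for illustration; the study's own reconstruction procedure may differ.

```python
def fitch_sets(tree, column):
    """Bottom-up Fitch pass for one alignment column.

    tree:   a leaf name (str) or a (left, right) tuple of subtrees
    column: dict mapping leaf name -> character in this column
    """
    if isinstance(tree, str):
        return {column[tree]}
    left, right = (fitch_sets(child, column) for child in tree)
    common = left & right
    # Intersection if the children agree on some state, otherwise union.
    return common if common else left | right


def ancestral_sequence(tree, alignment, ambiguous=True):
    """Infer the root sequence of an alignment, one column at a time.

    With ambiguous=True each position is the full set of equally
    parsimonious states; otherwise one state is picked (assumed rule:
    the alphabetically smallest).
    """
    length = len(next(iter(alignment.values())))
    result = []
    for i in range(length):
        column = {name: seq[i] for name, seq in alignment.items()}
        states = fitch_sets(tree, column)
        result.append(frozenset(states) if ambiguous else min(states))
    return result


if __name__ == "__main__":
    tree = (("a", "b"), "c")
    alignment = {"a": "ACG", "b": "ATG", "c": "ATC"}
    print(ancestral_sequence(tree, alignment, ambiguous=True))
    print("".join(ancestral_sequence(tree, alignment, ambiguous=False)))
```

In this sketch the ambiguous form keeps every candidate state at a position, so a later pairwise alignment step has to decide how to score a set against a set.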
Results
We use synthetic sequence data that we generate by simulating evolution on a phylogenetic tree. We use two different types of phylogenetic trees: trees with a period of rapid growth followed by a period of slow growth, and trees with a period of slow growth followed by a period of rapid growth.
We examine the alignment accuracy of four ancestral sequence reconstruction and alignment methods: parsimony, maximum likelihood, ambiguous parsimony, and ambiguous maximum likelihood. Additionally, we compare against the alignment accuracy of two sum-of-pairs algorithms: ClustalW and the heuristic of Ma, Zhang, and Wang.
Conclusion
We find that allowing ambiguity in ancestral sequences does not lead to better multiple alignments. Regardless of whether we use parsimony or maximum likelihood, the success of aligning ancestral sequences containing ambiguity is very sensitive to the choice of gap open cost. Surprisingly, we find that using maximum likelihood to infer ancestral sequences yields less accurate alignments than using parsimony. Finally, we find that the sum-of-pairs methods produce better alignments than all of the ancestral alignment methods.
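The role of the gap open cost can be seen in a small sketch of global alignment with affine gaps (Gotoh-style recurrences) over ambiguous sequences. Everything here is an assumption for illustration: positions are sets of states, two positions count as a match when their sets intersect, a gap of length k costs gap_open + (k - 1) * gap_extend, and the numeric values are arbitrary rather than those used in the study.

```python
NEG = float("-inf")


def affine_score(s, t, match=1, mismatch=-1, gap_open=-4, gap_extend=-1):
    """Best global alignment score of s and t under affine gap costs.

    s, t: sequences of sets of states (ambiguous positions allowed).
    """
    n, m = len(s), len(t)
    # M: last column aligns s[i-1] with t[j-1]
    # X: last column aligns s[i-1] with a gap
    # Y: last column aligns t[j-1] with a gap
    M = [[NEG] * (m + 1) for _ in range(n + 1)]
    X = [[NEG] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = gap_open + (i - 1) * gap_extend
    for j in range(1, m + 1):
        Y[0][j] = gap_open + (j - 1) * gap_extend
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Assumed rule: ambiguous positions match if they share a state.
            sub = match if s[i - 1] & t[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + sub
            X[i][j] = max(M[i - 1][j] + gap_open,
                          X[i - 1][j] + gap_extend,
                          Y[i - 1][j] + gap_open)
            Y[i][j] = max(M[i][j - 1] + gap_open,
                          Y[i][j - 1] + gap_extend,
                          X[i][j - 1] + gap_open)
    return max(M[n][m], X[n][m], Y[n][m])


def as_sets(seq):
    """Turn a plain string into a sequence of single-state sets."""
    return [frozenset(c) for c in seq]


if __name__ == "__main__":
    for cost in (-1, -4, -8):
        print(cost, affine_score(as_sets("ACGT"), as_sets("AT"), gap_open=cost))
```

Under the intersection rule, ambiguous positions match more often than single states do, which changes the balance between substituting and opening a gap; this may be one reason the choice of gap open cost matters, though the sketch does not establish that.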

