Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Alignment-free Method for Phylogeny Estimation using Maximum Likelihood

View through CrossRef
Abstract While alignment has traditionally been the primary approach for establishing homology prior to phylogenetic inference, alignment-free methods offer a simplified alternative, particularly beneficial when handling genome-wide data involving long sequences and complex events such as rearrangements. Moreover, alignment-free methods become crucial for data types like genome skims, where assembly is impractical. However, despite these benefits, alignment-free techniques have not gained widespread acceptance since they lack the accuracy of alignment-based techniques, primarily due to their reliance on simplified models of pairwise distance calculation. Here, we present a likelihood based alignment-free technique for phylogenetic tree construction. We encode the presence or absence of k -mers in genome sequences in a binary matrix, and estimate phylogenetic trees using a maximum likelihood approach. We analyze the performance of our method on seven real datasets and compare the results with the state of the art alignment-free methods. Results suggest that our method is competitive with existing alignment-free tools. This indicates that maximum likelihood based alignment-free methods may in the future be refined to outperform alignment-free methods relying on distance calculation as has been the case in the alignment-based setting. A likelihood based alignment-free method for phylogeny estimation is implemented for the first time in a software named P eafowl , which is available at: https://github.com/hasin-abrar/Peafowlrepo .
Title: An Alignment-free Method for Phylogeny Estimation using Maximum Likelihood
Description:
Abstract While alignment has traditionally been the primary approach for establishing homology prior to phylogenetic inference, alignment-free methods offer a simplified alternative, particularly beneficial when handling genome-wide data involving long sequences and complex events such as rearrangements.
Moreover, alignment-free methods become crucial for data types like genome skims, where assembly is impractical.
However, despite these benefits, alignment-free techniques have not gained widespread acceptance since they lack the accuracy of alignment-based techniques, primarily due to their reliance on simplified models of pairwise distance calculation.
Here, we present a likelihood based alignment-free technique for phylogenetic tree construction.
We encode the presence or absence of k -mers in genome sequences in a binary matrix, and estimate phylogenetic trees using a maximum likelihood approach.
We analyze the performance of our method on seven real datasets and compare the results with the state of the art alignment-free methods.
Results suggest that our method is competitive with existing alignment-free tools.
This indicates that maximum likelihood based alignment-free methods may in the future be refined to outperform alignment-free methods relying on distance calculation as has been the case in the alignment-based setting.
A likelihood based alignment-free method for phylogeny estimation is implemented for the first time in a software named P eafowl , which is available at: https://github.
com/hasin-abrar/Peafowlrepo .

Related Results

Ancestral sequence alignment under optimal conditions
Ancestral sequence alignment under optimal conditions
Abstract Background Multiple genome alignment is an important problem in bioinformatics. An important subproblem used by many multiple alignment app...
Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
Integrated Likelihood for Phylogenomics under a No-Common-Mechanism Model
The availability of genome-wide sequence data from a large number of species as well as data from multiple individuals within a species has ushered in the era of phylogenomics. In ...
Alignment Free Phylogeny Construction Using Maximum Likelihood Using k -mer Counts
Alignment Free Phylogeny Construction Using Maximum Likelihood Using k -mer Counts
Estimating phylogenetic trees from molecular data often involves first performing a multiple sequence alignment of the sequences and then identifying the tree that maximizes likeli...
Implementation of Spatial Error Model through Maximum Likelihood
Implementation of Spatial Error Model through Maximum Likelihood
The implementation of spatial models through maximum likelihood is a great challenge due to the presence of Jacobian term in the log likelihood function. In literature, the impleme...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
Improving Methods of Measuring Curriculum Alignment
Improving Methods of Measuring Curriculum Alignment
Curriculum alignment studies demonstrate relationships among curriculum, instruction, and assessment (CIA). A well-aligned CIA ensures better opportunities for the students to lear...
Environmental Surveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) v2
Environmental Surveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) v2
EnvironmentalSurveillance Protocols for Highly Pathogenic Avian Influenza (HPAI) This comprehensive protocol suite enables systematic environmental surveillance for avian influenza...
Multiple sequence alignment accuracy and evolutionary distance estimation
Multiple sequence alignment accuracy and evolutionary distance estimation
Abstract Background Sequence alignment is a common tool in bioinformatics and comparative genomics. It is generally assumed that multiple sequence a...

Back to Top