Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

QT-WEAVER: Correcting quartet distribution improves phylogenomic analyses despite gene tree estimation error

View through CrossRef
AbstractSummarizing individual gene trees into species phylogenies using coalescent-based methods has become a standard approach in phylogenomics. However, gene tree estimation error (GTEE) arising from a combination of reasons (ranging from analytical factors to more biological causes, as in short gene sequences) can potentially impact the accuracy of phylogenomic inference. We, for the first time, introduce the problem of correcting the quartet distribution induced by a set of estimated gene trees, which involves updating the weights of the quartets to better reflect their relative importance within the gene tree distribution. We present QT-WEAVER, the first method of its kind, which learns the conflicts within the quartet distribution induced by a given set of gene trees and generates an updated quartet distribution by adjusting the weights accordingly. QT-WEAVER is a general-purpose technique needing no explicit modeling of the subject system or reasons for GTEE or gene tree heterogeneity. Experimental studies on a collection of simulated and empirical data sets suggest that QT-WEAVER can effectively account for GTEE, which results in a substantial improvement in the species tree accuracy. Additionally, the concept of quartet conflicts and related algorithmic and combinatorial innovations introduced in this study will benefit various quartet-based computations. Therefore, QT-WEAVER advances the state-of-the-art in species tree estimation from gene trees in the face of GTEE. QT-WEAVER is freely available in open-source form athttps://github.com/navidh86/QT-WEAVER.
Title: QT-WEAVER: Correcting quartet distribution improves phylogenomic analyses despite gene tree estimation error
Description:
AbstractSummarizing individual gene trees into species phylogenies using coalescent-based methods has become a standard approach in phylogenomics.
However, gene tree estimation error (GTEE) arising from a combination of reasons (ranging from analytical factors to more biological causes, as in short gene sequences) can potentially impact the accuracy of phylogenomic inference.
We, for the first time, introduce the problem of correcting the quartet distribution induced by a set of estimated gene trees, which involves updating the weights of the quartets to better reflect their relative importance within the gene tree distribution.
We present QT-WEAVER, the first method of its kind, which learns the conflicts within the quartet distribution induced by a given set of gene trees and generates an updated quartet distribution by adjusting the weights accordingly.
QT-WEAVER is a general-purpose technique needing no explicit modeling of the subject system or reasons for GTEE or gene tree heterogeneity.
Experimental studies on a collection of simulated and empirical data sets suggest that QT-WEAVER can effectively account for GTEE, which results in a substantial improvement in the species tree accuracy.
Additionally, the concept of quartet conflicts and related algorithmic and combinatorial innovations introduced in this study will benefit various quartet-based computations.
Therefore, QT-WEAVER advances the state-of-the-art in species tree estimation from gene trees in the face of GTEE.
QT-WEAVER is freely available in open-source form athttps://github.
com/navidh86/QT-WEAVER.

Related Results

Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene (rs...
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition
AbstractSpecies tree inference from gene family trees is a significant problem in computational biology. However, gene tree heterogeneity, which can be caused by several factors in...
REGULAR ARTICLES
REGULAR ARTICLES
L. Cowen and C. J. Schwarz       657Les Radio‐tags, en raison de leur détectabilitéélevée, ...
The First Professional String Quartet?: ReExamining an Account Attributed to Giuseppe Maria Cambini
The First Professional String Quartet?: ReExamining an Account Attributed to Giuseppe Maria Cambini
This study examines an 1804 essay about string-quartet performance published in the Allgemeine musikalische Zeitung , and attributed to the Italian-born and Paris-domiciled compose...
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
<p>Tropical forests are the most productive terrestrial ecosystems, global centres of biodiversity and important participants in the global carbon and water cycles. T...
The Sensitivity Feature Analysis for Tree Species Based on Image Statistical Properties
The Sensitivity Feature Analysis for Tree Species Based on Image Statistical Properties
While the statistical properties of images are vital in forestry engineering, the usefulness of these properties in various forestry tasks may vary, and certain image properties mi...
Spatial patterns of argan-tree influence on soil quality of intertree areas in open woodlands of South Morocco
Spatial patterns of argan-tree influence on soil quality of intertree areas in open woodlands of South Morocco
Abstract. The endemic argan tree (Argania spinosa) populations in South Morocco are highly degraded due to overbrowsing, illegal firewood extraction and the expansion of intensive ...
Agroforestry and Tree management in Kivuuvu Parish, Maanyi Subcounty, Mityana District. Uganda
Agroforestry and Tree management in Kivuuvu Parish, Maanyi Subcounty, Mityana District. Uganda
Abstract Agroforestry is an important alternative in land management systems to improve rural livelihoods. Timber and Non-timber Forest Products (NTFPs) have been the most ...

Back to Top