Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Phylogenomic terraces: presence and implication in species tree estimation from gene trees

View through CrossRef
Abstract Species tree estimation from multi-locus dataset is extremely challenging, especially in the presence of gene tree heterogeneity across the genome due to incomplete lineage sorting (ILS). Summary methods have been developed which estimate gene trees and then combine the gene trees to estimate a species tree by optimizing various optimization scores. In this study, we have formalized the concept of “phylogenomic terraces” in the species tree space, where multiple species trees with distinct topologies may have exactly the same optimization score ( quartet score, extra lineage score , etc.) with respect to a collection of gene trees. We investigated the presence and implication of terraces in species tree estimation from multi-locus data by taking ILS into account. We analyzed two of the most popular ILS-aware optimization criteria: maximize quartet consistency (MQC) and minimize deep coalescence (MDC). Methods based on MQC are provably statistically consistent, whereas MDC is not a consistent criterion for species tree estimation. Our experiments, on a collection of dataset simulated under ILS, indicate that MDC-based methods may achieve competitive or identical quartet consistency score as MQC but could be significantly worse than MQC in terms of tree accuracy – demonstrating the presence and affect of phylogenomic terraces. This is the first known study that formalizes the concept of phylogenomic terraces in the context of species tree estimation from multi-locus data, and reports the presence and implications of terraces in species tree estimation under ILS.
Title: Phylogenomic terraces: presence and implication in species tree estimation from gene trees
Description:
Abstract Species tree estimation from multi-locus dataset is extremely challenging, especially in the presence of gene tree heterogeneity across the genome due to incomplete lineage sorting (ILS).
Summary methods have been developed which estimate gene trees and then combine the gene trees to estimate a species tree by optimizing various optimization scores.
In this study, we have formalized the concept of “phylogenomic terraces” in the species tree space, where multiple species trees with distinct topologies may have exactly the same optimization score ( quartet score, extra lineage score , etc.
) with respect to a collection of gene trees.
We investigated the presence and implication of terraces in species tree estimation from multi-locus data by taking ILS into account.
We analyzed two of the most popular ILS-aware optimization criteria: maximize quartet consistency (MQC) and minimize deep coalescence (MDC).
Methods based on MQC are provably statistically consistent, whereas MDC is not a consistent criterion for species tree estimation.
Our experiments, on a collection of dataset simulated under ILS, indicate that MDC-based methods may achieve competitive or identical quartet consistency score as MQC but could be significantly worse than MQC in terms of tree accuracy – demonstrating the presence and affect of phylogenomic terraces.
This is the first known study that formalizes the concept of phylogenomic terraces in the context of species tree estimation from multi-locus data, and reports the presence and implications of terraces in species tree estimation under ILS.

Related Results

Changes in Terrace Structures and Soil Properties in Hani Paddy Terraces after Conversion to Upland Terraces
Changes in Terrace Structures and Soil Properties in Hani Paddy Terraces after Conversion to Upland Terraces
<p>Terraces are important practice to conserve soil and water in farming systems in mountain areas. Since the mid- 20<sup>th</sup> century...
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition
AbstractSpecies tree inference from gene family trees is a significant problem in computational biology. However, gene tree heterogeneity, which can be caused by several factors in...
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
Inter-specific variations in tree stem methane and nitrous oxide exchanges in a tropical rainforest
<p>Tropical forests are the most productive terrestrial ecosystems, global centres of biodiversity and important participants in the global carbon and water cycles. T...
Upper plate deformation and its relationship to the underlying Hikurangi subduction interface, southern North Island, New Zealand
Upper plate deformation and its relationship to the underlying Hikurangi subduction interface, southern North Island, New Zealand
<p>At the southern Hikurangi margin, the subduction interface between the Australian and Pacific plates, beneath the southern North Island of New Zealand, is ‘locked’. It has...
QT-WEAVER: Correcting quartet distribution improves phylogenomic analyses despite gene tree estimation error
QT-WEAVER: Correcting quartet distribution improves phylogenomic analyses despite gene tree estimation error
Abstract Summarizing individual gene trees into species phylogenies using coalescent-based methods has become a standard approach in phylogenomic...
The Sensitivity Feature Analysis for Tree Species Based on Image Statistical Properties
The Sensitivity Feature Analysis for Tree Species Based on Image Statistical Properties
While the statistical properties of images are vital in forestry engineering, the usefulness of these properties in various forestry tasks may vary, and certain image properties mi...
Inventarisasi Pohon Plus Dalam Blok Koleksi Di Taman Hutan Raya Wan Abdul Rachman
Inventarisasi Pohon Plus Dalam Blok Koleksi Di Taman Hutan Raya Wan Abdul Rachman
Plus tree inventory was an activity for collecting and compiling data.Collection block was an area within Great Forest Park region that contains different types of plant, either en...
Automatic mapping of terrace systems at large scales: a case study of Cyprus
Automatic mapping of terrace systems at large scales: a case study of Cyprus
Agricultural terraces are among the most significant anthropogenic land modifications in the Mediterranean. They are constructed to reduce local slope gradients and facilitate farm...

Back to Top