Javascript must be enabled to continue!
Gentrius: identifying equally scoring trees in phylogenomics with incomplete data
View through CrossRef
Abstract
Phylogenetic trees are routinely built from huge and yet incomplete multi-locus datasets often leading to phylogenetic terraces – topologically distinct equally scoring trees, which induce the same set of per locus subtrees. As typical tree inference software outputs only a single tree, identifying all trees with identical score challenges phylogenomics. Generating all trees from a terrace requires constructing a so-called stand for the corresponding set of induced locus subtrees. Here, we introduce Gentrius – an efficient algorithm that tackles this problem for unrooted trees. Despite stand generation being computationally intractable, we showed on simulated and biological datasets that Gentrius generates stands with millions of trees in feasible time. Depending on the distribution of missing data across species and loci and the inferred phylogeny, the number of equally optimal terrace trees varies tremendously. The strict consensus tree computed from them displays all the branches unaffected by the pattern of missing data. Thus, Gentrius provides an important systematic assessment of phylogenetic trees inferred from incomplete data. Furthermore, Gentrius can aid theoretical research by fostering understanding of tree space structure imposed by missing data.
One-Sentence Summary
Gentrius - the algorithm to generate a complete stand, i.e. all binary unrooted trees compatible with the same set of subtrees.
Title: Gentrius: identifying equally scoring trees in phylogenomics with incomplete data
Description:
Abstract
Phylogenetic trees are routinely built from huge and yet incomplete multi-locus datasets often leading to phylogenetic terraces – topologically distinct equally scoring trees, which induce the same set of per locus subtrees.
As typical tree inference software outputs only a single tree, identifying all trees with identical score challenges phylogenomics.
Generating all trees from a terrace requires constructing a so-called stand for the corresponding set of induced locus subtrees.
Here, we introduce Gentrius – an efficient algorithm that tackles this problem for unrooted trees.
Despite stand generation being computationally intractable, we showed on simulated and biological datasets that Gentrius generates stands with millions of trees in feasible time.
Depending on the distribution of missing data across species and loci and the inferred phylogeny, the number of equally optimal terrace trees varies tremendously.
The strict consensus tree computed from them displays all the branches unaffected by the pattern of missing data.
Thus, Gentrius provides an important systematic assessment of phylogenetic trees inferred from incomplete data.
Furthermore, Gentrius can aid theoretical research by fostering understanding of tree space structure imposed by missing data.
One-Sentence Summary
Gentrius - the algorithm to generate a complete stand, i.
e.
all binary unrooted trees compatible with the same set of subtrees.
Related Results
Effective customer selection for marketing campaigns based on net scores
Effective customer selection for marketing campaigns based on net scores
Purpose
This paper aims to address the effective selection of customers for direct marketing campaigns. It introduces a new method to forecast campaign-related uplifts (also known ...
A Development of Electronic Scoring System for Artistic Gymnastics Competitions Based on International Gymnastics Rules and Regulations
A Development of Electronic Scoring System for Artistic Gymnastics Competitions Based on International Gymnastics Rules and Regulations
Background and Aim: In China, artistic gymnastics is one of the traditionally advantageous programs in competitive sports, and it has long been in the leading position in the world...
Genetic Programming for Symbolic Regression on Incomplete Data
Genetic Programming for Symbolic Regression on Incomplete Data
<p><b>Symbolic regression is the process of constructing mathematical expressions that best fit given data sets, where a target variable is expressed in terms of input ...
Clinical impact of manual scoring of peripheral arterial tonometry in patients with sleep apnea
Clinical impact of manual scoring of peripheral arterial tonometry in patients with sleep apnea
Abstract
Purpose
The objective was to analyze the clinical implications of manual scoring of sleep studies using peripheral arterial tonometry (PAT)...
A reproducible approach for scoring TIL in residual tumors after neoadjuvant treatment of breast cancer patients
A reproducible approach for scoring TIL in residual tumors after neoadjuvant treatment of breast cancer patients
Abstract
Neoadjuvant chemotherapy (NAC) is standard of care for patients with locally advanced breast cancer. TIL scoring is prognostic for response and has additional pred...
KARATE SCORING SYSTEM: Aplikasi Skoring Berbasis Android
KARATE SCORING SYSTEM: Aplikasi Skoring Berbasis Android
Perkembangan ilmu pengetahuan dan teknologi menuntut para penyelenggara kegiatan olahraga untuk melakukan inovasi dalam menghadirkan suatu pertandingan yang efektif dan efisien. WK...
The effect of the periodic investigation model on attentive control and learning the scoring skill from persistence and peaceful scoring in female basketball
The effect of the periodic investigation model on attentive control and learning the scoring skill from persistence and peaceful scoring in female basketball
The study aimed to identify the degree of attention control among students of the second stage in the college of university knowledge / Department of Physical Education and Sports ...
Mapping geographical inequalities of incomplete immunization in Ethiopia: a spatial with multilevel analysis
Mapping geographical inequalities of incomplete immunization in Ethiopia: a spatial with multilevel analysis
BackgroundImmunization is one of the most cost-effective interventions, averting 3.5–5 million deaths every year worldwide. However, incomplete immunization remains a major public ...

