Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Comparative gene annotation of 304 species of Drosophilidae

View through CrossRef
AbstractHigh-quality genome annotations are essential if we are to address central questions in comparative genomics, such as the origin of new genes, the drivers of genome size variation, and the evolutionary forces shaping gene content and structure. Here, we present protein-coding gene annotations for 304 species of the family Drosophilidae, generated using the Comparative Annotation Toolkit (CAT) and BRAKER3, and incorporating available RNA-seq and protein evidence. We take a comparative phylogenetic approach to annotation, with the aim of improving consistency and accuracy, and to generate a robust set of gene annotations and orthology assignments. We analyze our annotations using a phylogenetic mixed-model approach and find that gene number and CDS length exhibit moderate phylogenetic heritability (43.3% and 12.3%, respectively). This suggests that while evolutionary history contributes to variation in these traits, species-specific factors, including assembly error, play a substantial role in shaping observed differences. To illustrate the utility of our annotations for comparative analyses, we investigate codon usage bias and amino acid composition across Drosophilidae. We find that codon usage is correlated with overall GC content and evolves slowly, but that it is also strongly shaped by selection, such that, in general, species with the strongest selection on synonymous codon usage show the lowest GC bias in third codon positions. This comparative annotation dataset forms part of an on-going collaborative project to sequence and annotate all species of Drosophilidae, with data and annotations being made rapidly and freely available on an on-going basis. We hope that this effort will serve as a foundation for studies in evolutionary and functional genomics and comparative biology across Drosophilidae.
Title: Comparative gene annotation of 304 species of Drosophilidae
Description:
AbstractHigh-quality genome annotations are essential if we are to address central questions in comparative genomics, such as the origin of new genes, the drivers of genome size variation, and the evolutionary forces shaping gene content and structure.
Here, we present protein-coding gene annotations for 304 species of the family Drosophilidae, generated using the Comparative Annotation Toolkit (CAT) and BRAKER3, and incorporating available RNA-seq and protein evidence.
We take a comparative phylogenetic approach to annotation, with the aim of improving consistency and accuracy, and to generate a robust set of gene annotations and orthology assignments.
We analyze our annotations using a phylogenetic mixed-model approach and find that gene number and CDS length exhibit moderate phylogenetic heritability (43.
3% and 12.
3%, respectively).
This suggests that while evolutionary history contributes to variation in these traits, species-specific factors, including assembly error, play a substantial role in shaping observed differences.
To illustrate the utility of our annotations for comparative analyses, we investigate codon usage bias and amino acid composition across Drosophilidae.
We find that codon usage is correlated with overall GC content and evolves slowly, but that it is also strongly shaped by selection, such that, in general, species with the strongest selection on synonymous codon usage show the lowest GC bias in third codon positions.
This comparative annotation dataset forms part of an on-going collaborative project to sequence and annotate all species of Drosophilidae, with data and annotations being made rapidly and freely available on an on-going basis.
We hope that this effort will serve as a foundation for studies in evolutionary and functional genomics and comparative biology across Drosophilidae.

Related Results

Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Section-level genome sequencing and comparative genomics of Aspergillus sections Cavernicolus and Usti
Section-level genome sequencing and comparative genomics of Aspergillus sections Cavernicolus and Usti
Fig. S1. A cladogram representation of the phylogenetic relations between the species in this paper. The red labels show bootstrap values of 100 % and the black labels show bootstr...
Comportamiento de la Resistencia al desgaste erosivo de recubrimientos de alúmina
Comportamiento de la Resistencia al desgaste erosivo de recubrimientos de alúmina
 En este artículo se presentan los resultados de la investigación relacionada con el comportamiento experimental de recubrimientos de alúmina, obtenidos a partir de partículas de t...
Benchmarking Hayai-Annotation Plants: A Re-evaluation Using Standard Evaluation Metrics
Benchmarking Hayai-Annotation Plants: A Re-evaluation Using Standard Evaluation Metrics
Abstract The rapid growth of next-generation sequencing (NGS) technology has led to a surge in the determination of whole genome sequences in pla...
Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene...
An extensible genome annotation workbench based on the Galaxy Platform
An extensible genome annotation workbench based on the Galaxy Platform
Introduction Falling costs of genetic sequencing have allowed sequencing and annotation of the genomes of non-model organism. In annotating non-mod...
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
<p>Coastal environments are exposed to anthropogenic activities such as frequent marine traffic and restructuring, i.e., addition, removal or replacing with man-made structur...
Pinaceae show elevated rates of gene duplication and gene loss that are robust to incomplete gene annotation
Pinaceae show elevated rates of gene duplication and gene loss that are robust to incomplete gene annotation
Abstract Gene duplications and gene losses are major determinants of genome evolution and phenotypic diversity. The frequency of gene turnover (gene gains and gene ...

Back to Top