Javascript must be enabled to continue!
Comparative gene annotation of 304 species of Drosophilidae
View through CrossRef
AbstractHigh-quality genome annotations are essential if we are to address central questions in comparative genomics, such as the origin of new genes, the drivers of genome size variation, and the evolutionary forces shaping gene content and structure. Here, we present protein-coding gene annotations for 304 species of the family Drosophilidae, generated using the Comparative Annotation Toolkit (CAT) and BRAKER3, and incorporating available RNA-seq and protein evidence. We take a comparative phylogenetic approach to annotation, with the aim of improving consistency and accuracy, and to generate a robust set of gene annotations and orthology assignments. We analyze our annotations using a phylogenetic mixed-model approach and find that gene number and CDS length exhibit moderate phylogenetic heritability (43.3% and 12.3%, respectively). This suggests that while evolutionary history contributes to variation in these traits, species-specific factors, including assembly error, play a substantial role in shaping observed differences. To illustrate the utility of our annotations for comparative analyses, we investigate codon usage bias and amino acid composition across Drosophilidae. We find that codon usage is correlated with overall GC content and evolves slowly, but that it is also strongly shaped by selection, such that, in general, species with the strongest selection on synonymous codon usage show the lowest GC bias in third codon positions. This comparative annotation dataset forms part of an on-going collaborative project to sequence and annotate all species of Drosophilidae, with data and annotations being made rapidly and freely available on an on-going basis. We hope that this effort will serve as a foundation for studies in evolutionary and functional genomics and comparative biology across Drosophilidae.
Title: Comparative gene annotation of 304 species of Drosophilidae
Description:
AbstractHigh-quality genome annotations are essential if we are to address central questions in comparative genomics, such as the origin of new genes, the drivers of genome size variation, and the evolutionary forces shaping gene content and structure.
Here, we present protein-coding gene annotations for 304 species of the family Drosophilidae, generated using the Comparative Annotation Toolkit (CAT) and BRAKER3, and incorporating available RNA-seq and protein evidence.
We take a comparative phylogenetic approach to annotation, with the aim of improving consistency and accuracy, and to generate a robust set of gene annotations and orthology assignments.
We analyze our annotations using a phylogenetic mixed-model approach and find that gene number and CDS length exhibit moderate phylogenetic heritability (43.
3% and 12.
3%, respectively).
This suggests that while evolutionary history contributes to variation in these traits, species-specific factors, including assembly error, play a substantial role in shaping observed differences.
To illustrate the utility of our annotations for comparative analyses, we investigate codon usage bias and amino acid composition across Drosophilidae.
We find that codon usage is correlated with overall GC content and evolves slowly, but that it is also strongly shaped by selection, such that, in general, species with the strongest selection on synonymous codon usage show the lowest GC bias in third codon positions.
This comparative annotation dataset forms part of an on-going collaborative project to sequence and annotate all species of Drosophilidae, with data and annotations being made rapidly and freely available on an on-going basis.
We hope that this effort will serve as a foundation for studies in evolutionary and functional genomics and comparative biology across Drosophilidae.
Related Results
Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Comportamiento de la Resistencia al desgaste erosivo de recubrimientos de alúmina
Comportamiento de la Resistencia al desgaste erosivo de recubrimientos de alúmina
En este artículo se presentan los resultados de la investigación relacionada con el comportamiento experimental de recubrimientos de alúmina, obtenidos a partir de partículas de t...
Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT
Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene (rs...
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
Impacts of man-made structures on marine biodiversity and species status - native & non-native species
<p>Coastal environments are exposed to anthropogenic activities such as frequent marine traffic and restructuring, i.e., addition, removal or replacing with man-made structur...
Duplex hardening on SUS 304 with pack carburizing – nitriding process
Duplex hardening on SUS 304 with pack carburizing – nitriding process
The object of research: The object of research in this study is Austenic Sus 304 stains that are subject to surface treatment through the duplex hardening process using a combinati...
Mining sequence annotation databanks for association patterns
Mining sequence annotation databanks for association patterns
Abstract
Motivation: Millions of protein sequences currently being deposited to sequence databanks will never be annotated manually. Similarity-based annotation gene...
QALB: Qatar Arabic language bank
QALB: Qatar Arabic language bank
Automatic text correction has been attracting research attention for English and some other western languages. Applications for automatic text correction vary from improving langua...
A Framework for Comparing Phenotype Annotations of Orthologous Genes
A Framework for Comparing Phenotype Annotations of Orthologous Genes
Objectives: Animal models are a key resource for the investigation of human diseases. In contrast to functional annotation, phenotype annotation is less standard, and comparing phe...

