
Varia: a tool for prediction, analysis and visualisation of variable genes

Abstract

Background: Parasites use polymorphic gene families to evade the immune system or to interact with the host. Assessing the diversity and expression of such gene families in pathogens can inform on the repertoire or on host interaction phenotypes of clinical relevance. However, obtaining the sequences and quantifying their expression is a challenge. In Plasmodium falciparum, the highly polymorphic var genes encode the major virulence protein, PfEMP1, which binds a range of human receptors through varying combinations of DBL and CIDR domains. Here we present a tool, Varia, that predicts near full-length gene sequences and domain compositions of query genes from database genes sharing short sequence tags. Varia generates output through two complementary pipelines. Varia_VIP returns all putative gene sequences and domain compositions of the query gene from any partial sequence provided, thereby enabling experimental validation of specific genes of interest and detailed assessment of their putative domain structure. Varia_GEM supports rapid profiling of var gene expression in complex patient samples from DBLα expressed sequence tags (ESTs) by computing an overall transcript profile for each sample, stratified by PfEMP1 domain type.

Results: Varia_VIP was tested by querying sequence tags from all DBL domain types using different search criteria. On average, 92% of query tags had one or more 99% identical database hits. As a result, the full-length query gene sequence (> 99% DNA identity over > 80% of the query gene) was identified among the five most prominent database hits for ~33% of the query genes. Optimized Varia_GEM settings allowed correct prediction of > 90% of domains placed among the four most N-terminal domains, including the DBLα domain, and > 70% of C-terminal domains. With this accuracy, N-terminal domains could be predicted for > 80% of queries, whereas prediction rates of C-terminal domains dropped with the distance from the DBLα domain, from 70% to 40%.

Conclusion: Prediction of var sequence and domain composition is possible from short sequence tags. Varia can be used to guide experimental validation of PfEMP1 sequences of interest and to conduct high-throughput analysis of var type expression in patient samples.
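The full-length identification criterion reported in the Results (> 99% DNA identity over > 80% of the query gene, among the five most prominent database hits) can be illustrated with a minimal sketch. This is not Varia's code: the `Hit` record, its field names, the `full_length_match` function, and the choice to rank hits by an alignment score are all assumptions made for illustration. The abstract does not say how "most prominent" is defined.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Hit:
    """One database hit for a query gene (hypothetical record layout)."""
    subject_id: str
    percent_identity: float  # DNA identity over the aligned region, 0-100
    aligned_length: int      # number of query bases covered by the alignment
    score: float             # ranking score (assumed proxy for "prominence")


def full_length_match(
    hits: List[Hit],
    query_length: int,
    top_n: int = 5,
    min_identity: float = 99.0,
    min_coverage: float = 0.80,
) -> Optional[Hit]:
    """Return the best-ranked hit among the top_n that exceeds both
    the identity and the coverage threshold, or None if there is none."""
    ranked = sorted(hits, key=lambda h: h.score, reverse=True)[:top_n]
    for hit in ranked:
        coverage = hit.aligned_length / query_length
        if hit.percent_identity > min_identity and coverage > min_coverage:
            return hit
    return None


# Made-up example values, not data from the study.
hits = [
    Hit("geneA", 99.6, 7000, 12000.0),  # high identity, ~93% coverage
    Hit("geneB", 99.9, 3000, 5000.0),   # high identity, only 40% coverage
    Hit("geneC", 97.0, 7400, 11000.0),  # good coverage, identity too low
]
match = full_length_match(hits, query_length=7500)
print(match.subject_id if match else "no full-length match")
```

Both thresholds are applied as strict inequalities to mirror the ">" signs in the abstract. A hit that covers most of the gene at lower identity, or a short but perfect hit, does not count as a full-length identification.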

Springer Science and Business Media LLC

Gavin Mackenzie, Rasmus W. Jensen, Thomas Lavstsen, Thomas D. Otto

BMC Bioinformatics

2022



