Javascript must be enabled to continue!
The BEL Information Extraction Workflow (BELIEF): updates and evaluation
View through CrossRef
Ever-increasing scientific literature enhances our understanding on how toxicants impact biological systems. In order to utilize this information in the growing field of systems biology and toxicology, the published knowledge must be transformed into a structured format highly efficient for modelling, reasoning, and ultimately high throughput data analysis and interpretation. Consequently, there is an increasing demand from systems biologists and toxicologists to access such knowledge in a computable format, here biological network models.
The Biological Expression Language (BEL) is a machine- and human-readable language that represents molecular relationships and events as semantic triples: subject–relationship–object. These triples are called BEL statements. BEL statements associated with their evidence are encapsulated into the large BEL document. The BEL document also captures additional information such as article references, PMID of the scientific article and a large annotation dataset that accurately defines the context of knowledge such as the organism, tissue and disease state. BEL statements can computationally be assembled to biological network models.
To facilitate encoding and curation of biological findings in BEL, a BEL Information Extraction workFlow (BELIEF) was developed. BELIEF contains a text mining pipeline for the automatic generation of BEL compliant knowledge statements and a web-based curation interface – the BELIEF Dashboard – that facilitates manual curation of the automatically generated BEL statements. The text mining pipeline is UIMA-based and accommodates several named entity recognition processes and relationship extraction methods in order to detect concepts and BEL relationships from any text resource. The BELIEF Dashboard reuses the output of the BELIEF pipeline to create a web-based interface and facilitate the manual curation task. Although BEL itself enables curation given the human-readability of the syntax, BELIEF simplifies the curation process by highlighting named entities and disambiguating gene and protein names.
In addition to the new features of BELIEF, we also present the competitive performance results based on the BioCreative V BEL track evaluation. The BELIEF pipeline automatically extracts normalized concepts with the best F-score of 76.8%. The detection of full relationships and entirely correct statements was achieved with the F-score of 43.1% (second-best) and 30.8% (highest). The participation at the Interactive task (IAT) track in BioCreative V revealed a Systems Usability Scale (SUS) of 67. Given the complexity of the task for untrained users this score certifies a high usability for BELIEF. In conclusion, BELIEF simplifies the curation process and facilitates the construction of biological network models that can be fully contextualized and used for the interpretation of systems biology and systems toxicology data. This workflow is currently being further developed to be used in an industrial setup for product risk assessment.
F1000 Research Ltd
Title: The BEL Information Extraction Workflow (BELIEF): updates and evaluation
Description:
Ever-increasing scientific literature enhances our understanding on how toxicants impact biological systems.
In order to utilize this information in the growing field of systems biology and toxicology, the published knowledge must be transformed into a structured format highly efficient for modelling, reasoning, and ultimately high throughput data analysis and interpretation.
Consequently, there is an increasing demand from systems biologists and toxicologists to access such knowledge in a computable format, here biological network models.
The Biological Expression Language (BEL) is a machine- and human-readable language that represents molecular relationships and events as semantic triples: subject–relationship–object.
These triples are called BEL statements.
BEL statements associated with their evidence are encapsulated into the large BEL document.
The BEL document also captures additional information such as article references, PMID of the scientific article and a large annotation dataset that accurately defines the context of knowledge such as the organism, tissue and disease state.
BEL statements can computationally be assembled to biological network models.
To facilitate encoding and curation of biological findings in BEL, a BEL Information Extraction workFlow (BELIEF) was developed.
BELIEF contains a text mining pipeline for the automatic generation of BEL compliant knowledge statements and a web-based curation interface – the BELIEF Dashboard – that facilitates manual curation of the automatically generated BEL statements.
The text mining pipeline is UIMA-based and accommodates several named entity recognition processes and relationship extraction methods in order to detect concepts and BEL relationships from any text resource.
The BELIEF Dashboard reuses the output of the BELIEF pipeline to create a web-based interface and facilitate the manual curation task.
Although BEL itself enables curation given the human-readability of the syntax, BELIEF simplifies the curation process by highlighting named entities and disambiguating gene and protein names.
In addition to the new features of BELIEF, we also present the competitive performance results based on the BioCreative V BEL track evaluation.
The BELIEF pipeline automatically extracts normalized concepts with the best F-score of 76.
8%.
The detection of full relationships and entirely correct statements was achieved with the F-score of 43.
1% (second-best) and 30.
8% (highest).
The participation at the Interactive task (IAT) track in BioCreative V revealed a Systems Usability Scale (SUS) of 67.
Given the complexity of the task for untrained users this score certifies a high usability for BELIEF.
In conclusion, BELIEF simplifies the curation process and facilitates the construction of biological network models that can be fully contextualized and used for the interpretation of systems biology and systems toxicology data.
This workflow is currently being further developed to be used in an industrial setup for product risk assessment.
Related Results
The BEL Information Extraction Workflow (BELIEF): updates and evaluation
The BEL Information Extraction Workflow (BELIEF): updates and evaluation
Ever-increasing scientific literature enhances our understanding on how toxicants impact biological systems. In order to utilize this information in the growing field of systems bi...
Bellidifolin Inhibits Proliferation of A549 Cells by Regulating STAT3/COX-2 Expression and Protein Activity
Bellidifolin Inhibits Proliferation of A549 Cells by Regulating STAT3/COX-2 Expression and Protein Activity
Objectives. Bellidifolin (BEL) is one type of tetraoxygenated xanthone that is particularly found in Swertia and Gentiana (Gentianaceae). Despite its broad range of pharmacological...
Crystal Structure of the Pseudomonas aeruginosa BEL-1 Extended-Spectrum β-Lactamase and Its Complexes with Moxalactam and Imipenem
Crystal Structure of the Pseudomonas aeruginosa BEL-1 Extended-Spectrum β-Lactamase and Its Complexes with Moxalactam and Imipenem
ABSTRACT
BEL-1 is an acquired class A extended-spectrum β-lactamase (ESBL) found in
Pseudomonas aeruginosa
clinical isolates from Bel...
Authentication and Integrity of UEFI Capsule Updates
Authentication and Integrity of UEFI Capsule Updates
The security of firmware updates is a fundamental aspect of protecting modern computing systems from vulnerabilities and malicious threats. UEFI (Unified Extensible Firmware Interf...
Religious Belief in Philosophy and Psychology
Religious Belief in Philosophy and Psychology
What is the nature of religious belief, and what factors influence an individual’s formation and maintenance of religious beliefs? This article collects resources that address thes...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract
The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...
ЛЕГИТИМИЗАЦИЯ ФРАНЦУЗСКОЙ РЕЧЕВОЙ ПРАКТИКИ LE BON / BEL USAGE
ЛЕГИТИМИЗАЦИЯ ФРАНЦУЗСКОЙ РЕЧЕВОЙ ПРАКТИКИ LE BON / BEL USAGE
В статье представлены результаты исследования социальных значений языковой единицы le bon/bel usage , осмысленных в рамках социокультурного контекста формирования и легитимизации н...
Optimization of ultrasonic extraction of
Lycium barbarum
polysaccharides using response surface methodology
Optimization of ultrasonic extraction of
Lycium barbarum
polysaccharides using response surface methodology
Abstract
Ultrasonic extraction was a new development method to achieve high-efficiency extraction of
Lycium barbarum
...

