HDI Corpus: A Dataset for Named Entity Recognition for In-Context Herb-Drug Interactions
Introduction
This article presents a new Named Entity Recognition dataset, built from PubMed articles, that addresses the problem of herb-drug interactions. It is designed for recognizing herb-drug interaction entities together with their contextual information.
Background
Machine learning and deep learning provide powerful tools for task automation, but they require large quantities of data to perform well. In Natural Language Processing, training deep learning models requires annotating large text corpora. While some corpora exist for medical literature, each specific task requires an adapted corpus.
Methods
The dataset was evaluated with a classical Named Entity Recognition pipeline, as well as with newer approaches based on generative AI.
Results
The dataset provides annotated sentences from around a hundred articles and covers 15 entity types, including herbs, drugs, and pathologies, as well as contextual information such as cohort composition, patient information, and pharmacological clues.
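To make the idea of sentence-level entity annotation concrete, here is a minimal sketch of how span annotations are commonly converted into BIO tags for a Named Entity Recognition pipeline. It is not the corpus's actual format: the label names (`HERB`, `DRUG`, `COHORT`), the `spans_to_bio` helper, and the example sentence are all assumptions made for illustration, and the sentence is invented rather than taken from the dataset.

```python
from typing import List, Tuple

# Hypothetical span annotation: (start_token, end_token_exclusive, label).
Span = Tuple[int, int, str]


def spans_to_bio(tokens: List[str], spans: List[Span]) -> List[str]:
    """Convert token-level entity spans into one BIO tag per token."""
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        if not (0 <= start < end <= len(tokens)):
            raise ValueError(f"span out of range: {(start, end, label)}")
        if any(tag != "O" for tag in tags[start:end]):
            raise ValueError(f"overlapping span: {(start, end, label)}")
        tags[start] = f"B-{label}"          # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"          # continuation tokens
    return tags


# Invented example sentence; label names are assumptions, not the corpus schema.
tokens = ["St", "John's", "wort", "reduced", "cyclosporine",
          "levels", "in", "transplant", "patients"]
spans = [(0, 3, "HERB"), (4, 5, "DRUG"), (7, 9, "COHORT")]

tags = spans_to_bio(tokens, spans)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

The helper rejects overlapping spans because flat BIO tagging assigns exactly one tag per token; a corpus with nested entities would need a different encoding.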
Discussion
The study demonstrates that, for drug recognition, this dataset performs comparably to the DDI (Drug-Drug Interaction) corpus, a standard dataset in drug Named Entity Recognition, and that it performs well on most of the other entities.
Conclusion
We believe this corpus could help diversify pharmacological Named Entity Recognition.
Bentham Science Publishers Ltd.

