Classifiers of Medical Eponymy in Scientific Texts

Many concepts in the medical literature are named after persons. Frequent ambiguities and spelling variants, however, complicate the automatic recognition of such eponyms with natural language processing (NLP) tools. Recently developed methods include word vectors and transformer models that incorporate context information into the downstream layers of a neural network architecture. To evaluate these models for classifying medical eponymy, we labeled eponyms and counterexamples mentioned in a convenience sample of 1,079 PubMed abstracts, and fitted logistic regression models to the vectors from the first (vocabulary) and last (contextualized) layers of a SciBERT language model. According to the area under sensitivity-specificity curves, models based on contextualized vectors achieved a median performance of 98.0% on held-out phrases. This outperformed models based on vocabulary vectors (95.7%) by a median of 2.3 percentage points. When processing unlabeled inputs, such classifiers appeared to generalize to eponyms that did not appear among any annotations. These findings attest to the effectiveness of developing domain-specific NLP functions based on pre-trained language models, and underline the utility of context information for classifying potential eponyms.
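The evaluation metric named in the abstract, the area under the sensitivity-specificity (ROC) curve summarized as a median over held-out splits, can be illustrated with a minimal sketch. The function name `roc_auc`, the two toy splits, and their scores are hypothetical and are not taken from the paper; the sketch does not reproduce the SciBERT vectors or the logistic regression models, only the way a set of classifier scores could be turned into such a summary.

```python
from statistics import median


def roc_auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity.

    labels: 1 for an eponym, 0 for a counterexample (assumed encoding).
    scores: classifier outputs, higher meaning "more likely an eponym".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical held-out splits: (labels, scores). Not data from the paper.
splits = [
    ([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]),
    ([1, 0, 1, 0], [0.8, 0.2, 0.7, 0.3]),
]
aucs = [roc_auc(y, s) for y, s in splits]
median_auc = median(aucs)
print(aucs, median_auc)
```

In this toy setting, the AUC is the fraction of (eponym, counterexample) pairs in which the eponym receives the higher score, so a value of 1.0 means every eponym is ranked above every counterexample.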

IOS Press

Dennis Toddenroth

Studies in Health Technology and Informatics

2023
