Javascript must be enabled to continue!
Classifiers of Medical Eponymy in Scientific Texts
View through CrossRef
Many concepts in the medical literature are named after persons. Frequent ambiguities and spelling varieties, however, complicate the automatic recognition of such eponyms with natural language processing (NLP) tools. Recently developed methods include word vectors and transformer models that incorporate context information into the downstream layers of a neural network architecture. To evaluate these models for classifying medical eponymy, we label eponyms and counterexamples mentioned in a convenience sample of 1,079 Pubmed abstracts, and fit logistic regression models to the vectors from the first (vocabulary) and last (contextualized) layers of a SciBERT language model. According to the area under sensitivity-specificity curves, models based on contextualized vectors achieved a median performance of 98.0% in held-out phrases. This outperformed models based on vocabulary vectors (95.7%) by a median of 2.3 percentage points. When processing unlabeled inputs, such classifiers appeared to generalize to eponyms that did not appear among any annotations. These findings attest to the effectiveness of developing domain-specific NLP functions based on pre-trained language models, and underline the utility of context information for classifying potential eponyms.
Title: Classifiers of Medical Eponymy in Scientific Texts
Description:
Many concepts in the medical literature are named after persons.
Frequent ambiguities and spelling varieties, however, complicate the automatic recognition of such eponyms with natural language processing (NLP) tools.
Recently developed methods include word vectors and transformer models that incorporate context information into the downstream layers of a neural network architecture.
To evaluate these models for classifying medical eponymy, we label eponyms and counterexamples mentioned in a convenience sample of 1,079 Pubmed abstracts, and fit logistic regression models to the vectors from the first (vocabulary) and last (contextualized) layers of a SciBERT language model.
According to the area under sensitivity-specificity curves, models based on contextualized vectors achieved a median performance of 98.
0% in held-out phrases.
This outperformed models based on vocabulary vectors (95.
7%) by a median of 2.
3 percentage points.
When processing unlabeled inputs, such classifiers appeared to generalize to eponyms that did not appear among any annotations.
These findings attest to the effectiveness of developing domain-specific NLP functions based on pre-trained language models, and underline the utility of context information for classifying potential eponyms.
Related Results
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Genre implies formal and stylistic conventions of a particular text type, which inevitably affects the translation process. This „force of genre bias“ (Prieto Ramos, 2014) has been...
Epônimos em textos científicos
Epônimos em textos científicos
Eponyms are linguistic phenomena present in scientific language in various fields of science. Aiming to contribute to new themes and objects of study in the field of Information Sc...
A Comparative Study of Indonesian and Japanese Classifiers
A Comparative Study of Indonesian and Japanese Classifiers
Abstract
Classifiers belong to open class noun. All languages are naturally occupied with classifiers, yet the usage is various depending on how the language tre...
Responsibilised Resilience? Reworking Neoliberal Social Policy Texts
Responsibilised Resilience? Reworking Neoliberal Social Policy Texts
Introduction This essay begins with the premise that resilience, broadly defined as positive adaptation despite adversity (Garmezy and Rutter), and resilience building are importa...
Biblical Texts and Interpretations in the Dead Sea Scrolls: Biblical Texts
Biblical Texts and Interpretations in the Dead Sea Scrolls: Biblical Texts
The introduction to this entry places the Dead Sea Scrolls in their historical and chronological context and discusses the popularity and provenance of the texts found in the Judea...
Machine Learning and Semantic Orientation Ensemble Methods for Egyptian Telecom Tweets Sentiment Analysis
Machine Learning and Semantic Orientation Ensemble Methods for Egyptian Telecom Tweets Sentiment Analysis
The vast amount of data currently available online attracted many parties to analyze sentiments expressed in these data extracting valuable knowledge. Many approaches have been pro...
Numeral classifiers in Japanese
Numeral classifiers in Japanese
This examination of numeral classifiers in standard Japanese (hyoojungo) focuses on their interaction with nouns and their referents in terms of both meaning and function. By uniti...
Supervised Machine Learning for Aiding Diagnosis of Knee Osteoarthritis: A Systematic Review and Meta-Analysis
Supervised Machine Learning for Aiding Diagnosis of Knee Osteoarthritis: A Systematic Review and Meta-Analysis
Background
Knee osteoarthritis (OA) remains a leading aetiology of disability
worldwide. Clinical assessment of such knee-related conditions has
im...

