Javascript must be enabled to continue!
What Are We Missing? A Systematic Approach to Overlap Analyses of Local and Global Repositories
View through CrossRef
Local repositories, managed by institutions, often differ in coverage and metadata from the research
output affiliated with the concerned institutions in global repositories such as OpenAlex, which
aggregate records from numerous sources for broader visibility. This paper introduces a DOI
Screening System that systematically identifies and explains mismatches between local and global
repositories by classifying publications as local-only, matched, or global-only. The system applies
predefined rules and allows identifying patterns such as misattributed affiliations, unrecognized DOI
prefixes, incomplete metadata, or underrepresented publication types. Based on these patterns, one
can derive ‘curative’ actions. We demonstrate the system’s utility by comparing the repositories of
EPFL and of ETH Zurich to OpenAlex, showing how subtle inconsistencies in identifiers and
affiliations can account for many discrepancies. The system provides insights into how targeted
interventions addressing the root causes of these discrepancies can be used to enhance coverage and
reliability in both local and global repositories.
Institute for Informatics and Automation Problems of NAS RA
Title: What Are We Missing? A Systematic Approach to Overlap Analyses of Local and Global Repositories
Description:
Local repositories, managed by institutions, often differ in coverage and metadata from the research
output affiliated with the concerned institutions in global repositories such as OpenAlex, which
aggregate records from numerous sources for broader visibility.
This paper introduces a DOI
Screening System that systematically identifies and explains mismatches between local and global
repositories by classifying publications as local-only, matched, or global-only.
The system applies
predefined rules and allows identifying patterns such as misattributed affiliations, unrecognized DOI
prefixes, incomplete metadata, or underrepresented publication types.
Based on these patterns, one
can derive ‘curative’ actions.
We demonstrate the system’s utility by comparing the repositories of
EPFL and of ETH Zurich to OpenAlex, showing how subtle inconsistencies in identifiers and
affiliations can account for many discrepancies.
The system provides insights into how targeted
interventions addressing the root causes of these discrepancies can be used to enhance coverage and
reliability in both local and global repositories.
Related Results
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Microwave Ablation with or Without Chemotherapy in Management of Non-Small Cell Lung Cancer: A Systematic Review
Microwave Ablation with or Without Chemotherapy in Management of Non-Small Cell Lung Cancer: A Systematic Review
Abstract
Introduction
Microwave ablation (MWA) has emerged as a minimally invasive treatment for patients with inoperable non-small cell lung cancer (NSCLC). However, whether it i...
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
This review summarizes the evidence from six randomized controlled trials that judged the effectiveness of systematic review summaries on policymakers' decision making, or the most...
Towards Transparent Presentation of FAIR-enabling Data Repository Functions & Characteristics
Towards Transparent Presentation of FAIR-enabling Data Repository Functions & Characteristics
Identifying, finding and gaining a sufficient overview of the functions and characteristics of data repositories and their catalogues is essential for users of data repositories an...
Long-range superharmonic Josephson current and spin-triplet pairing correlations in a junction with ferromagnetic bilayers
Long-range superharmonic Josephson current and spin-triplet pairing correlations in a junction with ferromagnetic bilayers
AbstractThe long-range spin-triplet supercurrent transport is an interesting phenomenon in the superconductor/ferromagnet ("Equation missing") heterostructure containing noncolline...
Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis
Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis
Abstract
Background
The COVID-19 pandemic has revealed significant challenges in disease forecasting and in developing a public health response, ...
How is missing data handled in cluster randomized controlled trials? A review of trials published in the NIHR Journals Library 1997–2024
How is missing data handled in cluster randomized controlled trials? A review of trials published in the NIHR Journals Library 1997–2024
Background:
Cluster randomized controlled trials are increasingly used to evaluate the effectiveness of interventions in clinical and public health research. However, m...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract
Introduction
Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...

