Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Evaluating Large Language Models in Interpreting Cervical Cytology

View through CrossRef
Abstract Background Large language models (LLMs) have shown promise in medical imaging, but their utility in cytology remains underexplored. This study evaluates GPT-5 and Gemini 2.5 Pro for Pap smear interpretation. Methods Digital cervical Pap smear images of 100 cases were obtained from the Hologic Education Site, with Hologic diagnoses considered the gold standard. Representative images were uploaded into GPT-5 and Gemini 2.5 Pro and prompted to provide a diagnosis based on the Third Edition of the Bethesda System for Reporting Cervical Cytopathology. Cases with infectious organisms were assessed using additional images. Concordance was evaluated at exact diagnosis and clinical management groupings, wherein diagnoses with similar management implications were grouped. Sensitivity and specificity for abnormal cytology were also calculated. Results Concordance of both LLMs for exact diagnostic matches were comparable (GPT-5: 47%, Gemini: 48%) and increased to 66% for clinical management grouping. GPT-5 performed best for low-grade squamous intraepithelial lesions (75%), whereas Gemini 2.5 Pro showed the highest concordance in the high-grade squamous intraepithelial lesion (HSIL) category (82%), although this was largely attributable to its strong tendency to overcall cases as HSIL. Sensitivity for detecting abnormal cytology was 74% for GPT-5 and 84% for Gemini, with specificity of 74% and 71%, respectively. GPT-5 better identified glandular lesions, while Gemini detected organisms more accurately (71% vs. 20%). Conclusions Current LLMs demonstrate moderate ability to identify cytologic abnormalities but are not yet reliable for independent Pap smear interpretation. Targeted fine-tuning, prompt optimization, and cytology-specific training could enhance their utility as adjunctive tools in cytology workflows.
Title: Evaluating Large Language Models in Interpreting Cervical Cytology
Description:
Abstract Background Large language models (LLMs) have shown promise in medical imaging, but their utility in cytology remains underexplored.
This study evaluates GPT-5 and Gemini 2.
5 Pro for Pap smear interpretation.
Methods Digital cervical Pap smear images of 100 cases were obtained from the Hologic Education Site, with Hologic diagnoses considered the gold standard.
Representative images were uploaded into GPT-5 and Gemini 2.
5 Pro and prompted to provide a diagnosis based on the Third Edition of the Bethesda System for Reporting Cervical Cytopathology.
Cases with infectious organisms were assessed using additional images.
Concordance was evaluated at exact diagnosis and clinical management groupings, wherein diagnoses with similar management implications were grouped.
Sensitivity and specificity for abnormal cytology were also calculated.
Results Concordance of both LLMs for exact diagnostic matches were comparable (GPT-5: 47%, Gemini: 48%) and increased to 66% for clinical management grouping.
GPT-5 performed best for low-grade squamous intraepithelial lesions (75%), whereas Gemini 2.
5 Pro showed the highest concordance in the high-grade squamous intraepithelial lesion (HSIL) category (82%), although this was largely attributable to its strong tendency to overcall cases as HSIL.
Sensitivity for detecting abnormal cytology was 74% for GPT-5 and 84% for Gemini, with specificity of 74% and 71%, respectively.
GPT-5 better identified glandular lesions, while Gemini detected organisms more accurately (71% vs.
20%).
Conclusions Current LLMs demonstrate moderate ability to identify cytologic abnormalities but are not yet reliable for independent Pap smear interpretation.
Targeted fine-tuning, prompt optimization, and cytology-specific training could enhance their utility as adjunctive tools in cytology workflows.

Related Results

Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Cervical cancer screening utilization and predictors among eligible women in Ethiopia: A systematic review and meta-analysis
Cervical cancer screening utilization and predictors among eligible women in Ethiopia: A systematic review and meta-analysis
BackgroundDespite a remarkable progress in the reduction of global rate of maternal mortality, cervical cancer has been identified as the leading cause of maternal morbidity and mo...
Cervical Cancer or Cervical Endometriosis – Review and Case Report
Cervical Cancer or Cervical Endometriosis – Review and Case Report
According to cancer death rates for women worldwide, this form of cancer ranks fourth after breast, bronchopulmonary, and colorectal cancer, affecting around 570,000 women annually...
Efficacy of cervical pessary versus cervical cerclage in preventing spontaneous preterm birth: a meta-analysis
Efficacy of cervical pessary versus cervical cerclage in preventing spontaneous preterm birth: a meta-analysis
Backgroud: The clinical efficacy and safety of cervical pessary versus cervical cerclage in preventing spontaneous preterm birth remain uncertain. Objectives: To systematically rev...
The Women Who Fear the Unknown: Potential Drivers of the Cervical Cancer Epidemic in Rural Nigeria
The Women Who Fear the Unknown: Potential Drivers of the Cervical Cancer Epidemic in Rural Nigeria
Background: Visual inspection of the cervix under acetic acid is the most cost-effective method for the control of cervical cancer in sub-Saharan Africa. The region bears about 90%...
C/EBPβ expression decreases in cervical cancer and leads to tumorigenesis
C/EBPβ expression decreases in cervical cancer and leads to tumorigenesis
Abstract Background Cervical cancer is currently estimated to be the fourth most common cancer among women worldwide and the leading cause of cancer...

Back to Top