Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

From Lectures to Learning Outcomes: Meaningful Integration of AI-Generated Content in Pre-Clerkship Medical Training

View through CrossRef
AbstractLarge Language Models (LLMs) have shown considerable promise in knowledge processing and synthesis across various medical disciplines. In medical education, most applications have focused on comparing LLM outputs to trainee performance or using LLMs for standardized assessment. However, few studies have systematically evaluated the effects of standardized, LLM-powered, curricular interventions on medical learning.This case study, conducted at The Warren Alpert Medical School of Brown University, assessed the impact of AI-generated Anki flashcards and lecture summaries specifically optimized for the pre-clerkship phase. These materials were developed using a rigorous, specific, and content-agnostic prompt engineering process and validated through standardized human grading to ensure both accuracy and relevance. The final prompts used demonstrated hallucination rates of 0 per summary and 1 per 21 flashcards and average coverage of 100% of faculty-identified learning objectives. Materials were given to students for two 3-week academic blocks, covering genetics and pharmacology.Student exam scores and survey-based feedback were used to evaluate the effectiveness of these AI-generated resources. The study was conducted in a resource-rich pre-clerkship setting where students already have access to faculty-created materials, commercial content, and student-curated resources. We aimed to determine whether AI-generated content could offer measurable quantitative improvements or subjective qualitative benefits in a saturated learning environment.Among participating first-year medical students, overall exam performance between those who used the AI-generated summaries and those who did not was comparable in both the genetics block (p = 0.76) and the pharmacology block (p = 0.35). Similarly, use of the AI-generated Anki flashcards was not associated with significant differences in exam scores for either genetics (p = 0.86) or pharmacology (p = 0.05). Qualitative analyses demonstrated widespread time saving for Anki flashcards (74%) and AI-generated summaries (61%), with 91% of users finding the custom AI-generated content more time-saving than default GPT-4o. There was a significant usage-dependent relationship of higher AI-usage correlating with increased agreement of equivalency or utility over faculty-generated lecture notes (Pearson’s r2=0.55) and student-created flashcards (Pearson’s r2=0.79).These findings suggest that students who used AI-generated content maintained comparable educational outcomes in the pre-clerkship setting. Moreover, subjective perceptions among learners, such as time saved and content usefulness, highlight the potential value of LLM-powered tools when layered on top of an existing well-resourced curricular structure. Future work will examine the benefits of this work in less structured medical education settings, such as clinical and surgical education.
Title: From Lectures to Learning Outcomes: Meaningful Integration of AI-Generated Content in Pre-Clerkship Medical Training
Description:
AbstractLarge Language Models (LLMs) have shown considerable promise in knowledge processing and synthesis across various medical disciplines.
In medical education, most applications have focused on comparing LLM outputs to trainee performance or using LLMs for standardized assessment.
However, few studies have systematically evaluated the effects of standardized, LLM-powered, curricular interventions on medical learning.
This case study, conducted at The Warren Alpert Medical School of Brown University, assessed the impact of AI-generated Anki flashcards and lecture summaries specifically optimized for the pre-clerkship phase.
These materials were developed using a rigorous, specific, and content-agnostic prompt engineering process and validated through standardized human grading to ensure both accuracy and relevance.
The final prompts used demonstrated hallucination rates of 0 per summary and 1 per 21 flashcards and average coverage of 100% of faculty-identified learning objectives.
Materials were given to students for two 3-week academic blocks, covering genetics and pharmacology.
Student exam scores and survey-based feedback were used to evaluate the effectiveness of these AI-generated resources.
The study was conducted in a resource-rich pre-clerkship setting where students already have access to faculty-created materials, commercial content, and student-curated resources.
We aimed to determine whether AI-generated content could offer measurable quantitative improvements or subjective qualitative benefits in a saturated learning environment.
Among participating first-year medical students, overall exam performance between those who used the AI-generated summaries and those who did not was comparable in both the genetics block (p = 0.
76) and the pharmacology block (p = 0.
35).
Similarly, use of the AI-generated Anki flashcards was not associated with significant differences in exam scores for either genetics (p = 0.
86) or pharmacology (p = 0.
05).
Qualitative analyses demonstrated widespread time saving for Anki flashcards (74%) and AI-generated summaries (61%), with 91% of users finding the custom AI-generated content more time-saving than default GPT-4o.
There was a significant usage-dependent relationship of higher AI-usage correlating with increased agreement of equivalency or utility over faculty-generated lecture notes (Pearson’s r2=0.
55) and student-created flashcards (Pearson’s r2=0.
79).
These findings suggest that students who used AI-generated content maintained comparable educational outcomes in the pre-clerkship setting.
Moreover, subjective perceptions among learners, such as time saved and content usefulness, highlight the potential value of LLM-powered tools when layered on top of an existing well-resourced curricular structure.
Future work will examine the benefits of this work in less structured medical education settings, such as clinical and surgical education.

Related Results

Does emergency medicine clerkship change students' misconceptions towards this specialty? Pre- and post-clerkship perceptions
Does emergency medicine clerkship change students' misconceptions towards this specialty? Pre- and post-clerkship perceptions
Objective: This study aims to understand students' skills based on their self-assessment and their perceptions regarding emergency medicine (EM) physicians, EM patients, and choosi...
Vaginal delivery simulation in the Obstetrics and Gynaecology clerkship
Vaginal delivery simulation in the Obstetrics and Gynaecology clerkship
Summary Background Although simulation is now used in other areas of obstetrics and gynaecology, i...
Evaluation of Student Expectations and Clinical Competence Skills Performance during Surgery Clerkship Programme: A Comprehensive Study
Evaluation of Student Expectations and Clinical Competence Skills Performance during Surgery Clerkship Programme: A Comprehensive Study
The existence of a good clerkship system and quality lecturers will undoubtedly enhance the medical students’ knowledge and skills that align with their competencies. This study ai...
Psychiatry Curriculum: How Does It Affect Medical Students’ Attitude Toward Psychiatry?
Psychiatry Curriculum: How Does It Affect Medical Students’ Attitude Toward Psychiatry?
Abstract BackgroundThe purpose of this study was to identify the attitude of South Korean medical school students towards psychiatry and to analyze how lectures and clinica...
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Abstract Introduction Tarlatamab is a Delta-like ligand 3 (DLL3) -directed bispecific T-cell engager recently approved for use in patients with advanced small cell lung cancer (SCL...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Student perceptions towards online learning in medical education during the COVID-19 pandemic: a mixed-methods study
Student perceptions towards online learning in medical education during the COVID-19 pandemic: a mixed-methods study
Background: This mixed-methods study was undertaken to ascertain undergraduate medical students’ perceptions of remote learning following the COVID-19 restrictions. 545 students pa...
Training Evaluation
Training Evaluation
Training evaluation is the systematic collection of data to better manage training programs and training systems. To be effective, evaluation should answer two questions: How did I...

Back to Top