Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Systematic Literature Review of Retrieval-Augmented Generation Implementation for Enhancing Large Language Models in Education

View through CrossRef
The rapid advancement of Large Language Models (LLM) has led to the creation of increasingly adaptive intelligent learning systems. However, many educational implementations of LLMs still rely on internal model knowledge without sufficient grounding in reliable external sources, which may reduce response accuracy, contextual relevance, and trustworthiness. One approach proposed to address this limitation is Retrieval-Augmented Generation (RAG), which combines the LLM’s generative capabilities with external information retrieval systems. Nevertheless, evidence regarding how RAG has been implemented, optimized, and evaluated in educational contexts remains fragmented. This study aimed to evaluate the RAG implementation in supporting LLM performance in learning environments by examining (1) the most frequent types of learning activities involving RAG, (2) the RAG implementation effectiveness in improving response quality, (3) the optimization techniques used to improve RAG results, and (4) the challenges and opportunities faced in its integration in education. The study was conducted systematically using the PICO framework, drawing on articles from the Scopus and IEEE databases, spanning the period from 2021 to 2025. The analysis of 50 studies revealed that RAG is most applied in the contexts of question answering, personalized learning, and tutoring, with significant improvements in the aspects of accuracy, relevance, and personalization of responses. However, not all studies explicitly reported the implementation of optimization techniques to RAG. These techniques comprised knowledge injection, prompt engineering, and query expansion. Challenges remain in reasoning, retrieval accuracy, and system integration, but there is strong potential for developing more adaptive and contextualized learning systems. This review recommends stronger retrieval optimization, better alignment with pedagogical objectives, and broader evaluation using educational outcome measures to maximize the impact of RAG-enhanced LLMs in digital education.
Title: A Systematic Literature Review of Retrieval-Augmented Generation Implementation for Enhancing Large Language Models in Education
Description:
The rapid advancement of Large Language Models (LLM) has led to the creation of increasingly adaptive intelligent learning systems.
However, many educational implementations of LLMs still rely on internal model knowledge without sufficient grounding in reliable external sources, which may reduce response accuracy, contextual relevance, and trustworthiness.
One approach proposed to address this limitation is Retrieval-Augmented Generation (RAG), which combines the LLM’s generative capabilities with external information retrieval systems.
Nevertheless, evidence regarding how RAG has been implemented, optimized, and evaluated in educational contexts remains fragmented.
This study aimed to evaluate the RAG implementation in supporting LLM performance in learning environments by examining (1) the most frequent types of learning activities involving RAG, (2) the RAG implementation effectiveness in improving response quality, (3) the optimization techniques used to improve RAG results, and (4) the challenges and opportunities faced in its integration in education.
The study was conducted systematically using the PICO framework, drawing on articles from the Scopus and IEEE databases, spanning the period from 2021 to 2025.
The analysis of 50 studies revealed that RAG is most applied in the contexts of question answering, personalized learning, and tutoring, with significant improvements in the aspects of accuracy, relevance, and personalization of responses.
However, not all studies explicitly reported the implementation of optimization techniques to RAG.
These techniques comprised knowledge injection, prompt engineering, and query expansion.
Challenges remain in reasoning, retrieval accuracy, and system integration, but there is strong potential for developing more adaptive and contextualized learning systems.
This review recommends stronger retrieval optimization, better alignment with pedagogical objectives, and broader evaluation using educational outcome measures to maximize the impact of RAG-enhanced LLMs in digital education.

Related Results

Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Implementasi Pembelajaran IPS Sebagai Penguatan Pendidikan Karakter di Sekolah Dasar
Implementasi Pembelajaran IPS Sebagai Penguatan Pendidikan Karakter di Sekolah Dasar
This study aims to analyze the implementation of social studies learning as strengthening character education in elementary schools. The research method used is a qualitative descr...
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
Do evidence summaries increase health policy‐makers' use of evidence from systematic reviews? A systematic review
This review summarizes the evidence from six randomized controlled trials that judged the effectiveness of systematic review summaries on policymakers' decision making, or the most...
Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Unconventional Method of Subsea Umbilical Retrieval Using Anchor Handling Vessel
Unconventional Method of Subsea Umbilical Retrieval Using Anchor Handling Vessel
Abstract A deepwater field in West Africa was decommissioned and subsea facilities retrieval operation was carried out as part of the Abandonment and Decommissioning...

Back to Top