Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Large Language Model-Based Chatbots for Depression and Anxiety: Systematic Review and Meta-Analysis (Preprint)

View through CrossRef
BACKGROUND The global mental health crisis is becoming increasingly severe, with over 280 million patients suffering from depression and over 301 million patients suffering from anxiety disorders. Due to the shortage of mental health professionals, high treatment costs, and insufficient accessibility of services, there is an urgent need for scalable and low-cost intervention methods. Chatbots based on Large Language Models (LLMs) have become a new tool for providing psychological support with advanced natural language processing and deep learning techniques. However, their therapeutic effects are not yet fully studied, especially in terms of differential effects on depression and anxiety. There are obvious limitations to existing research: the psychological assessment tools used in different studies are mixed (e.g., PHQ-9, GAD-7, STAI), and there are significant differences in intervention design (dialogue system architecture, intervention duration), which makes it difficult to directly compare the efficacy evaluation results, especially for the differentiated effects on depression and anxiety. OBJECTIVE This study systematically evaluates the effectiveness of LLM chatbots in alleviating symptoms of depression and anxiety, and analyzes the moderating effects of intervention duration, control group type (e.g., blank control vs. traditional therapy), and demographic characteristics (e.g., age). METHODS By systematically searching PubMed, Cochrane, Scopus, and CNKI, 10 studies (17 effect measures) published between 2020 and 2025 were included, including randomized controlled trials (RCTs) and pre post test designs. The research subjects are adults aged 16 and above with symptoms of depression or anxiety. Using Hedges' g as the measure of effect, a random effects model was employed due to moderate heterogeneity (depression I² = 35.6%, anxiety I² = 54.96%). Subgroup analysis explored the effects of control group type, intervention duration, and age. RESULTS Summary analysis shows that chatbots have a marginal significant improvement effect on depression symptoms (g = 0.157, 95% CI [-0.013, 0.328], p = 0.071), and a significant improvement effect on anxiety symptoms (g = 0.277, 95% CI [0.069, 0.486], p = 0.009). Subgroup analysis found that for depressive symptoms, the effect size of the blank control group (g = 0.309) was higher than that of the traditional therapy group (g = 0.193); For anxiety symptoms, long-term intervention (> 4 weeks) has a significant effect (g = 0.572), far superior to short-term intervention (g = 0.063); Elderly people (> 50 years) benefit more (g = 0.372). CONCLUSIONS LLM chatbots have clinical significance for anxiety, especially in the elderly and long-term interventions, but their improvement on depression is limited. Future research should optimize depression intervention designs, integrate multimodal therapies, and focus on long-term effectiveness.
Title: Large Language Model-Based Chatbots for Depression and Anxiety: Systematic Review and Meta-Analysis (Preprint)
Description:
BACKGROUND The global mental health crisis is becoming increasingly severe, with over 280 million patients suffering from depression and over 301 million patients suffering from anxiety disorders.
Due to the shortage of mental health professionals, high treatment costs, and insufficient accessibility of services, there is an urgent need for scalable and low-cost intervention methods.
Chatbots based on Large Language Models (LLMs) have become a new tool for providing psychological support with advanced natural language processing and deep learning techniques.
However, their therapeutic effects are not yet fully studied, especially in terms of differential effects on depression and anxiety.
There are obvious limitations to existing research: the psychological assessment tools used in different studies are mixed (e.
g.
, PHQ-9, GAD-7, STAI), and there are significant differences in intervention design (dialogue system architecture, intervention duration), which makes it difficult to directly compare the efficacy evaluation results, especially for the differentiated effects on depression and anxiety.
OBJECTIVE This study systematically evaluates the effectiveness of LLM chatbots in alleviating symptoms of depression and anxiety, and analyzes the moderating effects of intervention duration, control group type (e.
g.
, blank control vs.
traditional therapy), and demographic characteristics (e.
g.
, age).
METHODS By systematically searching PubMed, Cochrane, Scopus, and CNKI, 10 studies (17 effect measures) published between 2020 and 2025 were included, including randomized controlled trials (RCTs) and pre post test designs.
The research subjects are adults aged 16 and above with symptoms of depression or anxiety.
Using Hedges' g as the measure of effect, a random effects model was employed due to moderate heterogeneity (depression I² = 35.
6%, anxiety I² = 54.
96%).
Subgroup analysis explored the effects of control group type, intervention duration, and age.
RESULTS Summary analysis shows that chatbots have a marginal significant improvement effect on depression symptoms (g = 0.
157, 95% CI [-0.
013, 0.
328], p = 0.
071), and a significant improvement effect on anxiety symptoms (g = 0.
277, 95% CI [0.
069, 0.
486], p = 0.
009).
Subgroup analysis found that for depressive symptoms, the effect size of the blank control group (g = 0.
309) was higher than that of the traditional therapy group (g = 0.
193); For anxiety symptoms, long-term intervention (> 4 weeks) has a significant effect (g = 0.
572), far superior to short-term intervention (g = 0.
063); Elderly people (> 50 years) benefit more (g = 0.
372).
CONCLUSIONS LLM chatbots have clinical significance for anxiety, especially in the elderly and long-term interventions, but their improvement on depression is limited.
Future research should optimize depression intervention designs, integrate multimodal therapies, and focus on long-term effectiveness.

Related Results

Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Revolutionizing public health: The importance of chatbots
Revolutionizing public health: The importance of chatbots
Introduction: Public health is a crucial aspect of maintaining the well-being and health of the community. The ever-growing demands of the modern w...
App review of anxiety and depression chatbots and their self-care features (Preprint)
App review of anxiety and depression chatbots and their self-care features (Preprint)
BACKGROUND Anxiety and depression rates are at an all-time high along with other mental health disorders. Smartphone-based mental health chatbots or convers...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Digital Mental Health Landscaping in Low- and Middle-Income Countries 
Digital Mental Health Landscaping in Low- and Middle-Income Countries 
Introduction The aim of this project was to map the landscape of who is doing what and where in digital mental health, and to pr...
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Investigating the Psychological Impact of Corrective Feedback on ESL Students’ Language Anxiety
Investigating the Psychological Impact of Corrective Feedback on ESL Students’ Language Anxiety
This study investigates the psychological impact of corrective feedback on English as a Second Language (ESL) students' language anxiety using a quantitative research approach. Con...
Anxiety and Depression Chatbot Features: A Scoping Review (Preprint)
Anxiety and Depression Chatbot Features: A Scoping Review (Preprint)
BACKGROUND Apps and web-based chatbots can provide valuable and meaningful support to healthcare workers in assessing and guiding management of various heal...

Back to Top