Javascript must be enabled to continue!

Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone

Empathy plays a central role in human emotional relationships. Empathic accuracy, the ability to accurately infer another person’s emotional state, varies by informational modality and, in humans, is often intertwined with emotional and motivational processes. This study examines whether state-of-the-art Large Language Models (LLMs) - GPT-4, Claude, and Gemini - demonstrate empathic accuracy, and how their accuracy compares to that of humans when presented with only the semantic content (transcripts of recorded videos) of ecological, complex autobiographical emotional narratives. We compared the empathic accuracy of LLMs’ to that of human participants (N = 127, randomly sampled students, both in-lab and online) who either read the same transcripts or watched the original videos, which enabled them to use facial and bodily expressions, as well as paralinguistic cues, in addition to semantics. LLMs were able to infer emotional states from semantic content alone with a precision that is equal to or surpasses human performance. This was true both generally and when analyzing positive and negative emotions separately. Theoretically, these findings suggest that semantic information alone can support high empathic accuracy, though humans may not fully leverage this potential. Practical implications are discussed regarding the use of LLMs in introspective and emotional contexts, while raising critical concerns about privacy, ethical risks, and the potential reshaping of emotional understanding, intimacy, and human connection in an increasingly AI-mediated world.

Center for Open Science

Noa Oded Matan Rubin Shir Genzer Anat Perry

2025

Title: Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone

Description:

Empathy plays a central role in human emotional relationships.

Empathic accuracy, the ability to accurately infer another person’s emotional state, varies by informational modality and, in humans, is often intertwined with emotional and motivational processes.

This study examines whether state-of-the-art Large Language Models (LLMs) - GPT-4, Claude, and Gemini - demonstrate empathic accuracy, and how their accuracy compares to that of humans when presented with only the semantic content (transcripts of recorded videos) of ecological, complex autobiographical emotional narratives.

We compared the empathic accuracy of LLMs’ to that of human participants (N = 127, randomly sampled students, both in-lab and online) who either read the same transcripts or watched the original videos, which enabled them to use facial and bodily expressions, as well as paralinguistic cues, in addition to semantics.

LLMs were able to infer emotional states from semantic content alone with a precision that is equal to or surpasses human performance.

This was true both generally and when analyzing positive and negative emotions separately.

Theoretically, these findings suggest that semantic information alone can support high empathic accuracy, though humans may not fully leverage this potential.

Practical implications are discussed regarding the use of LLMs in introspective and emotional contexts, while raising critical concerns about privacy, ethical risks, and the potential reshaping of emotional understanding, intimacy, and human connection in an increasingly AI-mediated world.

Back

Abstract Introduction The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...

Sleep Habits and Occurrence of Lowback Pain among Craftsmen

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Sleep Habits and Occurrence of Lowback Pain among Craftsmen

<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study (Preprint)

BACKGROUND Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unifie...

Perspectives and Experiences With Large Language Models in Health Care: Survey Study

Background Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unified Th...

A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare

Abstract Background The launch of the Chat Generative Pre-trained Transformer (ChatGPT) in November 2022 has attracted public a...

Large Language Models for Predicting Empathic Accuracy Between a Designer and a User

Abstract Empathic design research aims to gain deep and accurate user understanding. We can measure the designer's empathic ability as empathic accuracy (EA) in unde...

Possibilities of Forming Empathic Reactions in Children with ASD

The purpose of this work was to analyze the possibilities of formation and manifestation of empathy in people with ASD. According to the Diagnostic and Statistical Manual of Mental...

Email:
Password:

Email:

Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone

Related Results