Javascript must be enabled to continue!
Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone
View through CrossRef
Empathy plays a central role in human emotional relationships. Empathic accuracy, the ability to accurately infer another person’s emotional state, varies by informational modality and, in humans, is often intertwined with emotional and motivational processes. This study examines whether state-of-the-art Large Language Models (LLMs) - GPT-4, Claude, and Gemini - demonstrate empathic accuracy, and how their accuracy compares to that of humans when presented with only the semantic content (transcripts of recorded videos) of ecological, complex autobiographical emotional narratives. We compared the empathic accuracy of LLMs’ to that of human participants (N = 127, randomly sampled students, both in-lab and online) who either read the same transcripts or watched the original videos, which enabled them to use facial and bodily expressions, as well as paralinguistic cues, in addition to semantics. LLMs were able to infer emotional states from semantic content alone with a precision that is equal to or surpasses human performance. This was true both generally and when analyzing positive and negative emotions separately. Theoretically, these findings suggest that semantic information alone can support high empathic accuracy, though humans may not fully leverage this potential. Practical implications are discussed regarding the use of LLMs in introspective and emotional contexts, while raising critical concerns about privacy, ethical risks, and the potential reshaping of emotional understanding, intimacy, and human connection in an increasingly AI-mediated world.
Title: Reading Between the Lines: LLMs Match or Exceed Human Empathic Accuracy Using Text Alone
Description:
Empathy plays a central role in human emotional relationships.
Empathic accuracy, the ability to accurately infer another person’s emotional state, varies by informational modality and, in humans, is often intertwined with emotional and motivational processes.
This study examines whether state-of-the-art Large Language Models (LLMs) - GPT-4, Claude, and Gemini - demonstrate empathic accuracy, and how their accuracy compares to that of humans when presented with only the semantic content (transcripts of recorded videos) of ecological, complex autobiographical emotional narratives.
We compared the empathic accuracy of LLMs’ to that of human participants (N = 127, randomly sampled students, both in-lab and online) who either read the same transcripts or watched the original videos, which enabled them to use facial and bodily expressions, as well as paralinguistic cues, in addition to semantics.
LLMs were able to infer emotional states from semantic content alone with a precision that is equal to or surpasses human performance.
This was true both generally and when analyzing positive and negative emotions separately.
Theoretically, these findings suggest that semantic information alone can support high empathic accuracy, though humans may not fully leverage this potential.
Practical implications are discussed regarding the use of LLMs in introspective and emotional contexts, while raising critical concerns about privacy, ethical risks, and the potential reshaping of emotional understanding, intimacy, and human connection in an increasingly AI-mediated world.
Related Results
Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Abstract
Introduction
The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...
Perspectives and Experiences With Large Language Models in Health Care: Survey Study (Preprint)
Perspectives and Experiences With Large Language Models in Health Care: Survey Study (Preprint)
BACKGROUND
Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unifie...
Perspectives and Experiences With Large Language Models in Health Care: Survey Study
Perspectives and Experiences With Large Language Models in Health Care: Survey Study
Background
Large language models (LLMs) are transforming how data is used, including within the health care sector. However, frameworks including the Unified Th...
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare
Abstract
Background
The launch of the Chat Generative Pre-trained Transformer (ChatGPT) in November 2022 has attracted public a...
Large Language Models for Predicting Empathic Accuracy Between a Designer and a User
Large Language Models for Predicting Empathic Accuracy Between a Designer and a User
Abstract
Empathic design research aims to gain deep and accurate user understanding. We can measure the designer's empathic ability as empathic accuracy (EA) in unde...
Possibilities of Forming Empathic Reactions in Children with ASD
Possibilities of Forming Empathic Reactions in Children with ASD
The purpose of this work was to analyze the possibilities of formation and manifestation of empathy in people with ASD. According to the Diagnostic and Statistical Manual of Mental...

