Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Surprise acts as a reducer of outcome value in human reinforcement learning

View through CrossRef
Surprise occurs because of differences between a decision outcome and its predicted outcome (prediction error), regardless of whether the error is positive or negative. It has recently been postulated that surprise affects the reward value of the action outcome itself; studies have indicated that increasing surprise, as absolute value of prediction error, decreases the value of the outcome. However, how surprise affects the value of the outcome and subsequent decision making is unclear. We suggested that, on the assumption that surprise decreases the outcome value, agents will increase their risk averse choices when an outcome is often surprisal. Here, we propose the surprise-sensitive utility model, a reinforcement learning model that states that surprise decreases the outcome value, to explain how surprise affects subsequent decision-making. To investigate the assumption, we compared this model with previous reinforcement learning models on a risky probabilistic learning task with simulation analysis, and model selection with two experimental datasets with different tasks and population. We further simulated a simple decision-making task to investigate how parameters within the proposed model modulate the choice preference. As a result, we found the proposed model explains the risk averse choices in a manner similar to the previous models, and risk averse choices increased as the surprise-based modulation parameter of outcome value increased. The model fits these datasets better than the other models, with same free parameters, thus providing a more parsimonious and robust account for risk averse choices. These findings indicate that surprise acts as a reducer of outcome value and decreases the action value for risky choices in which prediction error often occurs.
Center for Open Science
Title: Surprise acts as a reducer of outcome value in human reinforcement learning
Description:
Surprise occurs because of differences between a decision outcome and its predicted outcome (prediction error), regardless of whether the error is positive or negative.
It has recently been postulated that surprise affects the reward value of the action outcome itself; studies have indicated that increasing surprise, as absolute value of prediction error, decreases the value of the outcome.
However, how surprise affects the value of the outcome and subsequent decision making is unclear.
We suggested that, on the assumption that surprise decreases the outcome value, agents will increase their risk averse choices when an outcome is often surprisal.
Here, we propose the surprise-sensitive utility model, a reinforcement learning model that states that surprise decreases the outcome value, to explain how surprise affects subsequent decision-making.
To investigate the assumption, we compared this model with previous reinforcement learning models on a risky probabilistic learning task with simulation analysis, and model selection with two experimental datasets with different tasks and population.
We further simulated a simple decision-making task to investigate how parameters within the proposed model modulate the choice preference.
As a result, we found the proposed model explains the risk averse choices in a manner similar to the previous models, and risk averse choices increased as the surprise-based modulation parameter of outcome value increased.
The model fits these datasets better than the other models, with same free parameters, thus providing a more parsimonious and robust account for risk averse choices.
These findings indicate that surprise acts as a reducer of outcome value and decreases the action value for risky choices in which prediction error often occurs.

Related Results

Installation Analysis of Matterhorn Pipeline Replacement
Installation Analysis of Matterhorn Pipeline Replacement
Abstract The paper describes the installation analysis for the Matterhorn field pipeline replacement, located in water depths between 800-ft to 1200-ft in the Gul...
Philippe Quinault, le poète de la surprise
Philippe Quinault, le poète de la surprise
Le but poursuivi par notre étude est de montrer comment Quinault, mû par la volonté de séduire le spectateur, a délibérément centré sa dramaturgie sur la surprise, ce qui explique ...
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...
Development of a Universal Ranking for Friction Reducer Performance
Development of a Universal Ranking for Friction Reducer Performance
Abstract In hydraulic fracturing, large amounts of water are pumped at high speed down the wellbore. To reduce pump pressure and costs, a friction reducer is added t...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Design and Testing of a New Type of Planetary Traction Drive Bearing-Type Reducer
Design and Testing of a New Type of Planetary Traction Drive Bearing-Type Reducer
This paper presents the design and development of a new type of planetary traction drive bearing-type reducer. In this design, the transmission outer ring is replaced with an elast...
Earnings surprise and share price of firms in Nigeria
Earnings surprise and share price of firms in Nigeria
AbstractThis study examines earnings surprise and share price of firms in Nigeria. It sought to evaluate the impact of earnings surprise in predicting share price of firms. The pap...

Back to Top