Javascript must be enabled to continue!

Surprise acts as a reducer of outcome value in human reinforcement learning

Surprise occurs because of differences between a decision outcome and its predicted outcome (prediction error), regardless of whether the error is positive or negative. It has recently been postulated that surprise affects the reward value of the action outcome itself; studies have indicated that increasing surprise, as absolute value of prediction error, decreases the value of the outcome. However, how surprise affects the value of the outcome and subsequent decision making is unclear. We suggested that, on the assumption that surprise decreases the outcome value, agents will increase their risk averse choices when an outcome is often surprisal. Here, we propose the surprise-sensitive utility model, a reinforcement learning model that states that surprise decreases the outcome value, to explain how surprise affects subsequent decision-making. To investigate the assumption, we compared this model with previous reinforcement learning models on a risky probabilistic learning task with simulation analysis, and model selection with two experimental datasets with different tasks and population. We further simulated a simple decision-making task to investigate how parameters within the proposed model modulate the choice preference. As a result, we found the proposed model explains the risk averse choices in a manner similar to the previous models, and risk averse choices increased as the surprise-based modulation parameter of outcome value increased. The model fits these datasets better than the other models, with same free parameters, thus providing a more parsimonious and robust account for risk averse choices. These findings indicate that surprise acts as a reducer of outcome value and decreases the action value for risky choices in which prediction error often occurs.

Center for Open Science

Motofumi Sumiya Kentaro Katahira

2019

Title: Surprise acts as a reducer of outcome value in human reinforcement learning

Description:

Surprise occurs because of differences between a decision outcome and its predicted outcome (prediction error), regardless of whether the error is positive or negative.

It has recently been postulated that surprise affects the reward value of the action outcome itself; studies have indicated that increasing surprise, as absolute value of prediction error, decreases the value of the outcome.

However, how surprise affects the value of the outcome and subsequent decision making is unclear.

We suggested that, on the assumption that surprise decreases the outcome value, agents will increase their risk averse choices when an outcome is often surprisal.

Here, we propose the surprise-sensitive utility model, a reinforcement learning model that states that surprise decreases the outcome value, to explain how surprise affects subsequent decision-making.

To investigate the assumption, we compared this model with previous reinforcement learning models on a risky probabilistic learning task with simulation analysis, and model selection with two experimental datasets with different tasks and population.

We further simulated a simple decision-making task to investigate how parameters within the proposed model modulate the choice preference.

As a result, we found the proposed model explains the risk averse choices in a manner similar to the previous models, and risk averse choices increased as the surprise-based modulation parameter of outcome value increased.

The model fits these datasets better than the other models, with same free parameters, thus providing a more parsimonious and robust account for risk averse choices.

These findings indicate that surprise acts as a reducer of outcome value and decreases the action value for risky choices in which prediction error often occurs.

Back

Abstract The paper describes the installation analysis for the Matterhorn field pipeline replacement, located in water depths between 800-ft to 1200-ft in the Gul...

Philippe Quinault, le poète de la surprise

Le but poursuivi par notre étude est de montrer comment Quinault, mû par la volonté de séduire le spectateur, a délibérément centré sa dramaturgie sur la surprise, ce qui explique ...

STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C

The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...

Development of a Universal Ranking for Friction Reducer Performance

Abstract In hydraulic fracturing, large amounts of water are pumped at high speed down the wellbore. To reduce pump pressure and costs, a friction reducer is added t...

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Design and Testing of a New Type of Planetary Traction Drive Bearing-Type Reducer

This paper presents the design and development of a new type of planetary traction drive bearing-type reducer. In this design, the transmission outer ring is replaced with an elast...

Earnings surprise and share price of firms in Nigeria

AbstractThis study examines earnings surprise and share price of firms in Nigeria. It sought to evaluate the impact of earnings surprise in predicting share price of firms. The pap...

Fiber reinforcement as an alternative to the compressed zone linear reinforcement and the flexible concrete elements stretched zone prestressing

Abstract The results of a numerical experiment in the framework of a theoretical study of the strength and crack resistance of the reinforced concrete beams availabl...

Email:
Password:

Email:

Surprise acts as a reducer of outcome value in human reinforcement learning

Related Results