Learning the payoffs and costs of actions

Abstract

A set of sub-cortical nuclei called the basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoidance behavior respectively, and are differentially modulated by dopamine projections from the midbrain. According to the influential opponent actor learning model, these pathways represent learned estimates of the positive and negative consequences (payoffs and costs) of actions. The level of dopamine release controls to what extent payoffs and costs enter the overall evaluation of actions. How the knowledge about payoff and cost is acquired is still an open question, even though many theories describe learning from feedback in the basal ganglia. We examine whether a set of plasticity rules proposed to model reinforcement learning in the pathways of the basal ganglia is suitable to extract payoffs and costs from a reward prediction error signal. First, we determine the result of such learning, both analytically and via simulations, for different reward schedules that feature payoffs and costs. Then, we combine the plasticity rules with a decision rule to examine the emerging effect of dopaminergic modulation on the willingness to work for reward. We find that the plasticity rules are suitable to infer the mean payoffs and costs of actions, provided those occur at different moments in time. Successful learning requires differential effects of positive and negative reward prediction errors on the two pathways, and a weak decay of synaptic weights over trials. We also confirm that dopaminergic modulation produces effects on the willingness to work for reward similar to those observed in classical experiments.

Author summary

The basal ganglia are structures underneath the surface of the vertebrate brain, associated with error-driven learning. Much is known about the anatomical and biological features of the basal ganglia; scientists now try to understand the algorithms implemented by these structures. Numerous models aspire to capture the learning functionality, but many of them only cover some specific aspect of the algorithm. Instead of further adding to that pool of partial models, we unify two existing ones: one that captures what the basal ganglia learn, and one that describes the learning mechanism itself. The first model suggests that the basal ganglia keep track of both positive and negative consequences of frequent opportunities, and weigh these according to the motivational state when making decisions. It explains how payoff and cost are represented, but not how those representations arise. The other model consists of biologically plausible plasticity rules, which describe how learning takes place, but not how the brain makes use of what is learned. We show that the two theories are compatible. Together, they form a model of learning and decision making that integrates the motivational state as well as the learned payoffs and costs of opportunities.
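The scheme summarized in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code: the exact update equations, the names `G` and `N` for the two pathway weights, the value estimate `(G - N) / 2`, the weighting `dopamine * G - (1 - dopamine) * N`, and all parameter values are assumptions chosen for illustration. It only reflects the three ingredients the abstract names: positive and negative reward prediction errors affect the two pathways differently, weights decay weakly over trials, and cost and payoff arrive at different moments in time.

```python
def scaled_error(delta, eps):
    """Pass positive errors unchanged; scale negative errors by eps (assumed form)."""
    return delta if delta > 0 else eps * delta


def learn_payoff_and_cost(payoff, cost, trials=2000, alpha=0.1, eps=0.8, decay=0.01):
    """Learn two weights from a prediction error signal (illustrative sketch).

    G stands for the 'approach' pathway, N for the 'avoid' pathway.
    alpha (learning rate), eps (scaling of the non-preferred error sign)
    and decay (weak weight decay) are hypothetical parameter values.
    Weights are not constrained to be non-negative in this sketch.
    """
    G, N = 0.0, 0.0
    for _ in range(trials):
        # Within a trial the cost is experienced first, the payoff afterwards.
        for reward in (-cost, payoff):
            delta = reward - 0.5 * (G - N)  # reward prediction error
            # G is driven mainly by positive errors, N mainly by negative ones.
            G, N = (
                G + alpha * scaled_error(delta, eps) - decay * G,
                N + alpha * scaled_error(-delta, eps) - decay * N,
            )
    return G, N


def net_value(G, N, dopamine):
    """Assumed decision variable: dopamine in [0, 1] shifts weight from cost to payoff."""
    return dopamine * G - (1.0 - dopamine) * N


if __name__ == "__main__":
    G, N = learn_payoff_and_cost(payoff=2.0, cost=1.0)
    print(f"G = {G:.2f}, N = {N:.2f}")
    for level in (0.2, 0.5, 0.8):
        print(f"dopamine = {level}: net value = {net_value(G, N, level):.2f}")
```

With these assumed settings, `G` settles close to the payoff and `N` close to the cost, and lowering the `dopamine` argument makes the same action look less worthwhile. How close the estimates get depends on the balance between `alpha`, `eps` and `decay`, so other values may give biased estimates.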

Related Results

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The Covid-19 pandemic currently requires teachers to be able to use technology in the teaching and learning process. In reality, however, there are still many teachers who have not been able ...
Method of evaluating and diagnosing costs for event management
The article develops a method of evaluating and diagnosing costs for event management in the form of a matrix that takes into account the directions of managing event processes of ...
On the Existence and Determining Stationary Nash Equilibria for Switching Controller Stochastic Games
In this paper we consider the problem of the existence and determining stationary Nash equilibria for switching controller stochastic games with discounted and average payoffs. The...
Laboratory Costs in the Context of Disease
Abstract Background: To determine the contribution of laboratory costs to the overall costs of managing hospital patients with different diseases, we studied the cos...
Blood transfusion costs by diagnosis‐related groups in 60 university hospitals in 1995
BACKGROUND: Transfusion services are frequently challenged to initiate efforts to reduce blood transfusion costs. One approach is to analyze blood transfusion costs for individual ...
Impact of cardiovascular events on primary and hospital care costs: findings from UK Biobank study
Abstract Background Need for primary and secondary healthcare increases following cardiovascular disease (CVD) events but there ...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
