Experience resetting in reinforcement learning facilitates exploration–exploitation transitions during a behavioral task for primates
Abstract: The exploration–exploitation trade-off is a fundamental problem in reinforcement learning. A target search task in which exploration and exploitation phases alternate is useful for studying the neural mechanisms involved in this problem. Monkeys that are well trained in this task clearly recognize when they have entered the exploratory phase, and they quickly acquire new experiences by resetting their previous experiences. In this study, we used a simple model to show that experience resetting in the exploratory phase improves performance rather than decreasing the greediness of action selection, and we then present a neural network-type model that enables experience resetting.
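The idea of experience resetting can be illustrated with a toy simulation. The sketch below is not the authors' model. It assumes a target search task with a hypothetical number of targets and block length, a softmax agent with a delta-rule value update, and a reset rule that clears all action values when a rewarded trial is followed by an unrewarded one. All names and parameter values (`simulate`, `alpha`, `beta`, `block_len`, and so on) are illustrative assumptions.

```python
import math
import random


def softmax_choice(values, beta, rng):
    """Sample an action with probability proportional to exp(beta * value)."""
    top = max(values)  # subtract the maximum for numerical stability
    weights = [math.exp(beta * (v - top)) for v in values]
    return rng.choices(range(len(values)), weights=weights)[0]


def simulate(reset, beta, n_targets=4, block_len=20, n_blocks=200,
             alpha=0.3, seed=0):
    """Return the mean reward per trial in a toy target search task.

    The rewarded target stays fixed for `block_len` trials and then moves
    to a different target. If `reset` is True, the agent clears all action
    values when an unrewarded trial follows a rewarded one (an assumed
    stand-in for detecting entry into the exploratory phase).
    """
    rng = random.Random(seed)
    values = [0.0] * n_targets
    target = rng.randrange(n_targets)
    prev_rewarded = False
    total = 0.0
    for _ in range(n_blocks):
        for _ in range(block_len):
            action = softmax_choice(values, beta, rng)
            reward = 1.0 if action == target else 0.0
            if reset and prev_rewarded and reward == 0.0:
                # Experience resetting: discard all learned values.
                values = [0.0] * n_targets
            else:
                # Delta-rule update of the chosen action's value.
                values[action] += alpha * (reward - values[action])
            prev_rewarded = reward == 1.0
            total += reward
        # Move the rewarded target to a different location.
        target = rng.choice([t for t in range(n_targets) if t != target])
    return total / (n_blocks * block_len)


if __name__ == "__main__":
    print("greedy, no reset     :", simulate(reset=False, beta=10.0))
    print("less greedy, no reset:", simulate(reset=False, beta=3.0))
    print("greedy, with reset   :", simulate(reset=True, beta=10.0))
```

Under these assumptions, the greedy agent without resetting keeps choosing the old target for several trials after the target moves, because its stored value decays only gradually. Lowering `beta` shortens that perseveration but also makes choices noisier during the exploitation phase, whereas resetting removes the stale value at once while keeping action selection greedy.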

