Javascript must be enabled to continue!
Exploring Replay
View through CrossRef
Exploration is vital for animals and artificial agents who face uncertainty about their environments due to initial ignorance or subsequent changes. Their choices need to balance exploitation of the knowledge already acquired, with exploration to resolve uncertainty [1, 2]. However, the exact algorithmic structure of exploratory choices in the brain still remains largely elusive. A venerable idea in reinforcement learning is that agents can plan appropriate exploratory choices offline, during the equivalent of quiet wakefulness or sleep. Although offline processing in humans and other animals, in the form of hippocampal replay and preplay, has recently been the subject of highly successful modelling [3–5], existing methods only apply to known environments. Thus, they cannot predict exploratory replay choices during learning and/or behaviour in dynamic environments. Here, we extend the theory of Mattar & Daw [3] to examine the potential role of replay in approximately optimal exploration, deriving testable predictions for the patterns of exploratory replay choices in a paradigmatic spatial navigation task. Our modelling provides a normative interpretation of the available experimental data suggestive of exploratory replay. Furthermore, we highlight the importance of sequence replay, and license a range of new experimental paradigms that should further our understanding of offline processing.
Title: Exploring Replay
Description:
Exploration is vital for animals and artificial agents who face uncertainty about their environments due to initial ignorance or subsequent changes.
Their choices need to balance exploitation of the knowledge already acquired, with exploration to resolve uncertainty [1, 2].
However, the exact algorithmic structure of exploratory choices in the brain still remains largely elusive.
A venerable idea in reinforcement learning is that agents can plan appropriate exploratory choices offline, during the equivalent of quiet wakefulness or sleep.
Although offline processing in humans and other animals, in the form of hippocampal replay and preplay, has recently been the subject of highly successful modelling [3–5], existing methods only apply to known environments.
Thus, they cannot predict exploratory replay choices during learning and/or behaviour in dynamic environments.
Here, we extend the theory of Mattar & Daw [3] to examine the potential role of replay in approximately optimal exploration, deriving testable predictions for the patterns of exploratory replay choices in a paradigmatic spatial navigation task.
Our modelling provides a normative interpretation of the available experimental data suggestive of exploratory replay.
Furthermore, we highlight the importance of sequence replay, and license a range of new experimental paradigms that should further our understanding of offline processing.
Related Results
Evaluating hippocampal replay without a ground truth
Evaluating hippocampal replay without a ground truth
AbstractDuring rest and sleep, memory traces replay in the brain. The dialogue between brain regions during replay is thought to stabilize labile memory traces for long-term storag...
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
AbstractReplay facilitates memory consolidation in both biological and artificial systems. Using the complementary learning systems (CLS) framework, we study replay in both humans ...
The Role of Experience in Prioritizing Hippocampal Replay
The Role of Experience in Prioritizing Hippocampal Replay
Abstract
During sleep, recent memories are consolidated, whereby behavioral episodes first encoded by the hippocampus get transformed into long-term memories. However, the ...
Optimism and pessimism in optimised replay
Optimism and pessimism in optimised replay
ABSTRACTThe replay of task-relevant trajectories is known to contribute to memory consolidation and improved task performance. A wide variety of experimental data show that the con...
Generative replay for compositional visual understanding in the prefrontal-hippocampal circuit
Generative replay for compositional visual understanding in the prefrontal-hippocampal circuit
AbstractUnderstanding the visual world is a constructive process. Whilst a frontal-hippocampal circuit is known to be essential for this task, little is known about the associated ...
Prioritized Experience Replay Based on dynamics priority
Prioritized Experience Replay Based on dynamics priority
Abstract
Experience replay has been instrumental in achieving significant advancements in reinforcement learning by increasing the utilization of data. To further improve t...
Effect of Video-Recorded Replay on Physical Education Performance: Input to Physical Education Instructional Approach
Effect of Video-Recorded Replay on Physical Education Performance: Input to Physical Education Instructional Approach
This study determined the effect of video-recorded replay on the physical education performance of college students at Northern Iloilo State University. Seventy-two (72) randomly s...
The Role of Experience in Prioritizing Hippocampal Replay
The Role of Experience in Prioritizing Hippocampal Replay
SUMMARYDuring sleep, recent memories are consolidated, whereby behavioral episodes first encoded by the hippocampus get transformed into long-term memories. However, the brain cann...

