Javascript must be enabled to continue!
Offline Replay Supports Planning: fMRI Evidence from Reward Revaluation
View through CrossRef
Abstract
Making decisions in sequentially structured tasks requires integrating distally acquired information. The extensive computational cost of such integration challenges planning methods that integrate online, at decision time. Furthermore, it remains unclear whether “offline” integration during replay supports planning, and if so which memories should be replayed. Inspired by machine learning, we propose that (a) offline replay of trajectories facilitates integrating representations that guide decisions, and (b) unsigned prediction errors (uncertainty) trigger such integrative replay. We designed a 2-step revaluation task for fMRI, whereby participants needed to integrate changes in rewards with past knowledge to optimally replan decisions. As predicted, we found that (a) multi-voxel pattern evidence for off-task replay predicts subsequent replanning; (b) neural sensitivity to uncertainty predicts subsequent replay and replanning; (c) off-task hippocampus and anterior cingulate activity increase when revaluation is required. These findings elucidate how the brain leverages offline mechanisms in planning and goal-directed behavior under uncertainty.
Title: Offline Replay Supports Planning: fMRI Evidence from Reward Revaluation
Description:
Abstract
Making decisions in sequentially structured tasks requires integrating distally acquired information.
The extensive computational cost of such integration challenges planning methods that integrate online, at decision time.
Furthermore, it remains unclear whether “offline” integration during replay supports planning, and if so which memories should be replayed.
Inspired by machine learning, we propose that (a) offline replay of trajectories facilitates integrating representations that guide decisions, and (b) unsigned prediction errors (uncertainty) trigger such integrative replay.
We designed a 2-step revaluation task for fMRI, whereby participants needed to integrate changes in rewards with past knowledge to optimally replan decisions.
As predicted, we found that (a) multi-voxel pattern evidence for off-task replay predicts subsequent replanning; (b) neural sensitivity to uncertainty predicts subsequent replay and replanning; (c) off-task hippocampus and anterior cingulate activity increase when revaluation is required.
These findings elucidate how the brain leverages offline mechanisms in planning and goal-directed behavior under uncertainty.
Related Results
Reward does not facilitate visual perceptual learning until sleep occurs
Reward does not facilitate visual perceptual learning until sleep occurs
ABSTRACTA growing body of evidence indicates that visual perceptual learning (VPL) is enhanced by reward provided during training. Another line of studies has shown that sleep foll...
An examination of how reward associations differentially facilitate and impair Stroop performance
An examination of how reward associations differentially facilitate and impair Stroop performance
Behavioral performance is improved when the color of a Stroop stimulus is tied to a potential reward but is impaired when the irrelevant word meaning is reward related. The facilit...
Post-learning replay of hippocampal-striatal activity is biased by reward-prediction signals
Post-learning replay of hippocampal-striatal activity is biased by reward-prediction signals
Abstract
Neural activity encoding recent experiences is replayed during sleep and rest to promote consolidation of memories. However, precisely which features of ex...
Theta-band phase locking during encoding leads to coordinated entorhinal-hippocampal replay
Theta-band phase locking during encoding leads to coordinated entorhinal-hippocampal replay
Abstract
Precisely timed interactions between hippocampal and cortical neurons during replay epochs are thought to support memory consolidation. ...
An examination of how reward associations facilitate and impair Stroop performance
An examination of how reward associations facilitate and impair Stroop performance
Rewarded stimuli are prioritized by the attentional system. Behavioral performance is improved when the task-relevant dimension is tied to a potential reward but is impaired when t...
Evaluating hippocampal replay without a ground truth
Evaluating hippocampal replay without a ground truth
AbstractDuring rest and sleep, memory traces replay in the brain. The dialogue between brain regions during replay is thought to stabilize labile memory traces for long-term storag...
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
Exploring the roles of memory replay in targeted memory reactivation and birdsong development: Insights from computational models of complementary learning systems
Abstract
Replay facilitates memory consolidation in both biological and artificial systems. Using the complementary learning systems (CLS) framework, we study repla...
Differential and temporally dynamic involvement of primate amygdala nuclei in face animacy and reward information processing
Differential and temporally dynamic involvement of primate amygdala nuclei in face animacy and reward information processing
Abstract
Decision-making is influenced by both expected reward and social factors, such as who offered the outcomes. Thus, although a reward might originally be ind...

