Javascript must be enabled to continue!
Temporal discounting correlates with directed exploration but not with random exploration
View through CrossRef
Abstract
The explore-exploit dilemma describes the trade off that occurs any time we must choose between exploring unknown options and exploiting options we know well. Implicit in this trade off is how we value future rewards — exploiting is usually better in the short term, but in the longer term the benefits of exploration can be huge. Thus, in theory there should be a tight connection between how much people value future rewards, i.e. how much they discount future rewards relative to immediate rewards, and how likely they are to explore, with less ‘temporal discounting’ associated with more exploration. By measuring individual differences in temporal discounting and correlating them with explore-exploit behavior, we tested whether this theoretical prediction holds in practice. We used the 27-item Delay-Discounting Questionnaire to estimate temporal discounting and the Horizon Task to quantify two strategies of explore-exploit behavior: directed exploration, where information drives exploration by choice, and random exploration, where behavioral variability drives exploration by chance. We find a clear correlation between temporal discounting and directed exploration, with more temporal discounting leading to less directed exploration. Conversely, we find no relationship between temporal discounting and random exploration. Unexpectedly, we find that the relationship with directed exploration appears to be driven by a correlation between temporal discounting and uncertainty seeking at short time horizons, rather than information seeking at long horizons. Taken together our results suggest a nuanced relationship between temporal discounting and explore-exploit behavior that may be mediated by multiple factors.
Title: Temporal discounting correlates with directed exploration but not with random exploration
Description:
Abstract
The explore-exploit dilemma describes the trade off that occurs any time we must choose between exploring unknown options and exploiting options we know well.
Implicit in this trade off is how we value future rewards — exploiting is usually better in the short term, but in the longer term the benefits of exploration can be huge.
Thus, in theory there should be a tight connection between how much people value future rewards, i.
e.
how much they discount future rewards relative to immediate rewards, and how likely they are to explore, with less ‘temporal discounting’ associated with more exploration.
By measuring individual differences in temporal discounting and correlating them with explore-exploit behavior, we tested whether this theoretical prediction holds in practice.
We used the 27-item Delay-Discounting Questionnaire to estimate temporal discounting and the Horizon Task to quantify two strategies of explore-exploit behavior: directed exploration, where information drives exploration by choice, and random exploration, where behavioral variability drives exploration by chance.
We find a clear correlation between temporal discounting and directed exploration, with more temporal discounting leading to less directed exploration.
Conversely, we find no relationship between temporal discounting and random exploration.
Unexpectedly, we find that the relationship with directed exploration appears to be driven by a correlation between temporal discounting and uncertainty seeking at short time horizons, rather than information seeking at long horizons.
Taken together our results suggest a nuanced relationship between temporal discounting and explore-exploit behavior that may be mediated by multiple factors.
Related Results
The Impact of Experiencing Outcomes on Delay and Probability Discounting
The Impact of Experiencing Outcomes on Delay and Probability Discounting
<p>Temporal and probability discounting refer to the decrease in subjective value of rewards that are either delayed or uncertain. Comparatively high degrees of discounting h...
Delay discounting correlates with depression but does not predict relapse after antidepressant discontinuation
Delay discounting correlates with depression but does not predict relapse after antidepressant discontinuation
Background: Approximately one third of people with major depressive disorder experience a relapse within six months of discontinuing antidepressant medication (ADM), however, relia...
High delay discounting relates to core symptoms and to pulvinar atrophy in frontotemporal dementia
High delay discounting relates to core symptoms and to pulvinar atrophy in frontotemporal dementia
AbstractBehavioural variant frontotemporal dementia (bvFTD) is a neurodegenerative disorder characterized by behavioural changes and atrophy in brain regions important for decision...
Trial-wise exposure to visual appetitive cues increases physiological arousal but not temporal discounting
Trial-wise exposure to visual appetitive cues increases physiological arousal but not temporal discounting
AbstractHumans and many animals devalue future rewards as a function of time (temporal discounting). Increased discounting has been linked to various psychiatric conditions, includ...
The role of memory in temporal discounting
The role of memory in temporal discounting
A widely observed phenomenon in intertemporal choice is temporal discounting; people prefer to have rewards sooner rather than later, even if the delayed rewards are larger. Despit...
Role of the Frontal Lobes in the Propagation of Mesial Temporal Lobe Seizures
Role of the Frontal Lobes in the Propagation of Mesial Temporal Lobe Seizures
Summary: The depth ictal electroencephalographic (EEG) propagation sequence accompanying 78 complex partial seizures of mesial temporal origin was reviewed in 24 patients (15 from...
A meta-analysis of neural systems underlying delay discounting: implications for transdiagnostic research
A meta-analysis of neural systems underlying delay discounting: implications for transdiagnostic research
ABSTRACTDelay discounting is a promising paradigm for transdiagnostic research because both excessive and insufficient tendency to discount future rewards have been reported across...
Topographical distribution of structural impairments mediating increased impatience for reward
Topographical distribution of structural impairments mediating increased impatience for reward
Abstract
Choices involving trade-offs between larger later (LL) and smaller sooner (SS) rewards—a process known as delay discounting—are altered in many psychiatric and neu...

