
Balancing exploration and exploitation: task-targeted exploration for scientific decision-making

Title: Balancing exploration and exploitation: task-targeted exploration for scientific decision-making
Description:
How do we collect observational data that reveal fundamental properties of scientific phenomena? This is a key challenge in modern scientific discovery.
Scientific phenomena are complex—they have high-dimensional and continuous state, exhibit chaotic dynamics, and generate noisy sensor observations.
Additionally, scientific experimentation often requires significant time, money, and human effort.
In the face of these challenges, we propose to leverage autonomous decision-making to augment and accelerate human scientific discovery.
Autonomous decision-making in scientific domains faces an important and classical challenge: balancing exploration and exploitation when making decisions under uncertainty.
This thesis argues that efficient decision-making in real-world, scientific domains requires task-targeted exploration—exploration strategies that are tuned to a specific task.
By quantifying the change in task performance due to exploratory actions, we enable decision-makers that can contend with highly uncertain real-world environments, performing exploration parsimoniously to improve task performance.
The thesis presents three novel paradigms for task-targeted exploration that are motivated by and applied to real-world scientific problems.
We first consider exploration in partially observable Markov decision processes (POMDPs) and present two novel planners that leverage task-driven information measures to balance exploration and exploitation.
These planners drive robots in simulation and oceanographic field trials to robustly identify plume sources and track targets with stochastic dynamics.
We next consider the exploration-exploitation trade-off in online learning paradigms, a robust alternative to POMDPs when the environment is adversarial or difficult to model.
We present novel online learning algorithms that balance exploitative and exploratory plays optimally under real-world constraints, including delayed feedback, partial predictability, and short regret horizons.
We use these algorithms to perform model selection for subseasonal temperature and precipitation forecasting, achieving state-of-the-art forecasting accuracy.
The human scientific endeavor is poised to benefit from our emerging capacity to integrate observational data into the process of model development and validation.
Realizing the full potential of these data requires autonomous decision-makers that can contend with the inherent uncertainty of real-world scientific domains.
This thesis highlights the critical role that task-targeted exploration plays in efficient scientific decision-making and proposes three novel methods to achieve task-targeted exploration in real-world oceanographic and climate science applications.
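
As a rough illustration of the online-learning setting described above, the following is a minimal sketch of the classical exponential-weights (Hedge) algorithm choosing among a fixed set of candidate forecasting models. It is a standard textbook method, not one of the thesis's algorithms, and it assumes immediate full feedback rather than the delayed feedback the abstract mentions. The function name `hedge`, the learning rate `eta`, and the synthetic losses are all hypothetical.

```python
import math
import random


def hedge(losses, eta):
    """Exponential-weights (Hedge) over a fixed set of candidate models.

    losses: list of T rounds; losses[t][k] in [0, 1] is model k's loss at round t.
    eta: positive learning rate.
    Returns the list of weight vectors played at each round.
    """
    n_models = len(losses[0])
    cum_loss = [0.0] * n_models
    plays = []
    for round_losses in losses:
        # Weights are proportional to exp(-eta * cumulative loss).
        # Subtracting the minimum keeps the exponentials numerically stable.
        base = min(cum_loss)
        raw = [math.exp(-eta * (c - base)) for c in cum_loss]
        total = sum(raw)
        plays.append([r / total for r in raw])
        # Full feedback: every model's loss is revealed after the play.
        cum_loss = [c + l for c, l in zip(cum_loss, round_losses)]
    return plays


# Synthetic example (assumed data): model 1 has lower loss on average.
rng = random.Random(0)
T, K = 200, 3
losses = [[rng.random() * (0.5 if k == 1 else 1.0) for k in range(K)]
          for _ in range(T)]

eta = math.sqrt(8 * math.log(K) / T)
plays = hedge(losses, eta)

# Expected loss of the learner versus the best single model in hindsight.
learner_loss = sum(p * l for pt, lt in zip(plays, losses)
                   for p, l in zip(pt, lt))
best_loss = min(sum(lt[k] for lt in losses) for k in range(K))
regret = learner_loss - best_loss
bound = math.sqrt(T * math.log(K) / 2)  # standard Hedge regret bound
```

With this choice of `eta` and losses in [0, 1], the regret against the best single model is at most `sqrt(T * ln(K) / 2)`. The early rounds spread weight across all models (exploration), and the weights concentrate on the best-performing model as evidence accumulates (exploitation).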

Related Results

Autonomy on Trial
This paper critically examines how US bioethics and health law conceptualize patient autonomy, contrasting the rights-based, individualist...
Eksploitasi Pekerja Anak: Kajian Terhadap Pekerja Anak di Sektor Perikanan
The purpose of this research is to analyze the forms of exploitation of child labour in the fisheries sector and the resistance of workers to the exploitation they experience. The ...
Dynamic information aggregation decision-making methods based on variable precision rough set and grey clustering
Purpose – The purpose of this paper is to construct a dynamic information aggregation decision-making model based on variable precision rough set. ...
A novel linguistic decision making approach based on attribute correlation and EDAS method
One of characteristics of large-scale linguistic decision making problems is that decision information with respect to decision making attributes is derived from multi-sour...
Dynamics of task allocation in global software development
Context: Global software development (GSD) promises high‐quality software at low cost. GSD enables around‐the‐clock development to achieve maximum production in a short perio...
Disturbance of Information in Superior Parietal Lobe during Dual-task Interference in a Simulated Driving Task
Performing a secondary task while driving causes a decline in driving performance. This phenomenon, called dual-task interference, can have lethal consequences. Previous fM...
GIS BASED DECISION SUPPORT SYSTEM FOR SEISMIC RISK IN BUCHAREST. CASE STUDY – THE HISTORICAL CENTRE
Because of the increasing volume of information, problem decisions tend to be more difficult to deal with. Achieving an objective and making a suitable decision may become a real c...
Data-Driven Decision Making in the Community College Context
This case study explored how data-driven decision making occurred within institutional planning activities at a California community college. The problem statement for this researc...
