Javascript must be enabled to continue!
Gridding crowd-sourced weather data: is quality control required?
View through CrossRef
Context. For monitoring, analysing and forecasting the impact of weather and climate change on society, we see an increasing need for high-quality, high-resolution gridded weather and climate services. The official networks of the National Meteorological and Hydrological Services (NMHS) sample part of the land-use excluding, for example, urban areas, which motivated us to include third-party data which does sample urban areas.  We have provided the results in real-time as well as historical gridded data services.Approach. By including a statistical, simplified model of the observation error in our gridding approach (i.e. multi-fidelity regression Kriging), we did not include any advanced quality control (QC) of the crowd-sourced data. This might sound surprising in the context of weather and climate data,but is consistent with approaches in other scientific disciplines (e.g. marine engineering, aerospace engineering).Results. In this study, we investigate the quantitative effect of state-of-the-art quality control on the accuracy of gridded services, by analysing hourly temperature observations in The Netherlands for the year 2023. Our results indicate that quality control of crowd-sourced weather data can potentially increase the accuracy of straightforward – and commonly used – nearest-neighbour approximation, but generally deteriorates the accuracy of more advanced gridded services. This finding indicates that using strict QC procedures to turn crowd-sourced data into a dataset with similar fidelity as the NMHS-sourced data is not the way to go. Therefore, we do indeed continue to blend first-party data, crowd-sourced data and land-use data without applying any advanced quality control to the crowd-sourced data.Ecosystem. We emphasize that crowd-sourced weather data only improves services when it is blended with a high-quality network of NMHS data and possibly with land-use data. In other words, the availability of crowd-sourced data does not remove the need for high-quality NMHS observation networks. Therefore, we do not intend to present crowd-sourced data as a stand-alone product; rather, it lives in an integrated ecosystem.
Title: Gridding crowd-sourced weather data: is quality control required?
Description:
Context.
For monitoring, analysing and forecasting the impact of weather and climate change on society, we see an increasing need for high-quality, high-resolution gridded weather and climate services.
The official networks of the National Meteorological and Hydrological Services (NMHS) sample part of the land-use excluding, for example, urban areas, which motivated us to include third-party data which does sample urban areas.
 We have provided the results in real-time as well as historical gridded data services.
Approach.
By including a statistical, simplified model of the observation error in our gridding approach (i.
e.
multi-fidelity regression Kriging), we did not include any advanced quality control (QC) of the crowd-sourced data.
This might sound surprising in the context of weather and climate data,but is consistent with approaches in other scientific disciplines (e.
g.
marine engineering, aerospace engineering).
Results.
In this study, we investigate the quantitative effect of state-of-the-art quality control on the accuracy of gridded services, by analysing hourly temperature observations in The Netherlands for the year 2023.
Our results indicate that quality control of crowd-sourced weather data can potentially increase the accuracy of straightforward – and commonly used – nearest-neighbour approximation, but generally deteriorates the accuracy of more advanced gridded services.
This finding indicates that using strict QC procedures to turn crowd-sourced data into a dataset with similar fidelity as the NMHS-sourced data is not the way to go.
Therefore, we do indeed continue to blend first-party data, crowd-sourced data and land-use data without applying any advanced quality control to the crowd-sourced data.
Ecosystem.
We emphasize that crowd-sourced weather data only improves services when it is blended with a high-quality network of NMHS data and possibly with land-use data.
In other words, the availability of crowd-sourced data does not remove the need for high-quality NMHS observation networks.
Therefore, we do not intend to present crowd-sourced data as a stand-alone product; rather, it lives in an integrated ecosystem.
Related Results
Crowd Density Estimation via Global Crowd Collectiveness Metric
Crowd Density Estimation via Global Crowd Collectiveness Metric
Drone-captured crowd videos have become increasingly prevalent in various applications in recent years, including crowd density estimation via measuring crowd collectiveness. Tradi...
Efficient Hexagonal Gridding Method for DNA Microarray Image
Efficient Hexagonal Gridding Method for DNA Microarray Image
In genetics, a deoxyribonucleic acid (DNA) microarray is a useful instrument that is frequently used to track thousands of genes' expression levels simultaneously. For DNA microarr...
Is Tommy John Surgery Performed More Frequently in Major League Baseball Pitchers From Warm Weather Areas?
Is Tommy John Surgery Performed More Frequently in Major League Baseball Pitchers From Warm Weather Areas?
Background: Medial ulnar collateral ligament (UCL) reconstruction is a common procedure performed on Major League Baseball (MLB) pitchers with symptomatic UCL insufficiency, freque...
Weather Sensitivity Surveys in Germany
Weather Sensitivity Surveys in Germany
Weather sensitivity describes reactions of the human body related to the current weather and especially to changes in weather. A clear scientific  definition of weather se...
Challenges of Deep Learning for Crowd Analytics
Challenges of Deep Learning for Crowd Analytics
In high population cities, the gatherings of large crowds in public places and public areas accelerate or jeopardize people safety and transportation, which is a key challenge to t...
Crowd-Based Requirements Engineering
Crowd-Based Requirements Engineering
The high availability of data on the Internet has opened up new avenues for software development and deployment. For Requirements Engineering (RE), in particular, online user feedb...
Contemporary understanding of riots: Classical crowd psychology, ideology and the social identity approach
Contemporary understanding of riots: Classical crowd psychology, ideology and the social identity approach
This article explores the origins and ideology of classical crowd psychology, a body of theory reflected in contemporary popularised understandings such as of the 2011 English ‘rio...
Requirements for the use of impact-based forecasts by road safety organisations in Germany
Requirements for the use of impact-based forecasts by road safety organisations in Germany
<p>Road transportation is considered the most vulnerable mode of transport in terms of weather impacts (Molarius et al., 2014). Extreme weather events like winter sto...

