Javascript must be enabled to continue!
Spatial interpolation of daily precipitation using random forest
View through CrossRef
<p>Regression kriging is one of the most popular spatial interpolation techniques. Its main strength is that it exploits both spatial autocorrelation as well as information contained in environmental covariates. While regression kriging is still dominant, in the past few years machine learning, especially random forest, is increasingly being used for mapping. Machine learning is more flexible than multiple linear regression and can thus make better use of environmental covariates. But machine learning typically ignores spatial autocorrelation. Several attempts have been made to include spatial autocorrelation in random forest, by adding distances to observation locations and other geometries to the set of covariates. But none of these studies have tried the obvious solution to include the nearest observations themselves and the distances to the nearest observations as covariates. In this study we tried this solution by introducing and testing Random Forest for Spatial Interpolation (RFSI). RFSI trains a random forest model on environmental covariates as well as nearest observations and their distances from the prediction point. We applied and evaluated RFSI for mapping daily precipitation in Catalonia for the 2016-2018 period. We trained four models (random forests, RFsp, pooled regression kriging and RFSI) using 63,927 daily precipitation observations from 87 GHCN-stations located in Catalonia. Maximum and minimum daily temperatures and IMERG precipitation estimates (derived from the GPM mission) were used as environmental covariates for all four models. Results based on 5-fold cross validation showed that RFSI (R-square 69.4%, RMSE 3.8 mm) significantly outperformed all random forest (R-square 50.6%, RMSE 3.8 mm), RFsp (R-square 55.5%, RMSE 4.6 mm) and pooled regression kriging (R-square 65.3%, RMSE 4.0 mm). Finetuning RFSI could potentially improve prediction accuracy even more. In addition to improved prediction accuracy, RFSI has the advantage that it uses much fewer spatial covariates than RFsp.</p>
Title: Spatial interpolation of daily precipitation using random forest
Description:
<p>Regression kriging is one of the most popular spatial interpolation techniques.
Its main strength is that it exploits both spatial autocorrelation as well as information contained in environmental covariates.
While regression kriging is still dominant, in the past few years machine learning, especially random forest, is increasingly being used for mapping.
Machine learning is more flexible than multiple linear regression and can thus make better use of environmental covariates.
But machine learning typically ignores spatial autocorrelation.
Several attempts have been made to include spatial autocorrelation in random forest, by adding distances to observation locations and other geometries to the set of covariates.
But none of these studies have tried the obvious solution to include the nearest observations themselves and the distances to the nearest observations as covariates.
In this study we tried this solution by introducing and testing Random Forest for Spatial Interpolation (RFSI).
RFSI trains a random forest model on environmental covariates as well as nearest observations and their distances from the prediction point.
We applied and evaluated RFSI for mapping daily precipitation in Catalonia for the 2016-2018 period.
We trained four models (random forests, RFsp, pooled regression kriging and RFSI) using 63,927 daily precipitation observations from 87 GHCN-stations located in Catalonia.
Maximum and minimum daily temperatures and IMERG precipitation estimates (derived from the GPM mission) were used as environmental covariates for all four models.
Results based on 5-fold cross validation showed that RFSI (R-square 69.
4%, RMSE 3.
8 mm) significantly outperformed all random forest (R-square 50.
6%, RMSE 3.
8 mm), RFsp (R-square 55.
5%, RMSE 4.
6 mm) and pooled regression kriging (R-square 65.
3%, RMSE 4.
0 mm).
Finetuning RFSI could potentially improve prediction accuracy even more.
In addition to improved prediction accuracy, RFSI has the advantage that it uses much fewer spatial covariates than RFsp.
</p>.
Related Results
Spatio-temporal Distribution Characteristics of Summer Precipitation Duration in Northwest China
Spatio-temporal Distribution Characteristics of Summer Precipitation Duration in Northwest China
Based on the daily precipitation observation data of 208 rain-gauge
stations in Northwest China from 1961 to 2020, we use the statistical
analysis method, the Mann-Kendall test met...
Impact of spatial resolution and interpolation schemes of precipitation data on hydrological modeling
Impact of spatial resolution and interpolation schemes of precipitation data on hydrological modeling
<p>Hydrological modelling depends strongly on precipitation input. Especially, to simulate very localized heavy precipitation events, models require precipitation inf...
Trend in Extreme Precipitation Indices Based on Long Term In Situ Precipitation Records over Pakistan
Trend in Extreme Precipitation Indices Based on Long Term In Situ Precipitation Records over Pakistan
Assessing the long-term precipitation changes is of utmost importance for understanding the impact of climate change. This study investigated the variability of extreme precipitati...
Factors influencing and patterns of forest utilization in communities around the Huay Tak Teak Biosphere Reserve, Lampang Province
Factors influencing and patterns of forest utilization in communities around the Huay Tak Teak Biosphere Reserve, Lampang Province
Background and Objectives: To establish the land regulation, it is necessary to know basic information of the surrounding community’s land use and to be aware of basic forest laws....
Application of Random Forest Algorithm and Multi-Temporal Satellite Data for Forest Types Classification in Chiang Mai Province
Application of Random Forest Algorithm and Multi-Temporal Satellite Data for Forest Types Classification in Chiang Mai Province
Background and Objectives: Chiang Mai Province is strategically important as a major watershed area for the Ping River basin ecosystem and boasts one of the highest biodiversity le...
Significant Reduction in Precipitation Seasonality and the Association with Extreme Precipitation in the Hai River Basin of China from 1960 to 2018
Significant Reduction in Precipitation Seasonality and the Association with Extreme Precipitation in the Hai River Basin of China from 1960 to 2018
The Hai River Basin (HRB) serves as a vital center for the population, economy and politics in northern China. Natural hazards, particularly floods, pose significant risks to the r...
Secondary Succession in the Lowland Forests of the Marlborough Sounds Maritime Park
Secondary Succession in the Lowland Forests of the Marlborough Sounds Maritime Park
<p>This study documents aspects of the forest recovery process in secondary communities of the Marlborough sounds Maritime park. some 39 types of seral vegetation were recogn...
SYSTEMATIZATION OF INTERPOLATION METHODS IN GEOMETRIC MODELING
SYSTEMATIZATION OF INTERPOLATION METHODS IN GEOMETRIC MODELING
This article proposes to show the connections between the practical problems of geometric modeling, for the solution of which there are no adequate interpolation methods. Interpola...

