Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Spatially Robust and Interpretable Machine Learning Framework for Rainfall Erosivity Prediction

View through CrossRef
Rainfall erosivity (R30) is a critical driver of soil erosion processes, yet its spatial and temporal variability poses significant challenges for reliable prediction. This study presents a spatially robust and interpretable machine learning framework for modelling R30 using ensemble and neural network approaches. A comprehensive modelling pipeline was developed incorporating structured pre-processing, logarithmic target transformation, and a station-wise GroupKFold strategy was explicitly designed to eliminate spatial leakage, a common but often overlooked source of over-optimistic performance in erosivity modelling studies. Hyperparameter optimization was conducted using RandomizedSearchCV to ensure model generalization. Among the evaluated algorithms, the optimized Random Forest model demonstrated superior performance compared to a tuned multilayer perceptron (MLP), achieving a mean spatial cross-validated R2 of approximately 0.70 on the original R30 scale. SHAP (Shapley Additive Explanations) analysis revealed rainfall energy–related indices as dominant predictors, confirming physical consistency between model behaviour and erosivity mechanisms. This study jointly combines spatial GroupKFold validation, SHAP-based physical interpretability, and temporal forecasting assessment for R30 modelling at the station scale Spatial residual diagnostics indicated no systematic geographic bias, supporting model robustness across heterogeneous climatic regimes. Temporal generalization was further assessed using a leave-last-year-per-station validation scheme, yielding an R2 of 0.51 under forecasting conditions. Prediction uncertainty was quantified through tree-level ensemble variability, with 90% prediction intervals achieving approximately 88% empirical coverage. Interval widths increased for extreme erosivity events, reflecting heightened stochastic variability. The proposed framework demonstrates that ensemble machine learning methods can provide accurate, interpretable, and uncertainty-aware prediction of rainfall erosivity while maintaining strict spatial validation standards. This is one of the first studies to jointly integrate spatially explicit cross-validation, SHAP-based physical interpretability, and probabilistic uncertainty estimation for station-scale rainfall erosivity modelling
Title: A Spatially Robust and Interpretable Machine Learning Framework for Rainfall Erosivity Prediction
Description:
Rainfall erosivity (R30) is a critical driver of soil erosion processes, yet its spatial and temporal variability poses significant challenges for reliable prediction.
This study presents a spatially robust and interpretable machine learning framework for modelling R30 using ensemble and neural network approaches.
A comprehensive modelling pipeline was developed incorporating structured pre-processing, logarithmic target transformation, and a station-wise GroupKFold strategy was explicitly designed to eliminate spatial leakage, a common but often overlooked source of over-optimistic performance in erosivity modelling studies.
Hyperparameter optimization was conducted using RandomizedSearchCV to ensure model generalization.
Among the evaluated algorithms, the optimized Random Forest model demonstrated superior performance compared to a tuned multilayer perceptron (MLP), achieving a mean spatial cross-validated R2 of approximately 0.
70 on the original R30 scale.
SHAP (Shapley Additive Explanations) analysis revealed rainfall energy–related indices as dominant predictors, confirming physical consistency between model behaviour and erosivity mechanisms.
This study jointly combines spatial GroupKFold validation, SHAP-based physical interpretability, and temporal forecasting assessment for R30 modelling at the station scale Spatial residual diagnostics indicated no systematic geographic bias, supporting model robustness across heterogeneous climatic regimes.
Temporal generalization was further assessed using a leave-last-year-per-station validation scheme, yielding an R2 of 0.
51 under forecasting conditions.
Prediction uncertainty was quantified through tree-level ensemble variability, with 90% prediction intervals achieving approximately 88% empirical coverage.
Interval widths increased for extreme erosivity events, reflecting heightened stochastic variability.
The proposed framework demonstrates that ensemble machine learning methods can provide accurate, interpretable, and uncertainty-aware prediction of rainfall erosivity while maintaining strict spatial validation standards.
This is one of the first studies to jointly integrate spatially explicit cross-validation, SHAP-based physical interpretability, and probabilistic uncertainty estimation for station-scale rainfall erosivity modelling.

Related Results

A Rainfall Erosivity database for Brazil from MSWEP rainfall dataset
A Rainfall Erosivity database for Brazil from MSWEP rainfall dataset
<p>In erosion studies the behavior of rainfall is primordial, since rain is responsible for the first stage of the erosion process: the detachment of soil particles. ...
A framework of abrupt changes and trends detection for rainfall erosivity
A framework of abrupt changes and trends detection for rainfall erosivity
<p>Rainfall erosivity (R factor), in the Universal Soil Loss Equation (USLE) , a climate index, is used worldwide to assess and...
Dynamic Rainfall Erosivity Estimates Derived from GPM IMERG data
Dynamic Rainfall Erosivity Estimates Derived from GPM IMERG data
Abstract. Soil degradation is a critical threat to agriculture and food security around the world. Understanding the processes that drive soil erosion is necessary to support susta...
SPATIAL AND TEMPORAL DISTRIBUTION OF RAINFALL EROSIVITY IN BRAZILIAN AMAZONIA: A HYDROLOGICAL PERSPECTIVE BASED ON CMORPH TECHNIQUE
SPATIAL AND TEMPORAL DISTRIBUTION OF RAINFALL EROSIVITY IN BRAZILIAN AMAZONIA: A HYDROLOGICAL PERSPECTIVE BASED ON CMORPH TECHNIQUE
Rainfall erosivity is one crucial factor that controls soil erosion. The interannual variability of rainfall erosivity in Brazilian Amazonia connected to the El Niño-Southern Oscil...
Rainfall erosivity estimation using gridded daily precipitation datasets
Rainfall erosivity estimation using gridded daily precipitation datasets
Abstract. Rainfall erosivity is one of the most important factors incorporated into the empirical soil erosion models USLE (Universal Soil Loss Equation) and RUSLE (Revised Univers...
Spatiotemporal Distribution of Rainfall Erosivity in the Bailong River Basin, Gansu Province
Spatiotemporal Distribution of Rainfall Erosivity in the Bailong River Basin, Gansu Province
Rainfall erosivity is one of the key parameters that determine soil erosion, sediment yield, and water quality, thus its importance has grown in modeling of the environmental effec...
Impact of Time Resolution of Rainfall Measurement on Erosivity Factor in Arid Region of India
Impact of Time Resolution of Rainfall Measurement on Erosivity Factor in Arid Region of India
Rainfall erosivity is considered as a vital factor in computing soil loss through erosion prediction models such as original and derived versions of the universal soil loss equatio...
Projected changes in US erosivity
Projected changes in US erosivity
Abstract. Downscaled rainfall projections from 21 climate models from the CMIP5 archive are used to estimate future changes in rainfall erosivity in the continental Unites States. ...

Back to Top