Javascript must be enabled to continue!
Retrieving gapless 1-km land surface temperature based on numerical model and auto machine learning approach
View through CrossRef
Land Surface Temperature (LST) is crucial in many areas; but seamless LST data are difficult to obtain due to limitations in thermal infrared sensor technologies. Numerical modeling, which is based on physics-driven process, can simulate continuous spatial and temporal data. Simultaneously, machine learning, a typical data-driven approach, has been effective in remotely-sensed data reconstruction. In this study, we designed a fusion framework that combines the strengths of numerical modeling and machine learning. The framework includes the following steps: 1) Optimization of the numerical model: We use the urbanized High-Resolution Land Data Assimilation System (u-HRLDAS) model. Various spatio-temporal data sources are used to refine and optimize the model's simulations. 2) Database creation for LST reconstruction: This database incorporates forcing variables like 2-meter temperature, relative humidity, air pressure, wind speed, downward longwave and shortwave radiation for the u-HRLDAS model, along with the model's simulated LST outputs. Additional remotely-sensed data such as the Digital Elevation Model (DEM), Normalized Difference Vegetation Index (NDVI), latitude, longitude, land use and cover, and slope are also included. The datasets span the summer months (June to August) from 2011 to 2014. Daily LST data from MOD11A1 and MYD11A1 are used as label data. 3) Optimal model identification via automatic machine learning framework: The MODIS LST data in the database serves as training labels, with a 70/30 split for training and validation. Evaluation metrics like RMSE, MAE, and R² guide the selection. We chose the AutoGluon-Tabular framework, developed by Amazon, for its superior performance, which is achieved through bagging and stacking methods.  Finally, the 1-km seamless LST is reconstructed based on the model with the highest accuracy in validation.Taking Xi’an city in China as the study region, nine models (Weightensemble_L2, LightGBMLarge, XGBoost, LightGBM, CatBoost, LightGBM, ExtraTree, NeuralNetTorch, and NeuralNetFastAI) were trained within the Autogluon-Tabular framework. These models displayed RMSE values ranging from 0.737 to 1.417 K, MAE spanning 0.522 to 1.031 K, and R² from 0.967 to 0.991. Notably, the Weightedensemble_L2 model excelled, with the lowest RMSE (0.737) and MAE (0.522), and the highest R² (0.991), closely followed by the LightBGMlarge model. with RMSE, MAE, and R² values of 0.739, 0.526, and 0.991, respectively. Furthermore, we conducted supplementary testing using four reserved MODIS LST images. Employing the previously trained WeightedEnsemble_L2 model, seamless predictions of MODIS LST were generated at four overpass timestamps: 02:30, 05:30, 14:30, and 17:30. The resulting spatial distributions is similar with the observed LST, validating our method's capability to capture LST's spatial characteristics and ensure spatial continuity compared to the original MODIS LST data.In conclusion, the proposed fusion framework which integrates numerical modeling and automatic machine learning, successfully reconstructed LST with high accuracy and strong spatial similarities. There are still shortcomings of this method, such as the predicted images losing some spatial details compared to the observations, which need to be improved in the future.
Title: Retrieving gapless 1-km land surface temperature based on numerical model and auto machine learning approach
Description:
Land Surface Temperature (LST) is crucial in many areas; but seamless LST data are difficult to obtain due to limitations in thermal infrared sensor technologies.
Numerical modeling, which is based on physics-driven process, can simulate continuous spatial and temporal data.
Simultaneously, machine learning, a typical data-driven approach, has been effective in remotely-sensed data reconstruction.
In this study, we designed a fusion framework that combines the strengths of numerical modeling and machine learning.
The framework includes the following steps: 1) Optimization of the numerical model: We use the urbanized High-Resolution Land Data Assimilation System (u-HRLDAS) model.
Various spatio-temporal data sources are used to refine and optimize the model's simulations.
2) Database creation for LST reconstruction: This database incorporates forcing variables like 2-meter temperature, relative humidity, air pressure, wind speed, downward longwave and shortwave radiation for the u-HRLDAS model, along with the model's simulated LST outputs.
Additional remotely-sensed data such as the Digital Elevation Model (DEM), Normalized Difference Vegetation Index (NDVI), latitude, longitude, land use and cover, and slope are also included.
The datasets span the summer months (June to August) from 2011 to 2014.
Daily LST data from MOD11A1 and MYD11A1 are used as label data.
3) Optimal model identification via automatic machine learning framework: The MODIS LST data in the database serves as training labels, with a 70/30 split for training and validation.
Evaluation metrics like RMSE, MAE, and R² guide the selection.
We chose the AutoGluon-Tabular framework, developed by Amazon, for its superior performance, which is achieved through bagging and stacking methods.
 Finally, the 1-km seamless LST is reconstructed based on the model with the highest accuracy in validation.
Taking Xi’an city in China as the study region, nine models (Weightensemble_L2, LightGBMLarge, XGBoost, LightGBM, CatBoost, LightGBM, ExtraTree, NeuralNetTorch, and NeuralNetFastAI) were trained within the Autogluon-Tabular framework.
These models displayed RMSE values ranging from 0.
737 to 1.
417 K, MAE spanning 0.
522 to 1.
031 K, and R² from 0.
967 to 0.
991.
Notably, the Weightedensemble_L2 model excelled, with the lowest RMSE (0.
737) and MAE (0.
522), and the highest R² (0.
991), closely followed by the LightBGMlarge model.
with RMSE, MAE, and R² values of 0.
739, 0.
526, and 0.
991, respectively.
Furthermore, we conducted supplementary testing using four reserved MODIS LST images.
Employing the previously trained WeightedEnsemble_L2 model, seamless predictions of MODIS LST were generated at four overpass timestamps: 02:30, 05:30, 14:30, and 17:30.
The resulting spatial distributions is similar with the observed LST, validating our method's capability to capture LST's spatial characteristics and ensure spatial continuity compared to the original MODIS LST data.
In conclusion, the proposed fusion framework which integrates numerical modeling and automatic machine learning, successfully reconstructed LST with high accuracy and strong spatial similarities.
There are still shortcomings of this method, such as the predicted images losing some spatial details compared to the observations, which need to be improved in the future.
Related Results
Study on Urban Thermal Environment based on Diurnal Temperature Range
Study on Urban Thermal Environment based on Diurnal Temperature Range
<p>Diurnal temperature range (includes land surface temperature diurnal range and near surface air temperature diurnal range) is an important meteorological parameter...
Electronic Equipment Usage on Japanese Vehicles
Electronic Equipment Usage on Japanese Vehicles
<div class="htmlview paragraph">The application of a wide assortment of electronic systems to motor vehicles currently under serious consideration in the automotive and elect...
LAND USE OPTIMIZATION IN UKRAINE AT THE STAGE OF LAND MARKET FORMATION
LAND USE OPTIMIZATION IN UKRAINE AT THE STAGE OF LAND MARKET FORMATION
In the context of the reform of the sale of agricultural land, the priority is to optimize land use, which is to find a balance of land that would meet their environmental, economi...
Land Degradation Assessment in Pakistan based on LU and VCF
Land Degradation Assessment in Pakistan based on LU and VCF
Abstract
Land degradation is a global environmental issue receiving much attention currently. According to the definition and interpretation of land degradation by relevant...
The Land Use Model Intercomparison Project (LUMIP): Rationale and experimental design
The Land Use Model Intercomparison Project (LUMIP): Rationale and experimental design
Abstract. Human land-use activities have resulted in large to the Earth surface, with resulting implications for climate. In the future, land-use activities are likely to expand an...
Principles of land law of Ukraine
Principles of land law of Ukraine
This article is devoted to topical issues of legal regulation of land relations in the state.
One of the central issues of land relations in Ukraine - the issue of principles of l...
Modelling of Unauthorized Land Use Sprawl using High Resolution Data and GIS Based-Cellular Automata
Modelling of Unauthorized Land Use Sprawl using High Resolution Data and GIS Based-Cellular Automata
This study uses Cellular Automata (CA) model, Geography Information System (GIS) and remote sensing data to produce trend and sprawl pattern simulation of un-authorized land use in...

