
Multi-step prediction of dissolved oxygen in rivers based on random forest missing value imputation and attention mechanism coupled with recurrent neural network

Abstract: Accurately predicting dissolved oxygen is of great significance to the intelligent management and control of river water quality. However, because of interference from external factors and the irregularity of its changes, this remains a difficult problem, especially in multi-step forecasting. This article addresses two issues. First, we analyze the missing water quality data and propose using the random forest algorithm to impute the missing values. Then, we systematically discuss and compare water quality prediction methods based on attention-based RNNs, and extend the attention-based RNN to multi-step prediction of dissolved oxygen. Finally, we apply the model to the canal in Jiangnan (China) and compare it with eight baseline methods. In single-step dissolved oxygen prediction, the attention-based GRU model performs better: its MAE, RMSE, and R² are 0.051, 0.225, and 0.958, which are better than those of the baseline methods. The attention-based GRU was then extended to multi-step prediction, where it predicts dissolved oxygen over the next 20 hours with high accuracy; the MAE, RMSE, and R² are 0.253, 0.306, and 0.918. Experimental results show that the attention-based GRU achieves more accurate dissolved oxygen prediction in both single-step and multi-step settings.
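The abstract reports MAE, RMSE, and R² for single-step and multi-step forecasts but does not spell out how the samples or metrics are built. The following is a minimal sketch, assuming the standard metric definitions and a plain sliding-window split; the function names, the window lengths, and the sample readings are hypothetical and are not taken from the paper. A naive persistence baseline stands in for the model, so the numbers it produces say nothing about the reported results.

```python
import math


def make_windows(series, n_in, n_out):
    """Split a series into (input window, multi-step target) pairs."""
    pairs = []
    for start in range(len(series) - n_in - n_out + 1):
        x = series[start:start + n_in]
        y = series[start + n_in:start + n_in + n_out]
        pairs.append((x, y))
    return pairs


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    sq = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(sq / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical hourly dissolved oxygen readings (mg/L), for illustration only.
do_series = [8.1, 7.9, 7.6, 7.4, 7.5, 7.8, 8.0, 8.2]

# Assumed window sizes: 4 past readings in, 2 future steps out.
windows = make_windows(do_series, n_in=4, n_out=2)

# Persistence baseline (an assumption, not the paper's model):
# repeat the last observed value for every future step.
y_true = [v for _, y in windows for v in y]
y_pred = [x[-1] for x, y in windows for _ in y]

print(mae(y_true, y_pred), rmse(y_true, y_pred), r2(y_true, y_pred))
```

In a multi-step setting the same three metrics are usually computed over all predicted steps pooled together, as here, or per horizon step; the abstract does not say which convention the authors used.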

IWA Publishing

Juan Huan, Mingbao Li, Xiangen Xu, Hao Zhang, Beier Yang, Jiang Jianming, Bing Shi

Water Supply

2022


Abstract: Left-censored missing values commonly exist in targeted metabolomics datasets and can be considered as missing not at random (MNAR). Improper data processing procedures for...

A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis

Background: Metabolomics data generation and quantification are different from other types of molecular “omics” data in bioinformatics. Mass spectrometry (MS) based (gas chromatogra...

Uncovering the consequences of batch effect associated missing values in omics data analysis

Abstract: Statistical analyses in high-dimensional omics data are often hampered by the presence of batch effects (BEs) and missing values (MVs), but the interaction between these tw...

Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis

Abstract. Background: The COVID-19 pandemic has revealed significant challenges in disease forecasting and in developing a public health response, ...

A framework for testing different imputation methods for tabular datasets

Abstract. Background and purpose: Handling missing values is a prevalent challenge in the analysis of clinical data. The rise of data-driven models demands an efficient use of the avai...

MissForest—non-parametric missing value imputation for mixed-type data

Abstract. Motivation: Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-...

Temporary Rivers

Temporary rivers are those that do not flow continuously through time along their entire length. The phrase temporary rivers primarily came into use during the first decade of the ...

Enhancing data integrity in Electronic Health Records: Review of methods for handling missing data

Abstract. Introduction: Electronic Health Records (EHRs) are vital repositories of patient information for medical research, but the prevalence of missing data presents an obstacle to ...


Related Results