Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

USA AIR QUALITY: FORECASTING PM2.5 CONCENTRATIONS USING MACHINE LEARNING MODELS

View through CrossRef
Emissions have been a serious topic worldwide and every authority in each country needs to set regulations and penalties to control emissions. These gas emissions have an impact on many sectors like society’s health, countries’ economies, and ecosystems. For that, it is always needed to build any new or updated regulation to be based on strong evidence and that is machine learning comes to fulfil the need. Machine learning algorithms can read big numbers of records and train them then build prediction models. This step allows the entry of historic data, and the algorithm will read and recognize the pattern of the PM2.5 concentrations. As a result of the prediction models, the forecasting will take a part and predict future values of PM2.5 concentrations in the provided meteorological condition available in the dataset. Using the dataset from Kaggle, the steps started with understanding the dataset and handling the errors that appeared like outlier removal, missing values imputation and eliminating the duplicated records. Then four regression machine learning models were applied which are: linear regression, random forest, neural network, and polynomial regression. These four models have been tested to predict the PM2.5 concentrations and tuned the hyperparameters to attain the best accuracy score measured by the value of R-squared and mean squared error MSE. As a result, the four models achieved a good score where linear regression R-squared value= 75.6%, Random Forest R-squared value= 74.9%, neural network R-squared value=75.4% and Polynomial regression accomplished the best model accuracy with R-squared= 100% and an MSE=0%. Taking one further step after defining the best-fitted model, is to forecast future values of the concentrations of PM2.5. Using forecaster code in python and a polynomial regression model to produce forecasting values by reading 500 backward steps of data and then generating the forecasted values.
Title: USA AIR QUALITY: FORECASTING PM2.5 CONCENTRATIONS USING MACHINE LEARNING MODELS
Description:
Emissions have been a serious topic worldwide and every authority in each country needs to set regulations and penalties to control emissions.
These gas emissions have an impact on many sectors like society’s health, countries’ economies, and ecosystems.
For that, it is always needed to build any new or updated regulation to be based on strong evidence and that is machine learning comes to fulfil the need.
Machine learning algorithms can read big numbers of records and train them then build prediction models.
This step allows the entry of historic data, and the algorithm will read and recognize the pattern of the PM2.
5 concentrations.
As a result of the prediction models, the forecasting will take a part and predict future values of PM2.
5 concentrations in the provided meteorological condition available in the dataset.
Using the dataset from Kaggle, the steps started with understanding the dataset and handling the errors that appeared like outlier removal, missing values imputation and eliminating the duplicated records.
Then four regression machine learning models were applied which are: linear regression, random forest, neural network, and polynomial regression.
These four models have been tested to predict the PM2.
5 concentrations and tuned the hyperparameters to attain the best accuracy score measured by the value of R-squared and mean squared error MSE.
As a result, the four models achieved a good score where linear regression R-squared value= 75.
6%, Random Forest R-squared value= 74.
9%, neural network R-squared value=75.
4% and Polynomial regression accomplished the best model accuracy with R-squared= 100% and an MSE=0%.
Taking one further step after defining the best-fitted model, is to forecast future values of the concentrations of PM2.
5.
Using forecaster code in python and a polynomial regression model to produce forecasting values by reading 500 backward steps of data and then generating the forecasted values.

Related Results

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Characterization and Transport Pathways of High PM2.5 Pollution Episodes During 2015–2021 in Tehran, Iran
Characterization and Transport Pathways of High PM2.5 Pollution Episodes During 2015–2021 in Tehran, Iran
Abstract Purpose High PM2.5 pollution episodes can affect entire regions around the world and have substantial impacts on climate, visibility, an...
Development of PM2.5 forecasting system in Seoul, South Korea using chemical transport modeling and ConvLSTM-DNN  
Development of PM2.5 forecasting system in Seoul, South Korea using chemical transport modeling and ConvLSTM-DNN  
<p>Ambient exposure to PM2.5 can adversely affect public health, and forecasting PM2.5 is essential for implementing protection measures in advance. Current PM2.5 for...
Pharmacological inhibition of PAI-1 alleviates cardiopulmonary pathologies induced by exposure to air pollutants PM2.5
Pharmacological inhibition of PAI-1 alleviates cardiopulmonary pathologies induced by exposure to air pollutants PM2.5
OBJECTIVEExposure to air pollutants leads to the development of pulmonary and cardiovascular diseases, and thus air pollution is one of the major global threats to human health. Ai...
The links of fine airborne particulate matter exposure to occurrence of cardiovascular and metabolic diseases in Michigan, USA
The links of fine airborne particulate matter exposure to occurrence of cardiovascular and metabolic diseases in Michigan, USA
Air pollutants, particularly airborne particulate matter with aerodynamic diameter < 2.5μm (PM2.5), have been linked to the increase in mortality and morbidity associated with c...
Effect of Acute PM2.5 Exposure on Lung Function in Children: A Systematic Review and Meta-Analysis
Effect of Acute PM2.5 Exposure on Lung Function in Children: A Systematic Review and Meta-Analysis
Abstract Background: Long-term exposures to particulate matter 2.5 (PM2.5) impairing lung function in children, but the effect of acute exposure has not been clarified. The...
Investigations of PM2.5 Long-term Exposure and Subclinical Atherosclerosis in Women: An Overview
Investigations of PM2.5 Long-term Exposure and Subclinical Atherosclerosis in Women: An Overview
Background/Aim Evidence demonstrates that coronary artery calcification (CAC) is a biological marker for cardiovascular disease (CVD) risk prediction. Acute cardiovascular effects ...

Back to Top