Time series-based PM2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models

Bibliographic Details
Published in:Heliyon Vol. 8; no. 9; p. e10691
Main Authors: Ma, Xin, Chen, Tengfei, Ge, Rubing, Cui, Caocao, Xu, Fan, Lv, Qi
Format: Journal Article
Language:English
Published: Elsevier Ltd 01-09-2022
Description
Summary:All countries encounter air pollution problems along their development path. As a significant indicator of air quality, PM2.5 concentration has long been shown to affect the population’s death rate. Machine learning algorithms, which have been shown to outperform traditional statistical approaches, are widely used in air pollution prediction. However, research on model selection and on the environmental interpretation of model predictions is still scarce, and it is urgently needed to guide policy making on air pollution control. Our research compared four machine learning algorithms (LinearSVR, K-Nearest Neighbor, Lasso regression, and Gradient Boosting) by examining their performance in predicting PM2.5 concentrations across different cities and seasons. The results show that the machine learning models can forecast the next-day PM2.5 concentration from the previous five days' data with better accuracy. The comparative experiments show that, at the city level, the Gradient Boosting model has the best prediction performance, with a mean absolute error (MAE) of 9 µg/m³ and a root mean square error (RMSE) of 10.25–16.76 µg/m³, both lower than those of the other three models; at the season level, all four models perform best in winter and worst in summer. More importantly, the differences in model performance across cities and seasons have significant implications for environmental policy.
Keywords:Jing-Jin-Ji city group; PM2.5 prediction; Lasso regression; Gradient boosting; Linear SVR; K-Nearest Neighbor.
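The forecasting setup described in the summary (previous five days in, next day out, scored by MAE and RMSE) can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the function names, the sample values, and the persistence baseline are all assumptions introduced here, and the baseline is only a stand-in for the four models named in the record.

```python
import math


def make_lag_windows(series, n_lags=5):
    """Pair each day's value with the n_lags values that precede it."""
    X, y = [], []
    for t in range(n_lags, len(series)):
        X.append(series[t - n_lags:t])  # previous n_lags days
        y.append(series[t])             # next-day target
    return X, y


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(
        sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)
    )


# Made-up daily PM2.5 values in µg/m³, for illustration only.
pm25 = [35.0, 42.0, 58.0, 61.0, 47.0, 39.0, 44.0, 70.0, 66.0, 52.0]
X, y = make_lag_windows(pm25, n_lags=5)

# Hypothetical persistence baseline: tomorrow equals the last observed day.
# It is NOT one of the four models compared in the article.
y_pred = [window[-1] for window in X]

print("MAE:", mae(y, y_pred), "RMSE:", rmse(y, y_pred))
```

Under the same assumptions, the windows in `X` and targets in `y` could be passed to any regressor with a fit/predict interface, for example scikit-learn's `LinearSVR`, `KNeighborsRegressor`, `Lasso`, or `GradientBoostingRegressor`; whether the authors used those exact classes is not stated in this record.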
Xin Ma and Tengfei Chen contributed equally to this work.
ISSN:2405-8440
DOI:10.1016/j.heliyon.2022.e10691