Hybrid FCMG-OP-FIS model approach to convert regression into classification data for machine learning-based AQI prediction

Bibliographic Details
Published in: Heliyon, Vol. 10, no. 21, p. e39759
Main Authors: Ordenshiya, K.M., Revathi, G.K.
Format: Journal Article
Language: English
Published: Elsevier Ltd 15-11-2024
Elsevier
Description
Summary: Air pollution from vehicle emissions, industrial activities, and medical facilities poses significant health risks in urban areas, underscoring the need for robust air quality index (AQI) monitoring. This paper presents a novel method for AQI prediction that integrates a fuzzy centre merge graph with an optimal value-based fuzzy inference system (FCMG-OP-FIS) and machine learning (ML). Traditional ML techniques encounter difficulties when converting regression datasets into classification formats, particularly when the dataset cannot be labelled using the traditional method. The proposed FCMG-OP-FIS model efficiently converts regression data into a classification framework. Unlike traditional AQI prediction methods that rely solely on pollutant data, this approach incorporates both pollutant and meteorological data to improve prediction accuracy. The fuzzy centre merge graph (FCMG) balances the dataset for optimal solutions and facilitates input grouping for Simulink, simplifying rule management. The FCMG-OP-FIS model generates a regression output for AQI, which is subsequently classified into levels (healthy, moderate, or unhealthy) using IF-THEN rules. To enhance accuracy further, a random forest classifier (RFC) is trained on the FCMG-OP-FIS classified output data. The regression output of the FCMG-OP-FIS model is validated using RMSE (0.48), MSE (0.23), MAE (0.23), and MAPE (1.77%). The classification output from the RFC model is validated using stratified shuffle validation, grid search cross-validation, and confusion matrix analysis, achieving an accuracy of 99%, with the F1 score, precision, and recall each at 99% overall. These results demonstrate the effectiveness of the proposed model in accurately labelling data for classification and predicting AQI through ML, highlighting its potential for practical application in environmental monitoring and management.
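For readers unfamiliar with the four regression metrics quoted in the summary, the following minimal Python sketch computes them from their standard definitions. It is an illustration only: the function name `regression_metrics` and the sample AQI values are hypothetical and are not taken from the paper.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE and MAPE (in percent) for two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    errors = [t - p for t, p in zip(y_true, y_pred)]
    n = len(errors)
    mse = sum(e * e for e in errors) / n          # mean squared error
    mae = sum(abs(e) for e in errors) / n         # mean absolute error
    # MAPE divides by the true value, so it is undefined if any true value is zero.
    mape = 100.0 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae, "mape": mape}

# Hypothetical observed and predicted AQI values, for illustration only.
observed = [50.0, 100.0, 150.0, 200.0]
predicted = [49.0, 102.0, 150.0, 198.0]
metrics = regression_metrics(observed, predicted)
```

Because RMSE is the square root of MSE, the two reported values are consistent with each other (0.48 squared is approximately 0.23).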
•The FCMG model simplifies the process of achieving optimal solutions by managing the complexities of unbalanced datasets and resizing the dataset. This approach also facilitates the merging of pollutant and meteorological data.
•The FCMG-OP-FIS model fine-tunes the parameters of the FIS to enhance output accuracy and facilitates the conversion of the regression dataset into a classification dataset. The resulting RFC model, built on this classification dataset, demonstrates exceptional performance in AQI prediction.
•The regression output of the FCMG-OP-FIS model is validated with RMSE, MSE, MAE, and MAPE, while the classification output is validated through stratified shuffle validation, grid search cross-validation, and confusion matrix analysis, yielding high accuracy, F1 score, precision, and recall.
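The labelling and confusion-matrix steps mentioned above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the authors' implementation: the threshold values in `label_aqi` are hypothetical (the record does not give the paper's rule boundaries), no fuzzy inference or random forest is involved, and the sample values are invented. It only shows how IF-THEN rules turn a regression value into a class label and how precision, recall, and F1 follow from the resulting predictions.

```python
from collections import Counter

LEVELS = ("healthy", "moderate", "unhealthy")

def label_aqi(aqi, moderate_from=51.0, unhealthy_from=101.0):
    """Map a numeric AQI value to a level with IF-THEN rules.

    The two thresholds are hypothetical placeholders, not the paper's values.
    """
    if aqi < moderate_from:
        return "healthy"
    if aqi < unhealthy_from:
        return "moderate"
    return "unhealthy"

def per_class_scores(y_true, y_pred):
    """Return {level: (precision, recall, f1)} from paired true/predicted labels."""
    pairs = Counter(zip(y_true, y_pred))  # confusion-matrix cells as counts
    scores = {}
    for level in LEVELS:
        tp = pairs[(level, level)]
        fp = sum(c for (t, p), c in pairs.items() if p == level and t != level)
        fn = sum(c for (t, p), c in pairs.items() if t == level and p != level)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        scores[level] = (precision, recall, f1)
    return scores

# Hypothetical reference and predicted AQI values, for illustration only.
true_levels = [label_aqi(v) for v in (30.0, 75.0, 120.0, 60.0)]
pred_levels = [label_aqi(v) for v in (35.0, 45.0, 130.0, 70.0)]
scores = per_class_scores(true_levels, pred_levels)
```

In this toy example one "moderate" sample is predicted as "healthy", which lowers the precision of the "healthy" class and the recall of the "moderate" class while leaving "unhealthy" unaffected.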
ISSN: 2405-8440
DOI: 10.1016/j.heliyon.2024.e39759