XGBoost Based Machine Learning Techniques for Water Quality Prediction

Water is important to all living beings and is essential to our daily lives. It is used for drinking, irrigation, and manufacturing. But, as globalisation increased, water lost its purity, and scientific procedures are now employed to test water for its quality. In this study, a prediction model was...

Full description

Saved in:
Bibliographic Details
Published in:2023 International Conference on Circuit Power and Computing Technologies (ICCPCT) pp. 364 - 369
Main Authors: Roshan, S. Patric, Edwin, E. Bijolin, Thanka, M. Roshni, Ebenezer, V., Joy, R. Priscilla, S, Stewart Kirubakaran
Format: Conference Proceeding
Language:English
Published: IEEE 10-08-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Water is important to all living beings and is essential to our daily lives. It is used for drinking, irrigation, and manufacturing. But, as globalisation increased, water lost its purity, and scientific procedures are now employed to test water for its quality. In this study, a prediction model was developed to quantify quality and state of the water using machine learning techniques. XGBoost, Naive Bayes, and Support Vector Machine (SVM) are water quality prediction methods. The dataset was initially obtained from Kaggle, and it was pre-processed to eliminate null values and outliers. The Water Quality Index (WQI) value was calculated using an arithmetic approach that employs index weights of parameters, and the water was labelled based on its WQI value. Before feeding the imbalanced dataset into the machine learning model for training and testing, the Synthetic Minority Over-sampling Technique (SMOTE) will be used to balance it with a WQI value and its class. The prediction will be conducted by feeding the most accurate parameters into the model, and the model's output will be the water quality classification and consumption.
DOI:10.1109/ICCPCT58313.2023.10244964