Water Quality Classification Using Machine Learning

Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water qua...

Full description

Bibliographic Details
Published in:8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023
Main Author: Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1
Description
Summary:Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water quality based on the proposed machine learning tools. To fulfill that, a preliminary study was conducted by collecting related information in the research domain through articles, electronic books, and online databases. The data collection for the prototype's dataset was obtained from an electronic book published by the Pakistan Council of Research in Water Resources 2021. Subsequently, the data pre-processing phase was conducted by using WEKA software which includes the crucial steps to transform the data into a cleaner format and make the model more accurate. The model for each technique was developed using Python in Jupyter Notebook. The results of the accuracy score for each model were also conducted in this phase. The findings of this research show that the Decision Tree model performs excellently with an accuracy of 97.37% compared to the Support Vector Machine and K-Nearest Neighbour models, with an accuracy of 95.69% and 74.72%, respectively. Consequently, implementing a multi-class classification system can help future researchers classify more accurately and reduce the misclassification of water quality. © 2023 IEEE.
ISSN:
DOI:10.1109/ICRAIE59459.2023.10468171