Water Quality Classification Using Machine Learning

Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water qua...

Full description

Bibliographic Details
Published in:8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023
Main Author: Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1
id 2-s2.0-85189937584
spelling 2-s2.0-85189937584
Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
Water Quality Classification Using Machine Learning
2023
8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023


10.1109/ICRAIE59459.2023.10468171
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1
Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water quality based on the proposed machine learning tools. To fulfill that, a preliminary study was conducted by collecting related information in the research domain through articles, electronic books, and online databases. The data collection for the prototype's dataset was obtained from an electronic book published by the Pakistan Council of Research in Water Resources 2021. Subsequently, the data pre-processing phase was conducted by using WEKA software which includes the crucial steps to transform the data into a cleaner format and make the model more accurate. The model for each technique was developed using Python in Jupyter Notebook. The results of the accuracy score for each model were also conducted in this phase. The findings of this research show that the Decision Tree model performs excellently with an accuracy of 97.37% compared to the Support Vector Machine and K-Nearest Neighbour models, with an accuracy of 95.69% and 74.72%, respectively. Consequently, implementing a multi-class classification system can help future researchers classify more accurately and reduce the misclassification of water quality. © 2023 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
spellingShingle Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
Water Quality Classification Using Machine Learning
author_facet Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
author_sort Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A.
title Water Quality Classification Using Machine Learning
title_short Water Quality Classification Using Machine Learning
title_full Water Quality Classification Using Machine Learning
title_fullStr Water Quality Classification Using Machine Learning
title_full_unstemmed Water Quality Classification Using Machine Learning
title_sort Water Quality Classification Using Machine Learning
publishDate 2023
container_title 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023
container_volume
container_issue
doi_str_mv 10.1109/ICRAIE59459.2023.10468171
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1
description Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water quality based on the proposed machine learning tools. To fulfill that, a preliminary study was conducted by collecting related information in the research domain through articles, electronic books, and online databases. The data collection for the prototype's dataset was obtained from an electronic book published by the Pakistan Council of Research in Water Resources 2021. Subsequently, the data pre-processing phase was conducted by using WEKA software which includes the crucial steps to transform the data into a cleaner format and make the model more accurate. The model for each technique was developed using Python in Jupyter Notebook. The results of the accuracy score for each model were also conducted in this phase. The findings of this research show that the Decision Tree model performs excellently with an accuracy of 97.37% compared to the Support Vector Machine and K-Nearest Neighbour models, with an accuracy of 95.69% and 74.72%, respectively. Consequently, implementing a multi-class classification system can help future researchers classify more accurately and reduce the misclassification of water quality. © 2023 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677779283738624