Water Quality Classification Using Machine Learning
Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water qua...
Published in: | 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1 |
id |
2-s2.0-85189937584 |
---|---|
spelling |
2-s2.0-85189937584 Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A. Water Quality Classification Using Machine Learning 2023 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 10.1109/ICRAIE59459.2023.10468171 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1 Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water quality based on the proposed machine learning tools. To fulfill that, a preliminary study was conducted by collecting related information in the research domain through articles, electronic books, and online databases. The data collection for the prototype's dataset was obtained from an electronic book published by the Pakistan Council of Research in Water Resources 2021. Subsequently, the data pre-processing phase was conducted by using WEKA software which includes the crucial steps to transform the data into a cleaner format and make the model more accurate. The model for each technique was developed using Python in Jupyter Notebook. The results of the accuracy score for each model were also conducted in this phase. The findings of this research show that the Decision Tree model performs excellently with an accuracy of 97.37% compared to the Support Vector Machine and K-Nearest Neighbour models, with an accuracy of 95.69% and 74.72%, respectively. Consequently, implementing a multi-class classification system can help future researchers classify more accurately and reduce the misclassification of water quality. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A. |
spellingShingle |
Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A. Water Quality Classification Using Machine Learning |
author_facet |
Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A. |
author_sort |
Arifin F.F.T.; Idrus Z.; Halim S.A.; Ahmarofi A.A.; Ahmad K.A. |
title |
Water Quality Classification Using Machine Learning |
title_short |
Water Quality Classification Using Machine Learning |
title_full |
Water Quality Classification Using Machine Learning |
title_fullStr |
Water Quality Classification Using Machine Learning |
title_full_unstemmed |
Water Quality Classification Using Machine Learning |
title_sort |
Water Quality Classification Using Machine Learning |
publishDate |
2023 |
container_title |
8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICRAIE59459.2023.10468171 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189937584&doi=10.1109%2fICRAIE59459.2023.10468171&partnerID=40&md5=0ca6e7c23223107809e4b13ec8c4c4e1 |
description |
Water quality is crucial as it directly affects the ecosystem and human health. However, current water quality classification methods are inefficient because they do not compare prediction accuracy between machine learning methods. In this regard, the objective of this study is to classify water quality based on the proposed machine learning tools. To fulfill that, a preliminary study was conducted by collecting related information in the research domain through articles, electronic books, and online databases. The data collection for the prototype's dataset was obtained from an electronic book published by the Pakistan Council of Research in Water Resources 2021. Subsequently, the data pre-processing phase was conducted by using WEKA software which includes the crucial steps to transform the data into a cleaner format and make the model more accurate. The model for each technique was developed using Python in Jupyter Notebook. The results of the accuracy score for each model were also conducted in this phase. The findings of this research show that the Decision Tree model performs excellently with an accuracy of 97.37% compared to the Support Vector Machine and K-Nearest Neighbour models, with an accuracy of 95.69% and 74.72%, respectively. Consequently, implementing a multi-class classification system can help future researchers classify more accurately and reduce the misclassification of water quality. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677779283738624 |