Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well...
Published in: | 2023 IEEE International Conference on Computing, ICOCO 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2 |
id |
2-s2.0-85184851013 |
---|---|
spelling |
2-s2.0-85184851013 Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B. Public Sentiment of Biodiversity Conservation Using Machine Learning Approach 2023 2023 IEEE International Conference on Computing, ICOCO 2023 10.1109/ICOCO59262.2023.10397822 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2 The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well-being is not commonly recognized. Even though regional and fragmented analysis has been done, a thorough analysis technique has been challenging to develop. One of the approaches to improve sentiment analysis is by using supervised machine learning classifiers which is able to learn and adapt to the nuances of language. Hence, this study aims to implement a comparative analysis of different machine learning methods namely Support Vector Machine, Logistic Regression and Naïve Bayes in order to determine the most efficient classifier for sentiment analysis on tweets discussing biodiversity and related topics. The result shows that Logistic Regression with Bag-of-Word feature extraction is the best-performing machine learning algorithm for the given biodiversity datasets with an accuracy of 75.35%. This study also highlights the importance of feature extraction in optimizing machine learning models. Bag-of-Word feature extraction slightly outperforms TF-IDF by increasing the accuracy of the Logistic Regression classifier. Besides, the performance of machine learning model also increases with the increase of sample size and the combination of Logistic Regression with Bag-of-Word technique results to the best performance where it achieved 72.2% of accuracy for 2500 tweets sample. Future research could explore using ensemble learning to combine multiple machine learning algorithms for domain-specific sentiment analysis on social media networks. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B. |
spellingShingle |
Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B. Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
author_facet |
Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B. |
author_sort |
Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B. |
title |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
title_short |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
title_full |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
title_fullStr |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
title_full_unstemmed |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
title_sort |
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach |
publishDate |
2023 |
container_title |
2023 IEEE International Conference on Computing, ICOCO 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICOCO59262.2023.10397822 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2 |
description |
The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well-being is not commonly recognized. Even though regional and fragmented analysis has been done, a thorough analysis technique has been challenging to develop. One of the approaches to improve sentiment analysis is by using supervised machine learning classifiers which is able to learn and adapt to the nuances of language. Hence, this study aims to implement a comparative analysis of different machine learning methods namely Support Vector Machine, Logistic Regression and Naïve Bayes in order to determine the most efficient classifier for sentiment analysis on tweets discussing biodiversity and related topics. The result shows that Logistic Regression with Bag-of-Word feature extraction is the best-performing machine learning algorithm for the given biodiversity datasets with an accuracy of 75.35%. This study also highlights the importance of feature extraction in optimizing machine learning models. Bag-of-Word feature extraction slightly outperforms TF-IDF by increasing the accuracy of the Logistic Regression classifier. Besides, the performance of machine learning model also increases with the increase of sample size and the combination of Logistic Regression with Bag-of-Word technique results to the best performance where it achieved 72.2% of accuracy for 2500 tweets sample. Future research could explore using ensemble learning to combine multiple machine learning algorithms for domain-specific sentiment analysis on social media networks. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677779802783744 |