Public Sentiment of Biodiversity Conservation Using Machine Learning Approach

The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well...

Full description

Bibliographic Details
Published in:2023 IEEE International Conference on Computing, ICOCO 2023
Main Author: Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2
id 2-s2.0-85184851013
spelling 2-s2.0-85184851013
Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
2023
2023 IEEE International Conference on Computing, ICOCO 2023


10.1109/ICOCO59262.2023.10397822
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2
The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well-being is not commonly recognized. Even though regional and fragmented analysis has been done, a thorough analysis technique has been challenging to develop. One of the approaches to improve sentiment analysis is by using supervised machine learning classifiers which is able to learn and adapt to the nuances of language. Hence, this study aims to implement a comparative analysis of different machine learning methods namely Support Vector Machine, Logistic Regression and Naïve Bayes in order to determine the most efficient classifier for sentiment analysis on tweets discussing biodiversity and related topics. The result shows that Logistic Regression with Bag-of-Word feature extraction is the best-performing machine learning algorithm for the given biodiversity datasets with an accuracy of 75.35%. This study also highlights the importance of feature extraction in optimizing machine learning models. Bag-of-Word feature extraction slightly outperforms TF-IDF by increasing the accuracy of the Logistic Regression classifier. Besides, the performance of machine learning model also increases with the increase of sample size and the combination of Logistic Regression with Bag-of-Word technique results to the best performance where it achieved 72.2% of accuracy for 2500 tweets sample. Future research could explore using ensemble learning to combine multiple machine learning algorithms for domain-specific sentiment analysis on social media networks. © 2023 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
spellingShingle Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
author_facet Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
author_sort Zamani F.; Halim S.A.; Mutalib S.; Yusoff S.B.
title Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
title_short Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
title_full Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
title_fullStr Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
title_full_unstemmed Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
title_sort Public Sentiment of Biodiversity Conservation Using Machine Learning Approach
publishDate 2023
container_title 2023 IEEE International Conference on Computing, ICOCO 2023
container_volume
container_issue
doi_str_mv 10.1109/ICOCO59262.2023.10397822
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184851013&doi=10.1109%2fICOCO59262.2023.10397822&partnerID=40&md5=b75a0d1dbb1caeb35e9d39a3c3b442b2
description The ecosystem depends on biodiversity as it offers ecological services, helps lessen the effects of natural disasters, a source of economically essential goods, and has aesthetic and cultural benefits. However, the importance of biodiversity in delivering the necessities for human existence and well-being is not commonly recognized. Even though regional and fragmented analysis has been done, a thorough analysis technique has been challenging to develop. One of the approaches to improve sentiment analysis is by using supervised machine learning classifiers which is able to learn and adapt to the nuances of language. Hence, this study aims to implement a comparative analysis of different machine learning methods namely Support Vector Machine, Logistic Regression and Naïve Bayes in order to determine the most efficient classifier for sentiment analysis on tweets discussing biodiversity and related topics. The result shows that Logistic Regression with Bag-of-Word feature extraction is the best-performing machine learning algorithm for the given biodiversity datasets with an accuracy of 75.35%. This study also highlights the importance of feature extraction in optimizing machine learning models. Bag-of-Word feature extraction slightly outperforms TF-IDF by increasing the accuracy of the Logistic Regression classifier. Besides, the performance of machine learning model also increases with the increase of sample size and the combination of Logistic Regression with Bag-of-Word technique results to the best performance where it achieved 72.2% of accuracy for 2500 tweets sample. Future research could explore using ensemble learning to combine multiple machine learning algorithms for domain-specific sentiment analysis on social media networks. © 2023 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677779802783744