Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic

In Malaysia, during the early stages of the COVID-19 pandemic, the negative impact on mental health became noticeable. The public's psychological and behavioral responses have risen as the COVID-19 outbreak progresses. A high impression of severity, vulnerability, impact, and fear was the eleme...

Full description

Bibliographic Details
Published in:International Journal of Advances in Intelligent Informatics
Main Author: Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
Format: Article
Language:English
Published: Universitas Ahmad Dahlan 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178263223&doi=10.26555%2fijain.v9i3.1367&partnerID=40&md5=3fe570e0c1a4769fe05ce372cd767c64
id 2-s2.0-85178263223
spelling 2-s2.0-85178263223
Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
2023
International Journal of Advances in Intelligent Informatics
9
3
10.26555/ijain.v9i3.1367
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178263223&doi=10.26555%2fijain.v9i3.1367&partnerID=40&md5=3fe570e0c1a4769fe05ce372cd767c64
In Malaysia, during the early stages of the COVID-19 pandemic, the negative impact on mental health became noticeable. The public's psychological and behavioral responses have risen as the COVID-19 outbreak progresses. A high impression of severity, vulnerability, impact, and fear was the element that influenced higher anxiety. Social media data can be used to track Malaysian sentiments in the COVID-19 era. However, it is often found on the internet in text format with no labels, and manually decoding this data is usually complicated. Furthermore, traditional data-gathering approaches, such as filling out a survey form, may not completely capture the sentiments. This study uses a text mining technique called Latent Dirichlet Allocation (LDA) on social media to discover mental health topics during the COVID-19 pandemic. Then, a model is developed using a hybrid approach, combining both lexicon-based and Naïve Bayes classifier. The accuracy, precision, recall, and F-measures are used to evaluate the sentiment classification. The result shows that the best lexicon-based technique is VADER with 72% accuracy compared to TextBlob with 70% accuracy. These sentiments results allow for a better understanding and handling of the pandemic. The top three topics are identified and further classified into positive and negative comments. In conclusion, the developed model can assist healthcare workers and policymakers in making the right decisions in the upcoming pandemic outbreaks. © 2023, Universitas Ahmad Dahlan. All rights reserved.
Universitas Ahmad Dahlan
24426571
English
Article
All Open Access; Gold Open Access
author Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
spellingShingle Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
author_facet Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
author_sort Khalid N.; Abdul-Rahman S.; Wibowo W.; Abdullah N.A.S.; Mutalib S.
title Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
title_short Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
title_full Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
title_fullStr Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
title_full_unstemmed Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
title_sort Leveraging social media data using latent dirichlet allocation and naïve bayes for mental health sentiment analytics on Covid-19 pandemic
publishDate 2023
container_title International Journal of Advances in Intelligent Informatics
container_volume 9
container_issue 3
doi_str_mv 10.26555/ijain.v9i3.1367
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178263223&doi=10.26555%2fijain.v9i3.1367&partnerID=40&md5=3fe570e0c1a4769fe05ce372cd767c64
description In Malaysia, during the early stages of the COVID-19 pandemic, the negative impact on mental health became noticeable. The public's psychological and behavioral responses have risen as the COVID-19 outbreak progresses. A high impression of severity, vulnerability, impact, and fear was the element that influenced higher anxiety. Social media data can be used to track Malaysian sentiments in the COVID-19 era. However, it is often found on the internet in text format with no labels, and manually decoding this data is usually complicated. Furthermore, traditional data-gathering approaches, such as filling out a survey form, may not completely capture the sentiments. This study uses a text mining technique called Latent Dirichlet Allocation (LDA) on social media to discover mental health topics during the COVID-19 pandemic. Then, a model is developed using a hybrid approach, combining both lexicon-based and Naïve Bayes classifier. The accuracy, precision, recall, and F-measures are used to evaluate the sentiment classification. The result shows that the best lexicon-based technique is VADER with 72% accuracy compared to TextBlob with 70% accuracy. These sentiments results allow for a better understanding and handling of the pandemic. The top three topics are identified and further classified into positive and negative comments. In conclusion, the developed model can assist healthcare workers and policymakers in making the right decisions in the upcoming pandemic outbreaks. © 2023, Universitas Ahmad Dahlan. All rights reserved.
publisher Universitas Ahmad Dahlan
issn 24426571
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809677579242700800