A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model
Twitter is one of the famous social media platforms for people to share their stories and opinions on any situations, such as the COVID-19 pandemic. With the indirect influence of tweets on users and the rise in cases of COVID-19 in Malaysia, it is important to monitor information related to the pan...
Published in: | 2021 IEEE 11th International Conference on System Engineering and Technology, ICSET 2021 - Proceedings |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2021
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85123368779&doi=10.1109%2fICSET53708.2021.9612527&partnerID=40&md5=2499ca2d1ebad930b80d633b9bfaaf33 |
id |
2-s2.0-85123368779 |
---|---|
spelling |
2-s2.0-85123368779 Aminuddin R.; Bistamam M.A.; Ibrahim S.; Abu Mangshor N.N.; Fesol S.F.A.; Wahab N.I.F.A. A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model 2021 2021 IEEE 11th International Conference on System Engineering and Technology, ICSET 2021 - Proceedings 10.1109/ICSET53708.2021.9612527 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85123368779&doi=10.1109%2fICSET53708.2021.9612527&partnerID=40&md5=2499ca2d1ebad930b80d633b9bfaaf33 Twitter is one of the famous social media platforms for people to share their stories and opinions on any situations, such as the COVID-19 pandemic. With the indirect influence of tweets on users and the rise in cases of COVID-19 in Malaysia, it is important to monitor information related to the pandemic in order to avoid misinformation, panic, or confusion among public. As the data from tweets are also one of the useful raw data sources that can be used for data visualization, this project aims to design and develop a web-based system for visualizing the status of pandemic in Malaysia based on the data collected from Twitter. There are four phases in the methodology of this project: (i) Planning, (ii) Analysis, (iii) Design and Development, and (iv) Testing and Documentation. In the planning and analysis phases, the data will be collected from March 2020 to March 2021 and will be filtered by using keywords and hashtags, such as #COVID19 and #Coronavirus, as well as the location tagged on the tweets. The collected data then will be pre-processed to remove any unwanted texts. The classification of the data is based on sentiment analysis using one of machine learning models that is Support Vector Machine (SVM). The performance of the classification model will be evaluated using the evaluation model: (i) accuracy, (ii) recall, (iii) precision, and (iv) F1-measure. The final output of this project is the data visualization of the sentiment analysis on COVID-19 in Malaysia based on two of its major cities: Kuala Lumpur and Klang. © 2021 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Aminuddin R.; Bistamam M.A.; Ibrahim S.; Abu Mangshor N.N.; Fesol S.F.A.; Wahab N.I.F.A. |
spellingShingle |
Aminuddin R.; Bistamam M.A.; Ibrahim S.; Abu Mangshor N.N.; Fesol S.F.A.; Wahab N.I.F.A. A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
author_facet |
Aminuddin R.; Bistamam M.A.; Ibrahim S.; Abu Mangshor N.N.; Fesol S.F.A.; Wahab N.I.F.A. |
author_sort |
Aminuddin R.; Bistamam M.A.; Ibrahim S.; Abu Mangshor N.N.; Fesol S.F.A.; Wahab N.I.F.A. |
title |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
title_short |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
title_full |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
title_fullStr |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
title_full_unstemmed |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
title_sort |
A Sentiment Analysis Framework on COVID-19 in Major Cities of Malaysia based on Tweets using Machine Learning Classification Model |
publishDate |
2021 |
container_title |
2021 IEEE 11th International Conference on System Engineering and Technology, ICSET 2021 - Proceedings |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICSET53708.2021.9612527 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85123368779&doi=10.1109%2fICSET53708.2021.9612527&partnerID=40&md5=2499ca2d1ebad930b80d633b9bfaaf33 |
description |
Twitter is one of the famous social media platforms for people to share their stories and opinions on any situations, such as the COVID-19 pandemic. With the indirect influence of tweets on users and the rise in cases of COVID-19 in Malaysia, it is important to monitor information related to the pandemic in order to avoid misinformation, panic, or confusion among public. As the data from tweets are also one of the useful raw data sources that can be used for data visualization, this project aims to design and develop a web-based system for visualizing the status of pandemic in Malaysia based on the data collected from Twitter. There are four phases in the methodology of this project: (i) Planning, (ii) Analysis, (iii) Design and Development, and (iv) Testing and Documentation. In the planning and analysis phases, the data will be collected from March 2020 to March 2021 and will be filtered by using keywords and hashtags, such as #COVID19 and #Coronavirus, as well as the location tagged on the tweets. The collected data then will be pre-processed to remove any unwanted texts. The classification of the data is based on sentiment analysis using one of machine learning models that is Support Vector Machine (SVM). The performance of the classification model will be evaluated using the evaluation model: (i) accuracy, (ii) recall, (iii) precision, and (iv) F1-measure. The final output of this project is the data visualization of the sentiment analysis on COVID-19 in Malaysia based on two of its major cities: Kuala Lumpur and Klang. © 2021 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678027748016128 |