Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from...
Published in: | 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7 |
id |
2-s2.0-85189939276 |
---|---|
spelling |
2-s2.0-85189939276 Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches 2023 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 10.1109/ICRAIE59459.2023.10468126 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7 The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from Codalab, comprising 8,560 tweets, with 4,480 labelled as real and 4,080 as fake. The research explores the effectiveness of different machine learning models, including logistic regression (LR), random forest (RF), and deep learning models such as Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (Bi-LSTM). In addition to model comparison, experiments were conducted to analyze the impact of different data splits (70:30, 80:20, and 90:10), batch sizes (16, 32, and 64), and the number of epochs (5, 10, and 15) on model performance. The experiments provided insights into the optimal configurations for the models. The results showcase the model's capabilities, with high accuracy achieved across the different models. Specifically, logistic regression achieved an accuracy of 92%, random forest 91%, Bi-LSTM 93%, and CNN 95%. These findings highlight the potential of deep learning models, particularly CNN, in accurately detecting fake news from COVID-19-related tweets. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. |
spellingShingle |
Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
author_facet |
Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. |
author_sort |
Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. |
title |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
title_short |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
title_full |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
title_fullStr |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
title_full_unstemmed |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
title_sort |
Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches |
publishDate |
2023 |
container_title |
8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICRAIE59459.2023.10468126 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7 |
description |
The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from Codalab, comprising 8,560 tweets, with 4,480 labelled as real and 4,080 as fake. The research explores the effectiveness of different machine learning models, including logistic regression (LR), random forest (RF), and deep learning models such as Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (Bi-LSTM). In addition to model comparison, experiments were conducted to analyze the impact of different data splits (70:30, 80:20, and 90:10), batch sizes (16, 32, and 64), and the number of epochs (5, 10, and 15) on model performance. The experiments provided insights into the optimal configurations for the models. The results showcase the model's capabilities, with high accuracy achieved across the different models. Specifically, logistic regression achieved an accuracy of 92%, random forest 91%, Bi-LSTM 93%, and CNN 95%. These findings highlight the potential of deep learning models, particularly CNN, in accurately detecting fake news from COVID-19-related tweets. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1820775448275058688 |