Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches

The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from...

Full description

Bibliographic Details
Published in:	8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023
Main Author:	Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A.
Format:	Conference paper
Language:	English
Published:	Institute of Electrical and Electronics Engineers Inc. 2023
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7

id	2-s2.0-85189939276
spelling	2-s2.0-85189939276 Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches 2023 8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023 10.1109/ICRAIE59459.2023.10468126 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7 The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from Codalab, comprising 8,560 tweets, with 4,480 labelled as real and 4,080 as fake. The research explores the effectiveness of different machine learning models, including logistic regression (LR), random forest (RF), and deep learning models such as Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (Bi-LSTM). In addition to model comparison, experiments were conducted to analyze the impact of different data splits (70:30, 80:20, and 90:10), batch sizes (16, 32, and 64), and the number of epochs (5, 10, and 15) on model performance. The experiments provided insights into the optimal configurations for the models. The results showcase the model's capabilities, with high accuracy achieved across the different models. Specifically, logistic regression achieved an accuracy of 92%, random forest 91%, Bi-LSTM 93%, and CNN 95%. These findings highlight the potential of deep learning models, particularly CNN, in accurately detecting fake news from COVID-19-related tweets. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper
author	Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A.
spellingShingle	Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A. Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
author_facet	Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A.
author_sort	Hussin M.H.; Mahmud Y.; Mohd Hanafiah N.I.; Azma Nasruddin Z.; Mohd Ariffin N.H.; Ince M.; Senel F.A.
title	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
title_short	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
title_full	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
title_fullStr	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
title_full_unstemmed	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
title_sort	Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches
publishDate	2023
container_title	8th International Conference on Recent Advances and Innovations in Engineering: Empowering Computing, Analytics, and Engineering Through Digital Innovation, ICRAIE 2023
container_volume
container_issue
doi_str_mv	10.1109/ICRAIE59459.2023.10468126
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189939276&doi=10.1109%2fICRAIE59459.2023.10468126&partnerID=40&md5=e7fcbe2274590bd27d72d1312dd144d7
description	The pervasiveness of misinformation surrounding the COVID-19 pandemic has garnered heightened attention due to its implications, as a noteworthy proportion of the populace is being exposed to spurious and unsubstantiated narratives concerning the crisis. This research utilizes a dataset sourced from Codalab, comprising 8,560 tweets, with 4,480 labelled as real and 4,080 as fake. The research explores the effectiveness of different machine learning models, including logistic regression (LR), random forest (RF), and deep learning models such as Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (Bi-LSTM). In addition to model comparison, experiments were conducted to analyze the impact of different data splits (70:30, 80:20, and 90:10), batch sizes (16, 32, and 64), and the number of epochs (5, 10, and 15) on model performance. The experiments provided insights into the optimal configurations for the models. The results showcase the model's capabilities, with high accuracy achieved across the different models. Specifically, logistic regression achieved an accuracy of 92%, random forest 91%, Bi-LSTM 93%, and CNN 95%. These findings highlight the potential of deep learning models, particularly CNN, in accurately detecting fake news from COVID-19-related tweets. © 2023 IEEE.
publisher	Institute of Electrical and Electronics Engineers Inc.
issn
language	English
format	Conference paper
accesstype
record_format	scopus
collection	Scopus
_version_	1820775448275058688

Fake News Detection Regarding COVID-19 Tweets Using Machine Learning Approaches

Similar Items