Development of a machine learning algorithm for fake news detection

With the extensive technological advancements and expansion, the persistent issues regarding the creation and rapid dissemination of fake news have become a prevalent and recurrent concern. The manipulation of news content has critical repercussions, such as causing public mistrust, fear, harm, and...

Full description

Bibliographic Details
Published in:Indonesian Journal of Electrical Engineering and Computer Science
Main Author: Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198090562&doi=10.11591%2fijeecs.v35.i3.pp1732-1743&partnerID=40&md5=4dbf62275f0cea4e88fae99a323e0641
id 2-s2.0-85198090562
spelling 2-s2.0-85198090562
Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
Development of a machine learning algorithm for fake news detection
2024
Indonesian Journal of Electrical Engineering and Computer Science
35
3
10.11591/ijeecs.v35.i3.pp1732-1743
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198090562&doi=10.11591%2fijeecs.v35.i3.pp1732-1743&partnerID=40&md5=4dbf62275f0cea4e88fae99a323e0641
With the extensive technological advancements and expansion, the persistent issues regarding the creation and rapid dissemination of fake news have become a prevalent and recurrent concern. The manipulation of news content has critical repercussions, such as causing public mistrust, fear, harm, and misinformation. Addressing that, this study developed a supervised machine learning algorithm that can accurately classify social media data as fake news. The methodology of the proposed fake news detection model involved five main components: data acquisition from Twitter, data preprocessing, data transformation, model development using Naïve Bayes, decision tree, and support vector machine (SVM) and model evaluation using accuracy, precision, recall and F1-score. The results revealed that decision tree recorded the highest accuracy for both textual data (100%) and metadata (94.54%) and consistently outperformed both Naïve Bayes and SVM in terms of precision, recall, and F1-score metrics, with a score of 100% for the classification of textual data-based datasets. Regarding the metadata-based classification, decision tree also demonstrated excellent performance, with the highest F1-score of 94% for fake news data. Meanwhile, SVM exhibited the highest precision and recall performance for the metadata-based classification. Overall, the application of the decision tree classifier was deemed the most effective in Twitter fake news detection. © 2024 Institute of Advanced Engineering and Science. All rights reserved.
Institute of Advanced Engineering and Science
25024752
English
Article
All Open Access; Gold Open Access
author Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
spellingShingle Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
Development of a machine learning algorithm for fake news detection
author_facet Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
author_sort Abdullah N.A.S.; Rusli N.I.A.; Yuslee N.S.
title Development of a machine learning algorithm for fake news detection
title_short Development of a machine learning algorithm for fake news detection
title_full Development of a machine learning algorithm for fake news detection
title_fullStr Development of a machine learning algorithm for fake news detection
title_full_unstemmed Development of a machine learning algorithm for fake news detection
title_sort Development of a machine learning algorithm for fake news detection
publishDate 2024
container_title Indonesian Journal of Electrical Engineering and Computer Science
container_volume 35
container_issue 3
doi_str_mv 10.11591/ijeecs.v35.i3.pp1732-1743
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198090562&doi=10.11591%2fijeecs.v35.i3.pp1732-1743&partnerID=40&md5=4dbf62275f0cea4e88fae99a323e0641
description With the extensive technological advancements and expansion, the persistent issues regarding the creation and rapid dissemination of fake news have become a prevalent and recurrent concern. The manipulation of news content has critical repercussions, such as causing public mistrust, fear, harm, and misinformation. Addressing that, this study developed a supervised machine learning algorithm that can accurately classify social media data as fake news. The methodology of the proposed fake news detection model involved five main components: data acquisition from Twitter, data preprocessing, data transformation, model development using Naïve Bayes, decision tree, and support vector machine (SVM) and model evaluation using accuracy, precision, recall and F1-score. The results revealed that decision tree recorded the highest accuracy for both textual data (100%) and metadata (94.54%) and consistently outperformed both Naïve Bayes and SVM in terms of precision, recall, and F1-score metrics, with a score of 100% for the classification of textual data-based datasets. Regarding the metadata-based classification, decision tree also demonstrated excellent performance, with the highest F1-score of 94% for fake news data. Meanwhile, SVM exhibited the highest precision and recall performance for the metadata-based classification. Overall, the application of the decision tree classifier was deemed the most effective in Twitter fake news detection. © 2024 Institute of Advanced Engineering and Science. All rights reserved.
publisher Institute of Advanced Engineering and Science
issn 25024752
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1814778498003238912