Autonomous anomaly detection for streaming data

Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies a...

Full description

Bibliographic Details
Published in:Knowledge-Based Systems
Main Author: Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
Format: Article
Language:English
Published: Elsevier B.V. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178997887&doi=10.1016%2fj.knosys.2023.111235&partnerID=40&md5=66f0acd2d0fdb446021eb5e1d4bab41f
id 2-s2.0-85178997887
spelling 2-s2.0-85178997887
Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
Autonomous anomaly detection for streaming data
2024
Knowledge-Based Systems
284

10.1016/j.knosys.2023.111235
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178997887&doi=10.1016%2fj.knosys.2023.111235&partnerID=40&md5=66f0acd2d0fdb446021eb5e1d4bab41f
Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams. © 2023 Elsevier B.V.
Elsevier B.V.
9507051
English
Article

author Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
spellingShingle Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
Autonomous anomaly detection for streaming data
author_facet Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
author_sort Iqbal Basheer M.Y.; Mohd Ali A.; Abdul Hamid N.H.; Mohd Ariffin M.A.; Osman R.; Nordin S.; Gu X.
title Autonomous anomaly detection for streaming data
title_short Autonomous anomaly detection for streaming data
title_full Autonomous anomaly detection for streaming data
title_fullStr Autonomous anomaly detection for streaming data
title_full_unstemmed Autonomous anomaly detection for streaming data
title_sort Autonomous anomaly detection for streaming data
publishDate 2024
container_title Knowledge-Based Systems
container_volume 284
container_issue
doi_str_mv 10.1016/j.knosys.2023.111235
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178997887&doi=10.1016%2fj.knosys.2023.111235&partnerID=40&md5=66f0acd2d0fdb446021eb5e1d4bab41f
description Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams. © 2023 Elsevier B.V.
publisher Elsevier B.V.
issn 9507051
language English
format Article
accesstype
record_format scopus
collection Scopus
_version_ 1809677574296567808