Autonomous anomaly detection for streaming data

Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies a...

Full description

Bibliographic Details
Published in:KNOWLEDGE-BASED SYSTEMS
Main Authors: Basheer, Muhammad Yunus Iqbal; Ali, Azliza Mohd; Hamid, Nurzeatul Hamimah Abdul; Ariffin, Muhammad Azizi Mohd; Osman, Rozianawaty; Nordin, Sharifalillah; Gu, Xiaowei
Format: Article; Early Access
Language:English
Published: ELSEVIER 2024
Subjects:
Online Access:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001
author Basheer
Muhammad Yunus Iqbal; Ali
Azliza Mohd; Hamid
Nurzeatul Hamimah Abdul; Ariffin
Muhammad Azizi Mohd; Osman
Rozianawaty; Nordin
Sharifalillah; Gu
Xiaowei
spellingShingle Basheer
Muhammad Yunus Iqbal; Ali
Azliza Mohd; Hamid
Nurzeatul Hamimah Abdul; Ariffin
Muhammad Azizi Mohd; Osman
Rozianawaty; Nordin
Sharifalillah; Gu
Xiaowei
Autonomous anomaly detection for streaming data
Computer Science
author_facet Basheer
Muhammad Yunus Iqbal; Ali
Azliza Mohd; Hamid
Nurzeatul Hamimah Abdul; Ariffin
Muhammad Azizi Mohd; Osman
Rozianawaty; Nordin
Sharifalillah; Gu
Xiaowei
author_sort Basheer
spelling Basheer, Muhammad Yunus Iqbal; Ali, Azliza Mohd; Hamid, Nurzeatul Hamimah Abdul; Ariffin, Muhammad Azizi Mohd; Osman, Rozianawaty; Nordin, Sharifalillah; Gu, Xiaowei
Autonomous anomaly detection for streaming data
KNOWLEDGE-BASED SYSTEMS
English
Article; Early Access
Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams.
ELSEVIER
0950-7051
1872-7409
2024
284

10.1016/j.knosys.2023.111235
Computer Science

WOS:001133574000001
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001
title Autonomous anomaly detection for streaming data
title_short Autonomous anomaly detection for streaming data
title_full Autonomous anomaly detection for streaming data
title_fullStr Autonomous anomaly detection for streaming data
title_full_unstemmed Autonomous anomaly detection for streaming data
title_sort Autonomous anomaly detection for streaming data
container_title KNOWLEDGE-BASED SYSTEMS
language English
format Article; Early Access
description Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams.
publisher ELSEVIER
issn 0950-7051
1872-7409
publishDate 2024
container_volume 284
container_issue
doi_str_mv 10.1016/j.knosys.2023.111235
topic Computer Science
topic_facet Computer Science
accesstype
id WOS:001133574000001
url https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001
record_format wos
collection Web of Science (WoS)
_version_ 1791586756281237504