Autonomous anomaly detection for streaming data
Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies a...
Published in: | KNOWLEDGE-BASED SYSTEMS |
---|---|
Main Authors: | , , , , , , , |
Format: | Article; Early Access |
Language: | English |
Published: |
ELSEVIER
2024
|
Subjects: | |
Online Access: | https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001 |
author |
Basheer Muhammad Yunus Iqbal; Ali Azliza Mohd; Hamid Nurzeatul Hamimah Abdul; Ariffin Muhammad Azizi Mohd; Osman Rozianawaty; Nordin Sharifalillah; Gu Xiaowei |
---|---|
spellingShingle |
Basheer Muhammad Yunus Iqbal; Ali Azliza Mohd; Hamid Nurzeatul Hamimah Abdul; Ariffin Muhammad Azizi Mohd; Osman Rozianawaty; Nordin Sharifalillah; Gu Xiaowei Autonomous anomaly detection for streaming data Computer Science |
author_facet |
Basheer Muhammad Yunus Iqbal; Ali Azliza Mohd; Hamid Nurzeatul Hamimah Abdul; Ariffin Muhammad Azizi Mohd; Osman Rozianawaty; Nordin Sharifalillah; Gu Xiaowei |
author_sort |
Basheer |
spelling |
Basheer, Muhammad Yunus Iqbal; Ali, Azliza Mohd; Hamid, Nurzeatul Hamimah Abdul; Ariffin, Muhammad Azizi Mohd; Osman, Rozianawaty; Nordin, Sharifalillah; Gu, Xiaowei Autonomous anomaly detection for streaming data KNOWLEDGE-BASED SYSTEMS English Article; Early Access Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams. ELSEVIER 0950-7051 1872-7409 2024 284 10.1016/j.knosys.2023.111235 Computer Science WOS:001133574000001 https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001 |
title |
Autonomous anomaly detection for streaming data |
title_short |
Autonomous anomaly detection for streaming data |
title_full |
Autonomous anomaly detection for streaming data |
title_fullStr |
Autonomous anomaly detection for streaming data |
title_full_unstemmed |
Autonomous anomaly detection for streaming data |
title_sort |
Autonomous anomaly detection for streaming data |
container_title |
KNOWLEDGE-BASED SYSTEMS |
language |
English |
format |
Article; Early Access |
description |
Anomaly detection from data streams is a hotly studied topic in the machine learning domain. It is widely considered a challenging task because the underlying patterns exhibited by the streaming data may dynamically change at any time. In this paper, a new algorithm is proposed to detect anomalies autonomously for streaming data. The proposed algorithm is nonparametric and does not require any threshold to be preset by users. The algorithmic procedure of the proposed algorithm is composed of the following three complementary stages. Firstly, the potentially anomalous samples that represent highly different patterns from others are identified from data streams based on data density. Then, these potentially anomalous samples are clustered online using the evolving autonomous data partitioning algorithm. Finally, true anomalies are identified from these minor clusters with the least amounts of samples associated with them. Numerical examples based on three benchmark datasets demonstrated the potential of the proposed algorithm as a highly effective approach for anomaly detection from data streams. |
publisher |
ELSEVIER |
issn |
0950-7051 1872-7409 |
publishDate |
2024 |
container_volume |
284 |
container_issue |
|
doi_str_mv |
10.1016/j.knosys.2023.111235 |
topic |
Computer Science |
topic_facet |
Computer Science |
accesstype |
|
id |
WOS:001133574000001 |
url |
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001133574000001 |
record_format |
wos |
collection |
Web of Science (WoS) |
_version_ |
1809678576889364480 |