Anomalous behaviour detection based on heterogeneous data and data fusion

In this paper, we propose a new approach to identify anomalous behaviour based on heterogeneous data and a data fusion technique. There are four types of datasets applied in this study including credit card, loyalty card, GPS, and image data. The first step of the complete framework in this proposed...

Full description

Bibliographic Details
Published in:Soft Computing
Main Author: Ali A.M.; Angelov P.
Format: Article
Language:English
Published: Springer Verlag 2018
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85040059834&doi=10.1007%2fs00500-017-2989-5&partnerID=40&md5=e3c98ef17ab2e1211bda7bbaa2912bb8
id 2-s2.0-85040059834
spelling 2-s2.0-85040059834
Ali A.M.; Angelov P.
Anomalous behaviour detection based on heterogeneous data and data fusion
2018
Soft Computing
22
10
10.1007/s00500-017-2989-5
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85040059834&doi=10.1007%2fs00500-017-2989-5&partnerID=40&md5=e3c98ef17ab2e1211bda7bbaa2912bb8
In this paper, we propose a new approach to identify anomalous behaviour based on heterogeneous data and a data fusion technique. There are four types of datasets applied in this study including credit card, loyalty card, GPS, and image data. The first step of the complete framework in this proposed study is to identify the best features for every dataset. Then, the new anomaly detection technique which is recently introduced and known as empirical data analytics (EDA) is applied to detect the abnormal behaviour based on the datasets. Standardised eccentricity (a newly introduced within EDA measure offering a new simplified form of the well-known Chebyshev inequality) can be applied to any data distribution. Image data are processed using pre-trained deep learning network, and classification is done by using support vector machine. Most of the other data used in our previous work are of type “signal”/real number (e.g. credit card, loyalty card and GPS data). However, a clear conclusion that a misuse was made very often cannot be reached based on them only. When gender or age is different from the expected, it is obvious misuse. At the final stage of the proposed method is combining anomaly result and image recognition using data fusion technique. From the experiment results, this proposed technique may simplify the tedious job in the real complex cases of forensic investigation. The proposed technique is using heterogeneous data which combine all the data from the VAST Challenge as well as image data using an introduced data fusion technique. These can assist the human expert in processing huge amount of heterogeneous data to detect anomalies. In future research, text data can also be used as a part of heterogeneous data mixture, and the data fusion technique may be applied to other datasets. © 2018, The Author(s).
Springer Verlag
14327643
English
Article
All Open Access; Hybrid Gold Open Access
author Ali A.M.; Angelov P.
spellingShingle Ali A.M.; Angelov P.
Anomalous behaviour detection based on heterogeneous data and data fusion
author_facet Ali A.M.; Angelov P.
author_sort Ali A.M.; Angelov P.
title Anomalous behaviour detection based on heterogeneous data and data fusion
title_short Anomalous behaviour detection based on heterogeneous data and data fusion
title_full Anomalous behaviour detection based on heterogeneous data and data fusion
title_fullStr Anomalous behaviour detection based on heterogeneous data and data fusion
title_full_unstemmed Anomalous behaviour detection based on heterogeneous data and data fusion
title_sort Anomalous behaviour detection based on heterogeneous data and data fusion
publishDate 2018
container_title Soft Computing
container_volume 22
container_issue 10
doi_str_mv 10.1007/s00500-017-2989-5
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85040059834&doi=10.1007%2fs00500-017-2989-5&partnerID=40&md5=e3c98ef17ab2e1211bda7bbaa2912bb8
description In this paper, we propose a new approach to identify anomalous behaviour based on heterogeneous data and a data fusion technique. There are four types of datasets applied in this study including credit card, loyalty card, GPS, and image data. The first step of the complete framework in this proposed study is to identify the best features for every dataset. Then, the new anomaly detection technique which is recently introduced and known as empirical data analytics (EDA) is applied to detect the abnormal behaviour based on the datasets. Standardised eccentricity (a newly introduced within EDA measure offering a new simplified form of the well-known Chebyshev inequality) can be applied to any data distribution. Image data are processed using pre-trained deep learning network, and classification is done by using support vector machine. Most of the other data used in our previous work are of type “signal”/real number (e.g. credit card, loyalty card and GPS data). However, a clear conclusion that a misuse was made very often cannot be reached based on them only. When gender or age is different from the expected, it is obvious misuse. At the final stage of the proposed method is combining anomaly result and image recognition using data fusion technique. From the experiment results, this proposed technique may simplify the tedious job in the real complex cases of forensic investigation. The proposed technique is using heterogeneous data which combine all the data from the VAST Challenge as well as image data using an introduced data fusion technique. These can assist the human expert in processing huge amount of heterogeneous data to detect anomalies. In future research, text data can also be used as a part of heterogeneous data mixture, and the data fusion technique may be applied to other datasets. © 2018, The Author(s).
publisher Springer Verlag
issn 14327643
language English
format Article
accesstype All Open Access; Hybrid Gold Open Access
record_format scopus
collection Scopus
_version_ 1825722584824545280