An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets

The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a predefined number of clusters. However, s...

Full description

Bibliographic Details
Published in:Journal of Information and Communication Technology
Main Author: Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
Format: Article
Language:English
Published: Universiti Utara Malaysia Press 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116722393&doi=10.32890%2fjict2021.20.4.8&partnerID=40&md5=dc3c52654fa0332d8d03c6a557e12262
id 2-s2.0-85116722393
spelling 2-s2.0-85116722393
Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
2021
Journal of Information and Communication Technology
20
4
10.32890/jict2021.20.4.8
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116722393&doi=10.32890%2fjict2021.20.4.8&partnerID=40&md5=dc3c52654fa0332d8d03c6a557e12262
The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a predefined number of clusters. However, similar to other clustering algorithms, this algorithm requires sufficient data for its unsupervised learning process. The inadequate amount of class label data in a dataset significantly affects the clustering learning process, leading to inefficient and unreliable results. Numerous research have been conducted by hybridising and optimising the KSOM algorithm with various optimisation techniques. Unfortunately, the problems are still unsolved, especially separation boundary and overlapping clusters. Therefore, this research proposed an improved pheromonebased PKSOM algorithm known as iPKSOM to solve the mentioned problem. Six different datasets, i.e. Iris, Seed, Glass, Titanic, WDBC, and Tropical Wood datasets were chosen to investigate the effectiveness of the iPKSOM algorithm. All datasets were observed and compared with the original KSOM results. This modification significantly impacted the clustering process by improving and refining the scatteredness of clustering data and reducing overlapping clusters. Therefore, this proposed algorithm can be implemented in clustering other complex datasets, such as high dimensional and streaming data. © 2021. Journal of Information and Communication Technology. All rights reserved.
Universiti Utara Malaysia Press
1675414X
English
Article
All Open Access; Gold Open Access
author Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
spellingShingle Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
author_facet Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
author_sort Ahmad A.; Yusof R.; Zulkifli N.S.A.; Ismail M.N.
title An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
title_short An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
title_full An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
title_fullStr An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
title_full_unstemmed An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
title_sort An Improved Pheromone-Based Kohonen Self-Organising Map in Clustering and Visualising Balanced and Imbalanced Datasets
publishDate 2021
container_title Journal of Information and Communication Technology
container_volume 20
container_issue 4
doi_str_mv 10.32890/jict2021.20.4.8
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116722393&doi=10.32890%2fjict2021.20.4.8&partnerID=40&md5=dc3c52654fa0332d8d03c6a557e12262
description The data distribution issue remains an unsolved clustering problem in data mining, especially in dealing with imbalanced datasets. The Kohonen Self-Organising Map (KSOM) is one of the well-known clustering algorithms that can solve various problems without a predefined number of clusters. However, similar to other clustering algorithms, this algorithm requires sufficient data for its unsupervised learning process. The inadequate amount of class label data in a dataset significantly affects the clustering learning process, leading to inefficient and unreliable results. Numerous research have been conducted by hybridising and optimising the KSOM algorithm with various optimisation techniques. Unfortunately, the problems are still unsolved, especially separation boundary and overlapping clusters. Therefore, this research proposed an improved pheromonebased PKSOM algorithm known as iPKSOM to solve the mentioned problem. Six different datasets, i.e. Iris, Seed, Glass, Titanic, WDBC, and Tropical Wood datasets were chosen to investigate the effectiveness of the iPKSOM algorithm. All datasets were observed and compared with the original KSOM results. This modification significantly impacted the clustering process by improving and refining the scatteredness of clustering data and reducing overlapping clusters. Therefore, this proposed algorithm can be implemented in clustering other complex datasets, such as high dimensional and streaming data. © 2021. Journal of Information and Communication Technology. All rights reserved.
publisher Universiti Utara Malaysia Press
issn 1675414X
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678027004575744