Big Data: Issues and Challenges in Clustering Data Visualization

In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remain...

Full description

Bibliographic Details
Published in:Journal of Advanced Research in Applied Sciences and Engineering Technology
Main Author: Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
Format: Article
Language:English
Published: Semarak Ilmu Publishing 2025
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5
Description
Summary:In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remains a significant challenge for analysing and understanding the data for various purposes. Clustering analysis plays a crucial role in data analysis and visualization by uncovering hidden patterns and structures within datasets. However, several challenges hinder the effectiveness of clustering analysis, including data dimensionality, selection of appropriate clustering algorithms, determining the optimal number of clusters, interpreting the results, and handling outliers. This paper aims to explore these challenges and presents preferable visualization techniques that aid in visualizing and interpreting clustering results. By addressing these challenges, including the difficulty of handling outliers and the struggles with high-dimensional datasets, and employing effective visualization techniques, researchers and practitioners can enhance their understanding and utilization of clustering analysis in data analysis. © 2025, Semarak Ilmu Publishing. All rights reserved.
ISSN:24621943
DOI:10.37934/araset.51.1.150159