Big Data: Issues and Challenges in Clustering Data Visualization

In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remain...

Full description

Bibliographic Details
Published in:Journal of Advanced Research in Applied Sciences and Engineering Technology
Main Author: Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
Format: Article
Language:English
Published: Semarak Ilmu Publishing 2025
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5
id 2-s2.0-85204223360
spelling 2-s2.0-85204223360
Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
Big Data: Issues and Challenges in Clustering Data Visualization
2025
Journal of Advanced Research in Applied Sciences and Engineering Technology
51
1
10.37934/araset.51.1.150159
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5
In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remains a significant challenge for analysing and understanding the data for various purposes. Clustering analysis plays a crucial role in data analysis and visualization by uncovering hidden patterns and structures within datasets. However, several challenges hinder the effectiveness of clustering analysis, including data dimensionality, selection of appropriate clustering algorithms, determining the optimal number of clusters, interpreting the results, and handling outliers. This paper aims to explore these challenges and presents preferable visualization techniques that aid in visualizing and interpreting clustering results. By addressing these challenges, including the difficulty of handling outliers and the struggles with high-dimensional datasets, and employing effective visualization techniques, researchers and practitioners can enhance their understanding and utilization of clustering analysis in data analysis. © 2025, Semarak Ilmu Publishing. All rights reserved.
Semarak Ilmu Publishing
24621943
English
Article

author Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
spellingShingle Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
Big Data: Issues and Challenges in Clustering Data Visualization
author_facet Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
author_sort Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E.
title Big Data: Issues and Challenges in Clustering Data Visualization
title_short Big Data: Issues and Challenges in Clustering Data Visualization
title_full Big Data: Issues and Challenges in Clustering Data Visualization
title_fullStr Big Data: Issues and Challenges in Clustering Data Visualization
title_full_unstemmed Big Data: Issues and Challenges in Clustering Data Visualization
title_sort Big Data: Issues and Challenges in Clustering Data Visualization
publishDate 2025
container_title Journal of Advanced Research in Applied Sciences and Engineering Technology
container_volume 51
container_issue 1
doi_str_mv 10.37934/araset.51.1.150159
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5
description In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remains a significant challenge for analysing and understanding the data for various purposes. Clustering analysis plays a crucial role in data analysis and visualization by uncovering hidden patterns and structures within datasets. However, several challenges hinder the effectiveness of clustering analysis, including data dimensionality, selection of appropriate clustering algorithms, determining the optimal number of clusters, interpreting the results, and handling outliers. This paper aims to explore these challenges and presents preferable visualization techniques that aid in visualizing and interpreting clustering results. By addressing these challenges, including the difficulty of handling outliers and the struggles with high-dimensional datasets, and employing effective visualization techniques, researchers and practitioners can enhance their understanding and utilization of clustering analysis in data analysis. © 2025, Semarak Ilmu Publishing. All rights reserved.
publisher Semarak Ilmu Publishing
issn 24621943
language English
format Article
accesstype
record_format scopus
collection Scopus
_version_ 1812871793263771648