Big Data: Issues and Challenges in Clustering Data Visualization
In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remain...
Published in: | Journal of Advanced Research in Applied Sciences and Engineering Technology |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Semarak Ilmu Publishing
2025
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5 |
id |
2-s2.0-85204223360 |
---|---|
spelling |
2-s2.0-85204223360 Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E. Big Data: Issues and Challenges in Clustering Data Visualization 2025 Journal of Advanced Research in Applied Sciences and Engineering Technology 51 1 10.37934/araset.51.1.150159 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5 In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remains a significant challenge for analysing and understanding the data for various purposes. Clustering analysis plays a crucial role in data analysis and visualization by uncovering hidden patterns and structures within datasets. However, several challenges hinder the effectiveness of clustering analysis, including data dimensionality, selection of appropriate clustering algorithms, determining the optimal number of clusters, interpreting the results, and handling outliers. This paper aims to explore these challenges and presents preferable visualization techniques that aid in visualizing and interpreting clustering results. By addressing these challenges, including the difficulty of handling outliers and the struggles with high-dimensional datasets, and employing effective visualization techniques, researchers and practitioners can enhance their understanding and utilization of clustering analysis in data analysis. © 2025, Semarak Ilmu Publishing. All rights reserved. Semarak Ilmu Publishing 24621943 English Article |
author |
Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E. |
spellingShingle |
Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E. Big Data: Issues and Challenges in Clustering Data Visualization |
author_facet |
Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E. |
author_sort |
Zaki U.H.H.; Kamsani I.I.; Fadzil A.F.A.; Idrus Z.; Kandogan E. |
title |
Big Data: Issues and Challenges in Clustering Data Visualization |
title_short |
Big Data: Issues and Challenges in Clustering Data Visualization |
title_full |
Big Data: Issues and Challenges in Clustering Data Visualization |
title_fullStr |
Big Data: Issues and Challenges in Clustering Data Visualization |
title_full_unstemmed |
Big Data: Issues and Challenges in Clustering Data Visualization |
title_sort |
Big Data: Issues and Challenges in Clustering Data Visualization |
publishDate |
2025 |
container_title |
Journal of Advanced Research in Applied Sciences and Engineering Technology |
container_volume |
51 |
container_issue |
1 |
doi_str_mv |
10.37934/araset.51.1.150159 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85204223360&doi=10.37934%2faraset.51.1.150159&partnerID=40&md5=5a47cccbc578efebe9a00fe74cf7dcb5 |
description |
In the era of big data, the continuous generation of data from various fields has resulted in large and complex datasets. These datasets often come in diverse formats and structures, including unstructured or semi-structured data. Despite the wide availability of big data, high dimensionality remains a significant challenge for analysing and understanding the data for various purposes. Clustering analysis plays a crucial role in data analysis and visualization by uncovering hidden patterns and structures within datasets. However, several challenges hinder the effectiveness of clustering analysis, including data dimensionality, selection of appropriate clustering algorithms, determining the optimal number of clusters, interpreting the results, and handling outliers. This paper aims to explore these challenges and presents preferable visualization techniques that aid in visualizing and interpreting clustering results. By addressing these challenges, including the difficulty of handling outliers and the struggles with high-dimensional datasets, and employing effective visualization techniques, researchers and practitioners can enhance their understanding and utilization of clustering analysis in data analysis. © 2025, Semarak Ilmu Publishing. All rights reserved. |
publisher |
Semarak Ilmu Publishing |
issn |
24621943 |
language |
English |
format |
Article |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1812871793263771648 |