Data Clutter Reduction in Sampling Technique

Visualization is a process of converting data into its visual form as such data patterns can be extracted from the data. Data patterns are knowledge hidden behind the data. However, when data is big, it tends to overlap and clutter on visualization which distorts the data patterns. Data is overly cr...

Full description

Bibliographic Details
Published in:	International Journal of Advanced Computer Science and Applications
Main Author:	Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H.
Format:	Article
Language:	English
Published:	Science and Information Organization 2022
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146656979&doi=10.14569%2fIJACSA.2022.0131294&partnerID=40&md5=1bce5ca5dc47189265f07722892e81a1

id	2-s2.0-85146656979
spelling	2-s2.0-85146656979 Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H. Data Clutter Reduction in Sampling Technique 2022 International Journal of Advanced Computer Science and Applications 13 12 10.14569/IJACSA.2022.0131294 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146656979&doi=10.14569%2fIJACSA.2022.0131294&partnerID=40&md5=1bce5ca5dc47189265f07722892e81a1 Visualization is a process of converting data into its visual form as such data patterns can be extracted from the data. Data patterns are knowledge hidden behind the data. However, when data is big, it tends to overlap and clutter on visualization which distorts the data patterns. Data is overly crowded on visualization thus, it has become a challenge to extract knowledge patterns. Besides, big data is costly to visualize because it requires expensive hardware facilities due to its size. Moreover, it is timely to plot the data since it takes time for data to render on visualizations. Due to those reasons, there is a need to reduce the size of big datasets and at the same time maintain the data patterns. There are many methods of data reduction, which are preprocessing operations, dimension reduction, compression, network theory, redundancy elimination, data mining, machine learning, data filtering and sampling techniques. However, the commonly used data reduction technique is sampling technique that derives samples from data populations. Thus, sampling technique is chosen as a study for data reduction in this paper. However, the studies are scattered and are not discussed in a single paper. Consequently, the objective of this paper is to collect them in a single paper for further analysis in order to understand them in great detail. To achieve the objective, three interdisciplinary databases which are ACM Digital Library, IEEE Explore and Science Direct have been selected. From the database, a total of 48 studies have been extracted and they are from the years 2017 to 2021. Other than sampling techniques, this paper also seeks information on big data, data visualization, data clutter, and data reduction © 2022, International Journal of Advanced Computer Science and Applications.All Rights Reserved. Science and Information Organization 2158107X English Article All Open Access; Gold Open Access
author	Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H.
spellingShingle	Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H. Data Clutter Reduction in Sampling Technique
author_facet	Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H.
author_sort	Jamalludin N.N.M.; Idrus Z.; Idrus Z.; Ahmarofi A.A.; Hamid J.A.; Mahadzir N.H.
title	Data Clutter Reduction in Sampling Technique
title_short	Data Clutter Reduction in Sampling Technique
title_full	Data Clutter Reduction in Sampling Technique
title_fullStr	Data Clutter Reduction in Sampling Technique
title_full_unstemmed	Data Clutter Reduction in Sampling Technique
title_sort	Data Clutter Reduction in Sampling Technique
publishDate	2022
container_title	International Journal of Advanced Computer Science and Applications
container_volume	13
container_issue	12
doi_str_mv	10.14569/IJACSA.2022.0131294
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146656979&doi=10.14569%2fIJACSA.2022.0131294&partnerID=40&md5=1bce5ca5dc47189265f07722892e81a1
description	Visualization is a process of converting data into its visual form as such data patterns can be extracted from the data. Data patterns are knowledge hidden behind the data. However, when data is big, it tends to overlap and clutter on visualization which distorts the data patterns. Data is overly crowded on visualization thus, it has become a challenge to extract knowledge patterns. Besides, big data is costly to visualize because it requires expensive hardware facilities due to its size. Moreover, it is timely to plot the data since it takes time for data to render on visualizations. Due to those reasons, there is a need to reduce the size of big datasets and at the same time maintain the data patterns. There are many methods of data reduction, which are preprocessing operations, dimension reduction, compression, network theory, redundancy elimination, data mining, machine learning, data filtering and sampling techniques. However, the commonly used data reduction technique is sampling technique that derives samples from data populations. Thus, sampling technique is chosen as a study for data reduction in this paper. However, the studies are scattered and are not discussed in a single paper. Consequently, the objective of this paper is to collect them in a single paper for further analysis in order to understand them in great detail. To achieve the objective, three interdisciplinary databases which are ACM Digital Library, IEEE Explore and Science Direct have been selected. From the database, a total of 48 studies have been extracted and they are from the years 2017 to 2021. Other than sampling techniques, this paper also seeks information on big data, data visualization, data clutter, and data reduction © 2022, International Journal of Advanced Computer Science and Applications.All Rights Reserved.
publisher	Science and Information Organization
issn	2158107X
language	English
format	Article
accesstype	All Open Access; Gold Open Access
record_format	scopus
collection	Scopus
_version_	1809677892199645184

Data Clutter Reduction in Sampling Technique

Similar Items