An analysis on performance of different type classifiers in handling big data sets

Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of infor...

Full description

Bibliographic Details
Published in:	Frontiers in Artificial Intelligence and Applications
Main Author:	Mohamad M.; Selamat A.
Format:	Conference paper
Language:	English
Published:	IOS Press BV 2019
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519

id	2-s2.0-85082075517
spelling	2-s2.0-85082075517 Mohamad M.; Selamat A. An analysis on performance of different type classifiers in handling big data sets 2019 Frontiers in Artificial Intelligence and Applications 318 10.3233/FAIA190057 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519 Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved. IOS Press BV 9226389 English Conference paper
author	Mohamad M.; Selamat A.
spellingShingle	Mohamad M.; Selamat A. An analysis on performance of different type classifiers in handling big data sets
author_facet	Mohamad M.; Selamat A.
author_sort	Mohamad M.; Selamat A.
title	An analysis on performance of different type classifiers in handling big data sets
title_short	An analysis on performance of different type classifiers in handling big data sets
title_full	An analysis on performance of different type classifiers in handling big data sets
title_fullStr	An analysis on performance of different type classifiers in handling big data sets
title_full_unstemmed	An analysis on performance of different type classifiers in handling big data sets
title_sort	An analysis on performance of different type classifiers in handling big data sets
publishDate	2019
container_title	Frontiers in Artificial Intelligence and Applications
container_volume	318
container_issue
doi_str_mv	10.3233/FAIA190057
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519
description	Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved.
publisher	IOS Press BV
issn	9226389
language	English
format	Conference paper
accesstype
record_format	scopus
collection	Scopus
_version_	1825722584231051264

An analysis on performance of different type classifiers in handling big data sets

Similar Items