An analysis on performance of different type classifiers in handling big data sets

Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of infor...

Full description

Bibliographic Details
Published in:Frontiers in Artificial Intelligence and Applications
Main Author: Mohamad M.; Selamat A.
Format: Conference paper
Language:English
Published: IOS Press BV 2019
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519
id 2-s2.0-85082075517
spelling 2-s2.0-85082075517
Mohamad M.; Selamat A.
An analysis on performance of different type classifiers in handling big data sets
2019
Frontiers in Artificial Intelligence and Applications
318

10.3233/FAIA190057
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519
Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved.
IOS Press BV
9226389
English
Conference paper

author Mohamad M.; Selamat A.
spellingShingle Mohamad M.; Selamat A.
An analysis on performance of different type classifiers in handling big data sets
author_facet Mohamad M.; Selamat A.
author_sort Mohamad M.; Selamat A.
title An analysis on performance of different type classifiers in handling big data sets
title_short An analysis on performance of different type classifiers in handling big data sets
title_full An analysis on performance of different type classifiers in handling big data sets
title_fullStr An analysis on performance of different type classifiers in handling big data sets
title_full_unstemmed An analysis on performance of different type classifiers in handling big data sets
title_sort An analysis on performance of different type classifiers in handling big data sets
publishDate 2019
container_title Frontiers in Artificial Intelligence and Applications
container_volume 318
container_issue
doi_str_mv 10.3233/FAIA190057
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519
description Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved.
publisher IOS Press BV
issn 9226389
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677600299155456