An analysis on performance of different type classifiers in handling big data sets
Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of infor...
Published in: | Frontiers in Artificial Intelligence and Applications |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
IOS Press BV
2019
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519 |
id |
2-s2.0-85082075517 |
---|---|
spelling |
2-s2.0-85082075517 Mohamad M.; Selamat A. An analysis on performance of different type classifiers in handling big data sets 2019 Frontiers in Artificial Intelligence and Applications 318 10.3233/FAIA190057 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519 Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved. IOS Press BV 9226389 English Conference paper |
author |
Mohamad M.; Selamat A. |
spellingShingle |
Mohamad M.; Selamat A. An analysis on performance of different type classifiers in handling big data sets |
author_facet |
Mohamad M.; Selamat A. |
author_sort |
Mohamad M.; Selamat A. |
title |
An analysis on performance of different type classifiers in handling big data sets |
title_short |
An analysis on performance of different type classifiers in handling big data sets |
title_full |
An analysis on performance of different type classifiers in handling big data sets |
title_fullStr |
An analysis on performance of different type classifiers in handling big data sets |
title_full_unstemmed |
An analysis on performance of different type classifiers in handling big data sets |
title_sort |
An analysis on performance of different type classifiers in handling big data sets |
publishDate |
2019 |
container_title |
Frontiers in Artificial Intelligence and Applications |
container_volume |
318 |
container_issue |
|
doi_str_mv |
10.3233/FAIA190057 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082075517&doi=10.3233%2fFAIA190057&partnerID=40&md5=467a692fad8c4642ac4725b87309e519 |
description |
Data analysis is one of the most important tasks in the decision making process. It helps decision maker to solve many problems such as classification and regression. However, wrong choice of method will produce inefficiency solution especially when dealing with big data sets. Besides, lack of information on data set characteristics also could make the analysis process more complicated and returned low analysis performance. Therefore, this study has conducted a few experimental works that evaluate six common algorithms in handling big data sets. A standard data analysis framework which consists of data initial process, data analysis and performance evaluation had been implemented. Results has shown that each algorithm has its own capability in handling different type of multi-variate big data sets. Naive Bayes is one of the algorithm that has successfully classified all selected data sets. Poker and Madelon required large space of memory during the analysis process. It can be concluded that, an information on data set characteristics and the capability of assigned data analysis method are important to be specified before any decision can be made. © 2019 The authors and IOS Press. All rights reserved. |
publisher |
IOS Press BV |
issn |
9226389 |
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677600299155456 |