A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced imag...
Published in: | International Journal on Informatics Visualization |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Politeknik Negeri Padang
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536 |
id |
2-s2.0-85189610439 |
---|---|
spelling |
2-s2.0-85189610439 Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z. A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data 2024 International Journal on Informatics Visualization 8 1 10.62527/joiv.8.1.2194 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536 This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced image classification. The application of SMOTE in imbalanced picture datasets is still underexplored, even though CNNs have been shown to be successful in image classification and that ensemble learning approaches have improved their performance. To investigate whether or not SMOTE can increase classification accuracy and other performance measures when combined with CNN-based classifiers, our research makes use of a CIFAR-10 dataset that has been artificially stepimbalanced and has varying imbalanced ratios. We conducted experiments using five distinct models, namely AdaBoost, XGBoost, standalone CNN, CNN-AdaBoost, and CNN-XGBoost, on datasets that were either imbalanced or SMOTE-balanced. Metrics such as accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were included in the evaluation process. The findings indicate that SMOTE dramatically improves the accuracy of minority classes, and that the combination of ensemble classifiers with CNNs and oversampling techniques significantly improves overall classification performance, particularly in situations when there is a high-class imbalance. When it comes to enhancing imbalanced classification tasks, this study demonstrates the potential of merging oversampling techniques with CNN-based ensemble classifiers to minimize the impacts of class imbalance in picture datasets. This suggests a promising direction for future research in this area. © 2024, Politeknik Negeri Padang. All rights reserved. Politeknik Negeri Padang 25499904 English Article |
author |
Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z. |
spellingShingle |
Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z. A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
author_facet |
Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z. |
author_sort |
Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z. |
title |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
title_short |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
title_full |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
title_fullStr |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
title_full_unstemmed |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
title_sort |
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data |
publishDate |
2024 |
container_title |
International Journal on Informatics Visualization |
container_volume |
8 |
container_issue |
1 |
doi_str_mv |
10.62527/joiv.8.1.2194 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536 |
description |
This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced image classification. The application of SMOTE in imbalanced picture datasets is still underexplored, even though CNNs have been shown to be successful in image classification and that ensemble learning approaches have improved their performance. To investigate whether or not SMOTE can increase classification accuracy and other performance measures when combined with CNN-based classifiers, our research makes use of a CIFAR-10 dataset that has been artificially stepimbalanced and has varying imbalanced ratios. We conducted experiments using five distinct models, namely AdaBoost, XGBoost, standalone CNN, CNN-AdaBoost, and CNN-XGBoost, on datasets that were either imbalanced or SMOTE-balanced. Metrics such as accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were included in the evaluation process. The findings indicate that SMOTE dramatically improves the accuracy of minority classes, and that the combination of ensemble classifiers with CNNs and oversampling techniques significantly improves overall classification performance, particularly in situations when there is a high-class imbalance. When it comes to enhancing imbalanced classification tasks, this study demonstrates the potential of merging oversampling techniques with CNN-based ensemble classifiers to minimize the impacts of class imbalance in picture datasets. This suggests a promising direction for future research in this area. © 2024, Politeknik Negeri Padang. All rights reserved. |
publisher |
Politeknik Negeri Padang |
issn |
25499904 |
language |
English |
format |
Article |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677885085057024 |