A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data

This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced imag...

Full description

Bibliographic Details
Published in:International Journal on Informatics Visualization
Main Author: Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
Format: Article
Language:English
Published: Politeknik Negeri Padang 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536
id 2-s2.0-85189610439
spelling 2-s2.0-85189610439
Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
2024
International Journal on Informatics Visualization
8
1
10.62527/joiv.8.1.2194
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536
This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced image classification. The application of SMOTE in imbalanced picture datasets is still underexplored, even though CNNs have been shown to be successful in image classification and that ensemble learning approaches have improved their performance. To investigate whether or not SMOTE can increase classification accuracy and other performance measures when combined with CNN-based classifiers, our research makes use of a CIFAR-10 dataset that has been artificially stepimbalanced and has varying imbalanced ratios. We conducted experiments using five distinct models, namely AdaBoost, XGBoost, standalone CNN, CNN-AdaBoost, and CNN-XGBoost, on datasets that were either imbalanced or SMOTE-balanced. Metrics such as accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were included in the evaluation process. The findings indicate that SMOTE dramatically improves the accuracy of minority classes, and that the combination of ensemble classifiers with CNNs and oversampling techniques significantly improves overall classification performance, particularly in situations when there is a high-class imbalance. When it comes to enhancing imbalanced classification tasks, this study demonstrates the potential of merging oversampling techniques with CNN-based ensemble classifiers to minimize the impacts of class imbalance in picture datasets. This suggests a promising direction for future research in this area. © 2024, Politeknik Negeri Padang. All rights reserved.
Politeknik Negeri Padang
25499904
English
Article

author Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
spellingShingle Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
author_facet Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
author_sort Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
title A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
title_short A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
title_full A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
title_fullStr A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
title_full_unstemmed A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
title_sort A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data
publishDate 2024
container_title International Journal on Informatics Visualization
container_volume 8
container_issue 1
doi_str_mv 10.62527/joiv.8.1.2194
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536
description This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced image classification. The application of SMOTE in imbalanced picture datasets is still underexplored, even though CNNs have been shown to be successful in image classification and that ensemble learning approaches have improved their performance. To investigate whether or not SMOTE can increase classification accuracy and other performance measures when combined with CNN-based classifiers, our research makes use of a CIFAR-10 dataset that has been artificially stepimbalanced and has varying imbalanced ratios. We conducted experiments using five distinct models, namely AdaBoost, XGBoost, standalone CNN, CNN-AdaBoost, and CNN-XGBoost, on datasets that were either imbalanced or SMOTE-balanced. Metrics such as accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were included in the evaluation process. The findings indicate that SMOTE dramatically improves the accuracy of minority classes, and that the combination of ensemble classifiers with CNNs and oversampling techniques significantly improves overall classification performance, particularly in situations when there is a high-class imbalance. When it comes to enhancing imbalanced classification tasks, this study demonstrates the potential of merging oversampling techniques with CNN-based ensemble classifiers to minimize the impacts of class imbalance in picture datasets. This suggests a promising direction for future research in this area. © 2024, Politeknik Negeri Padang. All rights reserved.
publisher Politeknik Negeri Padang
issn 25499904
language English
format Article
accesstype
record_format scopus
collection Scopus
_version_ 1809677885085057024