A Comparative Analysis of Combination of CNN-Based Models with Ensemble Learning on Imbalanced Data

This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced imag...

Full description

Bibliographic Details
Published in:International Journal on Informatics Visualization
Main Author: Gao X.; Jamil N.; Ramli M.I.; Ariffin S.M.Z.S.Z.
Format: Article
Language:English
Published: Politeknik Negeri Padang 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85189610439&doi=10.62527%2fjoiv.8.1.2194&partnerID=40&md5=fde1ce93c2d09526862bd2a73f948536
Description
Summary:This study investigates the usefulness of the Synthetic Minority Oversampling Technique (SMOTE) in conjunction with convolutional neural network (CNN) models, which include both single and ensemble classifiers. The objective of this research is to handle the difficulty of multi-class imbalanced image classification. The application of SMOTE in imbalanced picture datasets is still underexplored, even though CNNs have been shown to be successful in image classification and that ensemble learning approaches have improved their performance. To investigate whether or not SMOTE can increase classification accuracy and other performance measures when combined with CNN-based classifiers, our research makes use of a CIFAR-10 dataset that has been artificially stepimbalanced and has varying imbalanced ratios. We conducted experiments using five distinct models, namely AdaBoost, XGBoost, standalone CNN, CNN-AdaBoost, and CNN-XGBoost, on datasets that were either imbalanced or SMOTE-balanced. Metrics such as accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were included in the evaluation process. The findings indicate that SMOTE dramatically improves the accuracy of minority classes, and that the combination of ensemble classifiers with CNNs and oversampling techniques significantly improves overall classification performance, particularly in situations when there is a high-class imbalance. When it comes to enhancing imbalanced classification tasks, this study demonstrates the potential of merging oversampling techniques with CNN-based ensemble classifiers to minimize the impacts of class imbalance in picture datasets. This suggests a promising direction for future research in this area. © 2024, Politeknik Negeri Padang. All rights reserved.
ISSN:25499904
DOI:10.62527/joiv.8.1.2194