Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequentl...
Published in: | 2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2022
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c |
id |
2-s2.0-85141748721 |
---|---|
spelling |
2-s2.0-85141748721 Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R. Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach 2022 2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings 10.1109/AiDAS56890.2022.9918761 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequently, the developed trees were dominated by the likelihood of the majority class where the rare class is under-represented. The proposed algorithm with class skew-insensitive heuristic namely the Hellinger-Ant-Tree-Miner (HATM) was compared to the Ant-Tree-Miner (ATM), via a simulation study and application to 15 imbalanced data. Simulation results revealed the advantage of HATM over the ATM under skewed class distributions as the number of covariates and sample sizes increase. Experiments with real data indicate a potential improvement of the ATM measured by balanced accuracy (BACC), F-Measure and minority class prediction (MCP). The Friedman tests justify that HATM performed better than ATM while being competitive with other well-known tree-based classifiers. © 2022 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R. |
spellingShingle |
Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R. Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
author_facet |
Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R. |
author_sort |
Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R. |
title |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
title_short |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
title_full |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
title_fullStr |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
title_full_unstemmed |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
title_sort |
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach |
publishDate |
2022 |
container_title |
2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/AiDAS56890.2022.9918761 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c |
description |
The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequently, the developed trees were dominated by the likelihood of the majority class where the rare class is under-represented. The proposed algorithm with class skew-insensitive heuristic namely the Hellinger-Ant-Tree-Miner (HATM) was compared to the Ant-Tree-Miner (ATM), via a simulation study and application to 15 imbalanced data. Simulation results revealed the advantage of HATM over the ATM under skewed class distributions as the number of covariates and sample sizes increase. Experiments with real data indicate a potential improvement of the ATM measured by balanced accuracy (BACC), F-Measure and minority class prediction (MCP). The Friedman tests justify that HATM performed better than ATM while being competitive with other well-known tree-based classifiers. © 2022 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677595628797952 |