Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach

The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequentl...

Full description

Bibliographic Details
Published in:2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings
Main Author: Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2022
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c
id 2-s2.0-85141748721
spelling 2-s2.0-85141748721
Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
2022
2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings


10.1109/AiDAS56890.2022.9918761
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c
The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequently, the developed trees were dominated by the likelihood of the majority class where the rare class is under-represented. The proposed algorithm with class skew-insensitive heuristic namely the Hellinger-Ant-Tree-Miner (HATM) was compared to the Ant-Tree-Miner (ATM), via a simulation study and application to 15 imbalanced data. Simulation results revealed the advantage of HATM over the ATM under skewed class distributions as the number of covariates and sample sizes increase. Experiments with real data indicate a potential improvement of the ATM measured by balanced accuracy (BACC), F-Measure and minority class prediction (MCP). The Friedman tests justify that HATM performed better than ATM while being competitive with other well-known tree-based classifiers. © 2022 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
spellingShingle Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
author_facet Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
author_sort Razali M.H.M.; Saian R.; Moktar B.; Wah Y.B.; Ku-Mahamud K.R.
title Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
title_short Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
title_full Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
title_fullStr Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
title_full_unstemmed Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
title_sort Performance of ACO-based Decision Tree Algorithm with Imbalanced Class Data Sets - A Heuristic Approach
publishDate 2022
container_title 2022 3rd International Conference on Artificial Intelligence and Data Sciences: Championing Innovations in Artificial Intelligence and Data Sciences for Sustainable Future, AiDAS 2022 - Proceedings
container_volume
container_issue
doi_str_mv 10.1109/AiDAS56890.2022.9918761
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85141748721&doi=10.1109%2fAiDAS56890.2022.9918761&partnerID=40&md5=1d1a51d30c90ca937b44762311a3f75c
description The prediction of minority class can be like finding a needle in a haystack. Bio-inspired classifier such as Ant Colony Optimization (ACO) decision tree experienced ineffective decision boundaries since its entropy-based heuristic is affected by the strong presence of the dominant class. Consequently, the developed trees were dominated by the likelihood of the majority class where the rare class is under-represented. The proposed algorithm with class skew-insensitive heuristic namely the Hellinger-Ant-Tree-Miner (HATM) was compared to the Ant-Tree-Miner (ATM), via a simulation study and application to 15 imbalanced data. Simulation results revealed the advantage of HATM over the ATM under skewed class distributions as the number of covariates and sample sizes increase. Experiments with real data indicate a potential improvement of the ATM measured by balanced accuracy (BACC), F-Measure and minority class prediction (MCP). The Friedman tests justify that HATM performed better than ATM while being competitive with other well-known tree-based classifiers. © 2022 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677595628797952