Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract si...
Published in: | 2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024 |
---|---|
Main Authors: | , , , , , , |
Format: | Proceedings Paper |
Language: | English |
Published: |
IEEE
2024
|
Subjects: | |
Online Access: | https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048 |
author |
Razali Noor Fadzilah; Isa Iza Sazanita; Sulaiman Siti Noraini; Osman Muhammad Khusairi; Karim Noor Khairiah A.; Nordin Siti Aminah |
---|---|
spellingShingle |
Razali Noor Fadzilah; Isa Iza Sazanita; Sulaiman Siti Noraini; Osman Muhammad Khusairi; Karim Noor Khairiah A.; Nordin Siti Aminah Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images Automation & Control Systems; Computer Science |
author_facet |
Razali Noor Fadzilah; Isa Iza Sazanita; Sulaiman Siti Noraini; Osman Muhammad Khusairi; Karim Noor Khairiah A.; Nordin Siti Aminah |
author_sort |
Razali |
spelling |
Razali, Noor Fadzilah; Isa, Iza Sazanita; Sulaiman, Siti Noraini; Osman, Muhammad Khusairi; Karim, Noor Khairiah A.; Nordin, Siti Aminah Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images 2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024 English Proceedings Paper The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks. IEEE 2995-2840 2024 10.1109/I2CACIS61270.2024.10649623 Automation & Control Systems; Computer Science WOS:001308267400048 https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048 |
title |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
title_short |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
title_full |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
title_fullStr |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
title_full_unstemmed |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
title_sort |
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images |
container_title |
2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024 |
language |
English |
format |
Proceedings Paper |
description |
The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks. |
publisher |
IEEE |
issn |
2995-2840 |
publishDate |
2024 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/I2CACIS61270.2024.10649623 |
topic |
Automation & Control Systems; Computer Science |
topic_facet |
Automation & Control Systems; Computer Science |
accesstype |
|
id |
WOS:001308267400048 |
url |
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048 |
record_format |
wos |
collection |
Web of Science (WoS) |
_version_ |
1820775408837066752 |