Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images

The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract si...

Full description

Bibliographic Details
Published in:2024 IEEE International Conference on Automatic Control and Intelligent Systems, I2CACIS 2024 - Proceedings
Main Author: Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203830703&doi=10.1109%2fI2CACIS61270.2024.10649623&partnerID=40&md5=253a66e726ed21956914929da0764700
id 2-s2.0-85203830703
spelling 2-s2.0-85203830703
Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
2024
2024 IEEE International Conference on Automatic Control and Intelligent Systems, I2CACIS 2024 - Proceedings


10.1109/I2CACIS61270.2024.10649623
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203830703&doi=10.1109%2fI2CACIS61270.2024.10649623&partnerID=40&md5=253a66e726ed21956914929da0764700
The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks. © 2024 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
spellingShingle Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
author_facet Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
author_sort Razali N.F.; Isa I.S.; Sulaiman S.N.; Osman M.K.; Karim N.K.A.; Nordin S.A.
title Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_short Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_full Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_fullStr Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_full_unstemmed Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_sort Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
publishDate 2024
container_title 2024 IEEE International Conference on Automatic Control and Intelligent Systems, I2CACIS 2024 - Proceedings
container_volume
container_issue
doi_str_mv 10.1109/I2CACIS61270.2024.10649623
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203830703&doi=10.1109%2fI2CACIS61270.2024.10649623&partnerID=40&md5=253a66e726ed21956914929da0764700
description The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks. © 2024 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1812871795809714176