Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images

The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract si...

Full description

Bibliographic Details
Published in:2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024
Main Authors: Razali, Noor Fadzilah; Isa, Iza Sazanita; Sulaiman, Siti Noraini; Osman, Muhammad Khusairi; Karim, Noor Khairiah A.; Nordin, Siti Aminah
Format: Proceedings Paper
Language:English
Published: IEEE 2024
Subjects:
Online Access:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048
author Razali
Noor Fadzilah; Isa
Iza Sazanita; Sulaiman
Siti Noraini; Osman
Muhammad Khusairi; Karim
Noor Khairiah A.; Nordin
Siti Aminah
spellingShingle Razali
Noor Fadzilah; Isa
Iza Sazanita; Sulaiman
Siti Noraini; Osman
Muhammad Khusairi; Karim
Noor Khairiah A.; Nordin
Siti Aminah
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
Automation & Control Systems; Computer Science
author_facet Razali
Noor Fadzilah; Isa
Iza Sazanita; Sulaiman
Siti Noraini; Osman
Muhammad Khusairi; Karim
Noor Khairiah A.; Nordin
Siti Aminah
author_sort Razali
spelling Razali, Noor Fadzilah; Isa, Iza Sazanita; Sulaiman, Siti Noraini; Osman, Muhammad Khusairi; Karim, Noor Khairiah A.; Nordin, Siti Aminah
Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024
English
Proceedings Paper
The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks.
IEEE
2995-2840

2024


10.1109/I2CACIS61270.2024.10649623
Automation & Control Systems; Computer Science

WOS:001308267400048
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048
title Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_short Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_full Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_fullStr Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_full_unstemmed Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
title_sort Optimization of ReLU Activation Function for Deep-Learning-based Breast Cancer Classification on Mammogram Images
container_title 2024 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS, I2CACIS 2024
language English
format Proceedings Paper
description The use of the deep Convolutional Neural Network (CNN) in breast cancer classification of mammogram images has been widely investigated to aid radiologists in better clinical diagnoses. Multiple levels of convolution and non-linearity repetitions in CNN's architecture are required to extract significant data to be represented. However, the vanishing gradient effect occurs when deeper network training as a product of the partial derivative of loss function on each weightage update can cause no meaningful network learning, even with additional epochs. Overcoming this using the activation function of rectified linear unit (ReLU) by allowing neurons to be activated to allow non-linearity when the output is more than zero could lessen the problem. However, restrictive allowance of non-linearity for <0 for final feature extraction when producing output probability on highly complex data such as mammogram images leads to dropped network performance. To overcome this, this study proposed an adaptive ReLU based on genetic algorithm (GA) profiling to determine the best threshold value for allowing neuron activation based on mutation and adaptation to improve the restrictive capability of the original ReLU. We modified the adaptive ReLU on the final learning layer of two CNN architectures and observed the performance on a public mammogram dataset of INbreast. Our experiments show improved accuracy from 95.0% to 98.5% and improved classification performance compared to other well-known activation functions. Applying evolutionary-based GAs to activation functions can represent an exciting frontier in meta-learning for neural networks.
publisher IEEE
issn 2995-2840

publishDate 2024
container_volume
container_issue
doi_str_mv 10.1109/I2CACIS61270.2024.10649623
topic Automation & Control Systems; Computer Science
topic_facet Automation & Control Systems; Computer Science
accesstype
id WOS:001308267400048
url https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-recordWOS:001308267400048
record_format wos
collection Web of Science (WoS)
_version_ 1820775408837066752