Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection

Microcalcifications in mammogram images are primary indicators for detecting the early stages of breast cancer. However, dense tissues and noise in the images make it challenging to classify the microcalcifications. Currently, preprocessing procedures such as noise removal techniques are applied dir...

Full description

Bibliographic Details
Published in:Cancers
Main Author: Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
Format: Article
Language:English
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159191208&doi=10.3390%2fcancers15092606&partnerID=40&md5=7bd66b26a233f13279933da49e5d25ec
id 2-s2.0-85159191208
spelling 2-s2.0-85159191208
Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
2023
Cancers
15
9
10.3390/cancers15092606
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159191208&doi=10.3390%2fcancers15092606&partnerID=40&md5=7bd66b26a233f13279933da49e5d25ec
Microcalcifications in mammogram images are primary indicators for detecting the early stages of breast cancer. However, dense tissues and noise in the images make it challenging to classify the microcalcifications. Currently, preprocessing procedures such as noise removal techniques are applied directly on the images, which may produce a blurry effect and loss of image details. Further, most of the features used in classification models focus on local information of the images and are often burdened with details, resulting in data complexity. This research proposed a filtering and feature extraction technique using persistent homology (PH), a powerful mathematical tool used to study the structure of complex datasets and patterns. The filtering process is not performed directly on the image matrix but through the diagrams arising from PH. These diagrams will enable us to distinguish prominent characteristics of the image from noise. The filtered diagrams are then vectorised using PH features. Supervised machine learning models are trained on the MIAS and DDSM datasets to evaluate the extracted features’ efficacy in discriminating between benign and malignant classes and to obtain the optimal filtering level. This study reveals that appropriate PH filtering levels and features can improve classification accuracy in early cancer detection. © 2023 by the authors.
Multidisciplinary Digital Publishing Institute (MDPI)
20726694
English
Article
All Open Access; Gold Open Access
author Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
spellingShingle Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
author_facet Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
author_sort Malek A.A.; Alias M.A.; Razak F.A.; Noorani M.S.M.; Mahmud R.; Zulkepli N.F.S.
title Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
title_short Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
title_full Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
title_fullStr Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
title_full_unstemmed Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
title_sort Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection
publishDate 2023
container_title Cancers
container_volume 15
container_issue 9
doi_str_mv 10.3390/cancers15092606
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159191208&doi=10.3390%2fcancers15092606&partnerID=40&md5=7bd66b26a233f13279933da49e5d25ec
description Microcalcifications in mammogram images are primary indicators for detecting the early stages of breast cancer. However, dense tissues and noise in the images make it challenging to classify the microcalcifications. Currently, preprocessing procedures such as noise removal techniques are applied directly on the images, which may produce a blurry effect and loss of image details. Further, most of the features used in classification models focus on local information of the images and are often burdened with details, resulting in data complexity. This research proposed a filtering and feature extraction technique using persistent homology (PH), a powerful mathematical tool used to study the structure of complex datasets and patterns. The filtering process is not performed directly on the image matrix but through the diagrams arising from PH. These diagrams will enable us to distinguish prominent characteristics of the image from noise. The filtered diagrams are then vectorised using PH features. Supervised machine learning models are trained on the MIAS and DDSM datasets to evaluate the extracted features’ efficacy in discriminating between benign and malignant classes and to obtain the optimal filtering level. This study reveals that appropriate PH filtering levels and features can improve classification accuracy in early cancer detection. © 2023 by the authors.
publisher Multidisciplinary Digital Publishing Institute (MDPI)
issn 20726694
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678156771098624