Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation

Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the de...

Full description

Bibliographic Details
Published in:Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024
Main Author: Binti Roslan I.K.; Ehara F.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599
id 2-s2.0-85195144895
spelling 2-s2.0-85195144895
Binti Roslan I.K.; Ehara F.
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
2024
Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024


10.1109/CGIP62525.2024.00032
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599
Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the development of a Computer-Aided Diagnosis (CAD) system employing the VGG16 model for medical imaging complexity. Data limitations necessitated augmentation, enhancing VGG16's performance compared to non-augment results. Respiratory Disease Detection (RDD) task was introduced. Short-Time Fourier Transform (STFT) facilitated audio feature extraction, while VGG16, using transfer learning and fine-tuning, proved effective on a Kaggle-sourced dataset. Augmentation techniques, including pitch shifting, time stretching, and horizontal flipping, addressed class imbalance. The study introduces innovative data augmentation techniques to overcome the challenge of limited training data, demonstrating the effectiveness of augmentation in enhancing the VGG16 model's performance. © 2024 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Binti Roslan I.K.; Ehara F.
spellingShingle Binti Roslan I.K.; Ehara F.
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
author_facet Binti Roslan I.K.; Ehara F.
author_sort Binti Roslan I.K.; Ehara F.
title Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
title_short Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
title_full Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
title_fullStr Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
title_full_unstemmed Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
title_sort Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
publishDate 2024
container_title Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024
container_volume
container_issue
doi_str_mv 10.1109/CGIP62525.2024.00032
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599
description Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the development of a Computer-Aided Diagnosis (CAD) system employing the VGG16 model for medical imaging complexity. Data limitations necessitated augmentation, enhancing VGG16's performance compared to non-augment results. Respiratory Disease Detection (RDD) task was introduced. Short-Time Fourier Transform (STFT) facilitated audio feature extraction, while VGG16, using transfer learning and fine-tuning, proved effective on a Kaggle-sourced dataset. Augmentation techniques, including pitch shifting, time stretching, and horizontal flipping, addressed class imbalance. The study introduces innovative data augmentation techniques to overcome the challenge of limited training data, demonstrating the effectiveness of augmentation in enhancing the VGG16 model's performance. © 2024 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809678155449892864