Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation
Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the de...
Published in: | Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599 |
id |
2-s2.0-85195144895 |
---|---|
spelling |
2-s2.0-85195144895 Binti Roslan I.K.; Ehara F. Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation 2024 Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024 10.1109/CGIP62525.2024.00032 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599 Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the development of a Computer-Aided Diagnosis (CAD) system employing the VGG16 model for medical imaging complexity. Data limitations necessitated augmentation, enhancing VGG16's performance compared to non-augment results. Respiratory Disease Detection (RDD) task was introduced. Short-Time Fourier Transform (STFT) facilitated audio feature extraction, while VGG16, using transfer learning and fine-tuning, proved effective on a Kaggle-sourced dataset. Augmentation techniques, including pitch shifting, time stretching, and horizontal flipping, addressed class imbalance. The study introduces innovative data augmentation techniques to overcome the challenge of limited training data, demonstrating the effectiveness of augmentation in enhancing the VGG16 model's performance. © 2024 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Binti Roslan I.K.; Ehara F. |
spellingShingle |
Binti Roslan I.K.; Ehara F. Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
author_facet |
Binti Roslan I.K.; Ehara F. |
author_sort |
Binti Roslan I.K.; Ehara F. |
title |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
title_short |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
title_full |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
title_fullStr |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
title_full_unstemmed |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
title_sort |
Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation |
publishDate |
2024 |
container_title |
Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/CGIP62525.2024.00032 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599 |
description |
Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the development of a Computer-Aided Diagnosis (CAD) system employing the VGG16 model for medical imaging complexity. Data limitations necessitated augmentation, enhancing VGG16's performance compared to non-augment results. Respiratory Disease Detection (RDD) task was introduced. Short-Time Fourier Transform (STFT) facilitated audio feature extraction, while VGG16, using transfer learning and fine-tuning, proved effective on a Kaggle-sourced dataset. Augmentation techniques, including pitch shifting, time stretching, and horizontal flipping, addressed class imbalance. The study introduces innovative data augmentation techniques to overcome the challenge of limited training data, demonstrating the effectiveness of augmentation in enhancing the VGG16 model's performance. © 2024 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678155449892864 |