Detection of Respiratory Diseases from Auscultated Sounds Using VGG16 with Data Augmentation

Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the de...

Full description

Bibliographic Details
Published in:Proceedings - 2024 2nd International Conference on Computer Graphics and Image Processing, CGIP 2024
Main Author: Binti Roslan I.K.; Ehara F.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195144895&doi=10.1109%2fCGIP62525.2024.00032&partnerID=40&md5=9d88d61a38057d61e08f80112286a599
Description
Summary:Lung disease, ranking third globally for causes of death with over 3 million annual fatalities according to the 2015 World Health Organization (WHO), underscores the significance of lung sound features in respiratory illness diagnosis. Auscultation, a subjective early detection method, led to the development of a Computer-Aided Diagnosis (CAD) system employing the VGG16 model for medical imaging complexity. Data limitations necessitated augmentation, enhancing VGG16's performance compared to non-augment results. Respiratory Disease Detection (RDD) task was introduced. Short-Time Fourier Transform (STFT) facilitated audio feature extraction, while VGG16, using transfer learning and fine-tuning, proved effective on a Kaggle-sourced dataset. Augmentation techniques, including pitch shifting, time stretching, and horizontal flipping, addressed class imbalance. The study introduces innovative data augmentation techniques to overcome the challenge of limited training data, demonstrating the effectiveness of augmentation in enhancing the VGG16 model's performance. © 2024 IEEE.
ISSN:
DOI:10.1109/CGIP62525.2024.00032