Pediatric pneumonia detection using vision transformer

Pneumonia is the leading cause of death in children and one of the leading causes of death worldwide. Early detection of pneumonia in pediatric is needed so that children get proper treatment and avoid wider transmission. This study aims to detect pneumonia in children based on chest X-ray (CXR) ima...

Full description

Bibliographic Details
Published in:AIP Conference Proceedings
Main Author: Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
Format: Conference paper
Language:English
Published: American Institute of Physics 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85202626485&doi=10.1063%2f5.0224905&partnerID=40&md5=81939fb419e22de46827df96bcb5cda0
id 2-s2.0-85202626485
spelling 2-s2.0-85202626485
Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
Pediatric pneumonia detection using vision transformer
2024
AIP Conference Proceedings
3189
1
10.1063/5.0224905
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85202626485&doi=10.1063%2f5.0224905&partnerID=40&md5=81939fb419e22de46827df96bcb5cda0
Pneumonia is the leading cause of death in children and one of the leading causes of death worldwide. Early detection of pneumonia in pediatric is needed so that children get proper treatment and avoid wider transmission. This study aims to detect pneumonia in children based on chest X-ray (CXR) images by comparing the Vision Transformer (ViT), InceptionV3, and Xception models. Vision Transformer (ViT) is a new transformer model that trains neural networks on image data. Meanwhile, the InceptionV3 and Xception models are transfer learning models in a convolutional neural network. This study uses a dataset called the Pediatric Chest X-ray Pneumonia, which contains two classes, namely normal and pneumonia. Vision Transformer achieved the best accuracy, precision, recall, and F1-measure values of 97.78, 97.46, 97.88, and 97.16. Furthermore, the second model, namely InceptionV3, obtained accuracy, precision, recall, and F1-measure values of 97.18, 96.06, 96.87, and 96.45. Finally, the Xception model obtained accuracy, precision, recall, and F1-measure values of 97.01, 95.85, 96.65, and 96.24. So, the conclusion of this research is that the Vision Transformer model can generate good performance for pediatric pneumonia detection based on CXR. © 2024 Author(s).
American Institute of Physics
0094243X
English
Conference paper

author Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
spellingShingle Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
Pediatric pneumonia detection using vision transformer
author_facet Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
author_sort Cahyani D.E.; Setyawan F.F.; Hariadi A.D.; Wahyuningsih S.; Setumin S.
title Pediatric pneumonia detection using vision transformer
title_short Pediatric pneumonia detection using vision transformer
title_full Pediatric pneumonia detection using vision transformer
title_fullStr Pediatric pneumonia detection using vision transformer
title_full_unstemmed Pediatric pneumonia detection using vision transformer
title_sort Pediatric pneumonia detection using vision transformer
publishDate 2024
container_title AIP Conference Proceedings
container_volume 3189
container_issue 1
doi_str_mv 10.1063/5.0224905
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85202626485&doi=10.1063%2f5.0224905&partnerID=40&md5=81939fb419e22de46827df96bcb5cda0
description Pneumonia is the leading cause of death in children and one of the leading causes of death worldwide. Early detection of pneumonia in pediatric is needed so that children get proper treatment and avoid wider transmission. This study aims to detect pneumonia in children based on chest X-ray (CXR) images by comparing the Vision Transformer (ViT), InceptionV3, and Xception models. Vision Transformer (ViT) is a new transformer model that trains neural networks on image data. Meanwhile, the InceptionV3 and Xception models are transfer learning models in a convolutional neural network. This study uses a dataset called the Pediatric Chest X-ray Pneumonia, which contains two classes, namely normal and pneumonia. Vision Transformer achieved the best accuracy, precision, recall, and F1-measure values of 97.78, 97.46, 97.88, and 97.16. Furthermore, the second model, namely InceptionV3, obtained accuracy, precision, recall, and F1-measure values of 97.18, 96.06, 96.87, and 96.45. Finally, the Xception model obtained accuracy, precision, recall, and F1-measure values of 97.01, 95.85, 96.65, and 96.24. So, the conclusion of this research is that the Vision Transformer model can generate good performance for pediatric pneumonia detection based on CXR. © 2024 Author(s).
publisher American Institute of Physics
issn 0094243X
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1812871794673057792