Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN

In addressing the underrepresentation of Malaysian Sign Language (MSL) in sign language detection research, this study evaluates the performance of deep learning models on an MSL dataset to achieve high accuracy, precision, and recall for hand gesture detection and translation. Methodologically, the...

Full description

Bibliographic Details
Published in:2024 IEEE 15th Control and System Graduate Research Colloquium, ICSGRC 2024 - Conference Proceeding
Main Author: Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206654199&doi=10.1109%2fICSGRC62081.2024.10691273&partnerID=40&md5=d5dd698c8fe1ee71d1ff2e389aa1c212
id 2-s2.0-85206654199
spelling 2-s2.0-85206654199
Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
2024
2024 IEEE 15th Control and System Graduate Research Colloquium, ICSGRC 2024 - Conference Proceeding


10.1109/ICSGRC62081.2024.10691273
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206654199&doi=10.1109%2fICSGRC62081.2024.10691273&partnerID=40&md5=d5dd698c8fe1ee71d1ff2e389aa1c212
In addressing the underrepresentation of Malaysian Sign Language (MSL) in sign language detection research, this study evaluates the performance of deep learning models on an MSL dataset to achieve high accuracy, precision, and recall for hand gesture detection and translation. Methodologically, the study compares two algorithms: YOLOv5, known for its object detection capabilities, and the Convolutional Neural Network (CNN), which excels in learning features directly from raw images. The dataset, comprising ten common MSL phrases, was collected using a webcam and OpenCV from four non-deaf individuals, incorporating variations in hand gestures and lighting. The findings indicate that YOLOv5 significantly outperforms CNN, with YOLOv5 achieving an overall accuracy of 0.99, precision of 0.93, and recall of 0.99, while CNN achieves an overall accuracy of 0.94, precision of 0.76, and recall of 0.71. The study concludes that YOLOv5 offers a more robust and accurate model for MSL detection, which can serve as a valuable communication aid for MSL users. However, the study acknowledges limitations such as using a non-deaf contributor dataset and a modest sample size, which may not fully represent the diversity of MSL users. Future research is suggested to incorporate data from deaf individuals to enhance the model's applicability and accuracy, considering that signing patterns may vary between non-deaf and deaf signers. © 2024 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
spellingShingle Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
author_facet Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
author_sort Halim A.H.A.; Rahim A.A.A.; Rozaini N.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
title Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
title_short Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
title_full Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
title_fullStr Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
title_full_unstemmed Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
title_sort Malaysian Sign Language (MSL) Detection: Comparison of YOLOv5 and CNN
publishDate 2024
container_title 2024 IEEE 15th Control and System Graduate Research Colloquium, ICSGRC 2024 - Conference Proceeding
container_volume
container_issue
doi_str_mv 10.1109/ICSGRC62081.2024.10691273
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206654199&doi=10.1109%2fICSGRC62081.2024.10691273&partnerID=40&md5=d5dd698c8fe1ee71d1ff2e389aa1c212
description In addressing the underrepresentation of Malaysian Sign Language (MSL) in sign language detection research, this study evaluates the performance of deep learning models on an MSL dataset to achieve high accuracy, precision, and recall for hand gesture detection and translation. Methodologically, the study compares two algorithms: YOLOv5, known for its object detection capabilities, and the Convolutional Neural Network (CNN), which excels in learning features directly from raw images. The dataset, comprising ten common MSL phrases, was collected using a webcam and OpenCV from four non-deaf individuals, incorporating variations in hand gestures and lighting. The findings indicate that YOLOv5 significantly outperforms CNN, with YOLOv5 achieving an overall accuracy of 0.99, precision of 0.93, and recall of 0.99, while CNN achieves an overall accuracy of 0.94, precision of 0.76, and recall of 0.71. The study concludes that YOLOv5 offers a more robust and accurate model for MSL detection, which can serve as a valuable communication aid for MSL users. However, the study acknowledges limitations such as using a non-deaf contributor dataset and a modest sample size, which may not fully represent the diversity of MSL users. Future research is suggested to incorporate data from deaf individuals to enhance the model's applicability and accuracy, considering that signing patterns may vary between non-deaf and deaf signers. © 2024 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1814778500841734144