Phoneme-based or isolated-word modeling speech recognition system? An overview

This paper surveys and discusses speech theories and methodological concerns about the feature extraction and classification techniques widely used in speech recognition systems. The shortcomings of isolated-word speech recognition are addressed in comparison with its phoneme-based counterpart. This...


Bibliographic Details
Published in: Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011
Main Author: Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N.
Format: Conference paper
Language: English
Published: 2011
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7
id 2-s2.0-79957526190
doi 10.1109/CSPA.2011.5759892
This paper surveys and discusses speech theories and methodological concerns about the feature extraction and classification techniques widely used in speech recognition systems. The shortcomings of isolated-word speech recognition are addressed in comparison with its phoneme-based counterpart. The paper can be regarded as a very early step towards establishing a methodology in the search for a more accurate, less complex system with more generic properties. It is hoped that such a system can classify speech regardless of variation across languages or accents. A speaker-independent (SI) speech recognition system is required for this application and, in fact, for many other potential applications wherever a telephone network (a large database consisting of many different speakers) is a primary requirement. Isolated-word ASR for fixed vocabularies has been successfully implemented using HMM, ANN and SVM, but it adapts poorly to other languages and grows in complexity as the vocabulary size increases. Conversely, phonemes, the smallest units of human speech sound, appear more feasible as the basic building blocks for cross-language mapping. Indeed, phonetic transcription systems such as IPA and SAMPA are widely recognized and standardized for several languages. This paper intends to investigate the potential of phonemes as language-independent phonetic units to overcome the lack of available training data and so achieve a more generic speech recognizer. © 2011 IEEE.



record_format scopus
collection Scopus