Phoneme-based or isolated-word modeling speech recognition system? An overview

In this paper, speech theories and some methodological concerns about feature extraction and classification techniques widely used in speech recognition systems are surveyed and discussed. The shortcomings of isolated-word speech recognition are addressed in comparison with its phoneme-based counterpart. This...

Bibliographic Details
Published in:	Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011
Main Author:	Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N.
Format:	Conference paper
Language:	English
Published:	2011
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7

id	2-s2.0-79957526190
author	Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N.
title	Phoneme-based or isolated-word modeling speech recognition system? An overview
publishDate	2011
container_title	Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011
doi_str_mv	10.1109/CSPA.2011.5759892
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7
description	In this paper, speech theories and some methodological concerns about feature extraction and classification techniques widely used in speech recognition systems are surveyed and discussed. The shortcomings of isolated-word speech recognition are addressed in comparison with its phoneme-based counterpart. This paper could be regarded as a very early step towards establishing a methodology in the search for a more accurate, less complex system with more generic properties. It is hoped that the system can classify speech regardless of variation across languages or accents. A speaker-independent (SI) speech recognition system is required for this application and, in fact, for many other potential applications in which a telephone network (a large database consisting of many different speakers) is a primary requirement. Isolated-word ASR for fixed vocabularies has been successfully implemented using HMM, ANN and SVM, but it suffers from a lack of adaptability to other languages and from increasing complexity as the vocabulary grows. Conversely, phonemes, the smallest units of human speech sound, appear more feasible as the basic building blocks for cross-language mapping. In fact, phonetic transcription systems such as IPA and SAMPA are widely recognized and standardized for several languages of the world. This paper intends to investigate the potential of phonemes as language-independent phonetic units to overcome the lack of available training data and so achieve a more generic speech recognizer. © 2011 IEEE.
language	English
format	Conference paper
record_format	scopus
collection	Scopus
_version_	1809677788455632896
