Phoneme-based or isolated-word modeling speech recognition system? An overview
In this paper speech theories and some methodological concerns about feature extraction and classification techniques widely used in speech recognition system are surveyed and discussed. The shortage of isolated word speech recognition is addressed as compared to its phoneme-based counterpart. This...
Published in: | Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
2011
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7 |
id |
2-s2.0-79957526190 |
---|---|
spelling |
2-s2.0-79957526190 Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N. Phoneme-based or isolated-word modeling speech recognition system? An overview 2011 Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011 10.1109/CSPA.2011.5759892 https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7 In this paper speech theories and some methodological concerns about feature extraction and classification techniques widely used in speech recognition system are surveyed and discussed. The shortage of isolated word speech recognition is addressed as compared to its phoneme-based counterpart. This paper could be regarded as a very early stage towards methodology establishment in searching for better accuracy and less complexity system which has more generic properties. It is hoped that the system can classify speech regardless of the varieties across languages or accents. Speaker independency (SI) manner speech recognition system is required for this application and in fact, in many other potential applications as much as a telephonic network (large database consists of many different speakers) is a primary requirement. Isolated-word ASR for fixed vocabularies has been successfully implemented using HMM, ANN and SVM but suffers from lack of adaptability to other languages and increase in complexity as number of vocabularies increases. Conversely, phonemes, the smallest unit of human speech sounds are apparently more feasible to represent the basic building block for cross-language mapping. In fact, the phonetic transcription systems such as IPA and SAMPA are widely recognized and standardized for several languages in the world. This paper intends to investigate the phoneme-based potential as language independent phonetic units to overcome the lack of available training data so as to achieve a more generic speech recognizer. © 2011 IEEE. English Conference paper |
author |
Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N. |
spellingShingle |
Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N. Phoneme-based or isolated-word modeling speech recognition system? An overview |
author_facet |
Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N. |
author_sort |
Yusnita M.A.; Paulraj M.P.; Yaacob S.; Abu Bakar S.; Saidatul A.; Abdullah A.N. |
title |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
title_short |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
title_full |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
title_fullStr |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
title_full_unstemmed |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
title_sort |
Phoneme-based or isolated-word modeling speech recognition system? An overview |
publishDate |
2011 |
container_title |
Proceedings - 2011 IEEE 7th International Colloquium on Signal Processing and Its Applications, CSPA 2011 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/CSPA.2011.5759892 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-79957526190&doi=10.1109%2fCSPA.2011.5759892&partnerID=40&md5=d14274f39f7580865f84bd9b0d0963e7 |
description |
In this paper speech theories and some methodological concerns about feature extraction and classification techniques widely used in speech recognition system are surveyed and discussed. The shortage of isolated word speech recognition is addressed as compared to its phoneme-based counterpart. This paper could be regarded as a very early stage towards methodology establishment in searching for better accuracy and less complexity system which has more generic properties. It is hoped that the system can classify speech regardless of the varieties across languages or accents. Speaker independency (SI) manner speech recognition system is required for this application and in fact, in many other potential applications as much as a telephonic network (large database consists of many different speakers) is a primary requirement. Isolated-word ASR for fixed vocabularies has been successfully implemented using HMM, ANN and SVM but suffers from lack of adaptability to other languages and increase in complexity as number of vocabularies increases. Conversely, phonemes, the smallest unit of human speech sounds are apparently more feasible to represent the basic building block for cross-language mapping. In fact, the phonetic transcription systems such as IPA and SAMPA are widely recognized and standardized for several languages in the world. This paper intends to investigate the phoneme-based potential as language independent phonetic units to overcome the lack of available training data so as to achieve a more generic speech recognizer. © 2011 IEEE. |
publisher |
|
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677788455632896 |