Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion r...
Published in: | 2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2017
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f |
id |
2-s2.0-85034250749 |
---|---|
spelling |
2-s2.0-85034250749 Jamil N.; Apandi F.; Hamzah R. Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language 2017 2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017 10.1109/SPED.2017.7990448 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources of an under-resourced language, this requirement poses a problem. In this paper, speech emotion recognition of spontaneous speech in Malay language using prosodic features and Random Forest classifier is presented. We also investigate the influence of age categorized as children, young adults and middle-aged on emotion recognition. Ninety spontaneous speech sentences from 30 native speakers of Malay language are collected and classified into three emotions, which are happy, angry and sad. Results show that the spontaneous speech of middle-aged group achieved the highest accuracy rate followed by children age group and finally the young adults. While sad emotions are recognized satisfactorily across all age groups, confusions exist between happy and angry emotions. © 2017 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Jamil N.; Apandi F.; Hamzah R. |
spellingShingle |
Jamil N.; Apandi F.; Hamzah R. Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
author_facet |
Jamil N.; Apandi F.; Hamzah R. |
author_sort |
Jamil N.; Apandi F.; Hamzah R. |
title |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
title_short |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
title_full |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
title_fullStr |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
title_full_unstemmed |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
title_sort |
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language |
publishDate |
2017 |
container_title |
2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/SPED.2017.7990448 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f |
description |
Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources of an under-resourced language, this requirement poses a problem. In this paper, speech emotion recognition of spontaneous speech in Malay language using prosodic features and Random Forest classifier is presented. We also investigate the influence of age categorized as children, young adults and middle-aged on emotion recognition. Ninety spontaneous speech sentences from 30 native speakers of Malay language are collected and classified into three emotions, which are happy, angry and sad. Results show that the spontaneous speech of middle-aged group achieved the highest accuracy rate followed by children age group and finally the young adults. While sad emotions are recognized satisfactorily across all age groups, confusions exist between happy and angry emotions. © 2017 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677605684641792 |