Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language

Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion r...

Full description

Bibliographic Details
Published in:2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017
Main Author: Jamil N.; Apandi F.; Hamzah R.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2017
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f
id 2-s2.0-85034250749
spelling 2-s2.0-85034250749
Jamil N.; Apandi F.; Hamzah R.
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
2017
2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017


10.1109/SPED.2017.7990448
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f
Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources of an under-resourced language, this requirement poses a problem. In this paper, speech emotion recognition of spontaneous speech in Malay language using prosodic features and Random Forest classifier is presented. We also investigate the influence of age categorized as children, young adults and middle-aged on emotion recognition. Ninety spontaneous speech sentences from 30 native speakers of Malay language are collected and classified into three emotions, which are happy, angry and sad. Results show that the spontaneous speech of middle-aged group achieved the highest accuracy rate followed by children age group and finally the young adults. While sad emotions are recognized satisfactorily across all age groups, confusions exist between happy and angry emotions. © 2017 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Jamil N.; Apandi F.; Hamzah R.
spellingShingle Jamil N.; Apandi F.; Hamzah R.
Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
author_facet Jamil N.; Apandi F.; Hamzah R.
author_sort Jamil N.; Apandi F.; Hamzah R.
title Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
title_short Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
title_full Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
title_fullStr Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
title_full_unstemmed Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
title_sort Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language
publishDate 2017
container_title 2017 9th International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2017
container_volume
container_issue
doi_str_mv 10.1109/SPED.2017.7990448
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85034250749&doi=10.1109%2fSPED.2017.7990448&partnerID=40&md5=56d38d09ea07d36c03aa8a4761ac8f2f
description Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources of an under-resourced language, this requirement poses a problem. In this paper, speech emotion recognition of spontaneous speech in Malay language using prosodic features and Random Forest classifier is presented. We also investigate the influence of age categorized as children, young adults and middle-aged on emotion recognition. Ninety spontaneous speech sentences from 30 native speakers of Malay language are collected and classified into three emotions, which are happy, angry and sad. Results show that the spontaneous speech of middle-aged group achieved the highest accuracy rate followed by children age group and finally the young adults. While sad emotions are recognized satisfactorily across all age groups, confusions exist between happy and angry emotions. © 2017 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677605684641792