Driver emotion profiling from speech

Humans sense, perceive, and convey emotion differently from each other due to physical, psychological, environmental, cultural, and language differences. For example, as recognized and studied by psychologists more than a century, it is easier for someone of the same culture to judge and recognize e...

Full description

Bibliographic Details
Published in:Digital Signal Processing for In-Vehicle Systems and Safety
Main Author: Kamaruddin N.; Wahab A.; Abut H.
Format: Conference paper
Language:English
Published: 2012
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-84897723074&doi=10.1007%2f978-1-4419-9607-7_2&partnerID=40&md5=2c34e50aebbb46e4c3222bc27d28c12f
id 2-s2.0-84897723074
spelling 2-s2.0-84897723074
Kamaruddin N.; Wahab A.; Abut H.
Driver emotion profiling from speech
2012
Digital Signal Processing for In-Vehicle Systems and Safety


10.1007/978-1-4419-9607-7_2
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84897723074&doi=10.1007%2f978-1-4419-9607-7_2&partnerID=40&md5=2c34e50aebbb46e4c3222bc27d28c12f
Humans sense, perceive, and convey emotion differently from each other due to physical, psychological, environmental, cultural, and language differences. For example, as recognized and studied by psychologists more than a century, it is easier for someone of the same culture to judge and recognize emotion correctly compared to those from different culture. In this chapter, we attempt to study the speech emotion recognition problem by using two speech corpora from the Berlin dataset and the NAW datasets. We have investigated the universality as well as diversity of two different cultural speech datasets recorded by German and American speakers, respectively. Experiments were conducted for identifying three basic emotions, namely, angry, sad, and happy with neutral as emotionless state from these datasets. MFCC coefficients were used as feature sets in the experiments, and MLP was employed as classifiers to compare the performance of these datasets. In addition, real-time recorded speech from drivers was also tested to see the performance in a vehicular setting. Finally, speech emotion profiling approach was introduced to explore the universality and diversity of the speech emotion features. © Springer Science+Business Media, LLC 2012.


English
Conference paper

author Kamaruddin N.; Wahab A.; Abut H.
spellingShingle Kamaruddin N.; Wahab A.; Abut H.
Driver emotion profiling from speech
author_facet Kamaruddin N.; Wahab A.; Abut H.
author_sort Kamaruddin N.; Wahab A.; Abut H.
title Driver emotion profiling from speech
title_short Driver emotion profiling from speech
title_full Driver emotion profiling from speech
title_fullStr Driver emotion profiling from speech
title_full_unstemmed Driver emotion profiling from speech
title_sort Driver emotion profiling from speech
publishDate 2012
container_title Digital Signal Processing for In-Vehicle Systems and Safety
container_volume
container_issue
doi_str_mv 10.1007/978-1-4419-9607-7_2
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-84897723074&doi=10.1007%2f978-1-4419-9607-7_2&partnerID=40&md5=2c34e50aebbb46e4c3222bc27d28c12f
description Humans sense, perceive, and convey emotion differently from each other due to physical, psychological, environmental, cultural, and language differences. For example, as recognized and studied by psychologists more than a century, it is easier for someone of the same culture to judge and recognize emotion correctly compared to those from different culture. In this chapter, we attempt to study the speech emotion recognition problem by using two speech corpora from the Berlin dataset and the NAW datasets. We have investigated the universality as well as diversity of two different cultural speech datasets recorded by German and American speakers, respectively. Experiments were conducted for identifying three basic emotions, namely, angry, sad, and happy with neutral as emotionless state from these datasets. MFCC coefficients were used as feature sets in the experiments, and MLP was employed as classifiers to compare the performance of these datasets. In addition, real-time recorded speech from drivers was also tested to see the performance in a vehicular setting. Finally, speech emotion profiling approach was introduced to explore the universality and diversity of the speech emotion features. © Springer Science+Business Media, LLC 2012.
publisher
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677913763610624