Detection of malay phrase breaks using energy and duration

A simpler approach to identify and classify phrase breaks in prosodic phrasing using energy patterns and duration is useful in speech segmentation. Prosodic phrasing is useful to segment lengthy spontaneous speech into smaller meaningful utterance without analysis of linguistic information. We propo...

Full description

Bibliographic Details
Published in:International Journal of Simulation: Systems, Science and Technology
Main Author: Mohamed Hanum H.; Abu Bakar Z.
Format: Article
Language:English
Published: UK Simulation Society 2016
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85006056588&doi=10.5013%2fIJSSST.a.17.32.26&partnerID=40&md5=964fff492a5f38135a59d497365dd8d7
id 2-s2.0-85006056588
spelling 2-s2.0-85006056588
Mohamed Hanum H.; Abu Bakar Z.
Detection of malay phrase breaks using energy and duration
2016
International Journal of Simulation: Systems, Science and Technology
17
32
10.5013/IJSSST.a.17.32.26
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85006056588&doi=10.5013%2fIJSSST.a.17.32.26&partnerID=40&md5=964fff492a5f38135a59d497365dd8d7
A simpler approach to identify and classify phrase breaks in prosodic phrasing using energy patterns and duration is useful in speech segmentation. Prosodic phrasing is useful to segment lengthy spontaneous speech into smaller meaningful utterance without analysis of linguistic information. We propose a listening test that allow trained listener to classify the boundaries as minor or major breaks. This cheaper and faster approach is proven useful for under-resource language such as Malay which do not have comprehensive prosodic-annotated corpus. Word-related energy and duration features are extracted from the targeted phrase breaks. The training feature set is developed from evaluation of targeted phrase break from 100 sentences evaluated in the listening test. Evaluation of the features with RBF, MLP and logistics models reveal best detection accuracy of 80.6% which is comparable to existing context-based algorithm. Instead of labeling the phrase break using linguistic and phonetic meaning, the proposed listening test allows labeling of phrase break as perceived by listener. In addition, the results can be use as preliminary information for evaluation of boundary salience at the targeted boundary locations. © 2016, UK Simulation Society. All rights reserved.
UK Simulation Society
14738031
English
Article

author Mohamed Hanum H.; Abu Bakar Z.
spellingShingle Mohamed Hanum H.; Abu Bakar Z.
Detection of malay phrase breaks using energy and duration
author_facet Mohamed Hanum H.; Abu Bakar Z.
author_sort Mohamed Hanum H.; Abu Bakar Z.
title Detection of malay phrase breaks using energy and duration
title_short Detection of malay phrase breaks using energy and duration
title_full Detection of malay phrase breaks using energy and duration
title_fullStr Detection of malay phrase breaks using energy and duration
title_full_unstemmed Detection of malay phrase breaks using energy and duration
title_sort Detection of malay phrase breaks using energy and duration
publishDate 2016
container_title International Journal of Simulation: Systems, Science and Technology
container_volume 17
container_issue 32
doi_str_mv 10.5013/IJSSST.a.17.32.26
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85006056588&doi=10.5013%2fIJSSST.a.17.32.26&partnerID=40&md5=964fff492a5f38135a59d497365dd8d7
description A simpler approach to identify and classify phrase breaks in prosodic phrasing using energy patterns and duration is useful in speech segmentation. Prosodic phrasing is useful to segment lengthy spontaneous speech into smaller meaningful utterance without analysis of linguistic information. We propose a listening test that allow trained listener to classify the boundaries as minor or major breaks. This cheaper and faster approach is proven useful for under-resource language such as Malay which do not have comprehensive prosodic-annotated corpus. Word-related energy and duration features are extracted from the targeted phrase breaks. The training feature set is developed from evaluation of targeted phrase break from 100 sentences evaluated in the listening test. Evaluation of the features with RBF, MLP and logistics models reveal best detection accuracy of 80.6% which is comparable to existing context-based algorithm. Instead of labeling the phrase break using linguistic and phonetic meaning, the proposed listening test allows labeling of phrase break as perceived by listener. In addition, the results can be use as preliminary information for evaluation of boundary salience at the targeted boundary locations. © 2016, UK Simulation Society. All rights reserved.
publisher UK Simulation Society
issn 14738031
language English
format Article
accesstype
record_format scopus
collection Scopus
_version_ 1820775474666668032