PMLAP: a methodology for annotating SSML elements into HTML5


Bibliographic Details
Published in: Journal of Supercomputing
Main Authors: Johari A.; Ismail A.
Format: Article
Language: English
Published: Springer 2024
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85173523177&doi=10.1007%2fs11227-023-05673-7&partnerID=40&md5=44f13bd6d19dc235c733f5090b318f96
id 2-s2.0-85173523177
The flexibility to listen to content is critical for an improved understanding of web information. The Speech Synthesis Markup Language (SSML) provides a link between the speech engine and HTML-formatted online content; nevertheless, it introduces the risks of inconsistency between SSML and HTML5 content and of incorrect file linking between them. Because HTML5 is the container, annotating the SSML elements into the HTML5 is the logical solution to these challenges. Thus, in this paper, we propose a methodology called Position-based Markup Language Annotation Process (PMLAP) that aims to (1) streamline the annotation process of applying SSML elements at a specific position in the HTML5 elements through custom data attributes (the data-* attributes) and (2) offer web developers a gentle learning curve for grasping the annotation process. The methodology consists of four distinct steps that produce an annotated HTML5 document. This output can then be used as the input to the PMLAP transcoder, which extracts the relevant information and generates the respective SSML document automatically. We therefore also present the design of the transcoder, which has been implemented in JavaScript. We then illustrate the applicability of PMLAP using a running example and validate the SSML generated by the transcoder using two available tools: ExtendClass Text Compare, a text comparison tool, to check that the generated string is well formed, and AWS Polly TTS, a text-to-speech web service, to check the correctness of the generated speech. The validation results indicate that the transcoder is a viable means of achieving the research goal. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
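The general idea described in the abstract (SSML carried inside HTML5 through data-* attributes, then extracted by a transcoder) can be illustrated with a minimal sketch. This is not the authors' four-step methodology or their JavaScript transcoder: the attribute names `data-ssml-emphasis` and `data-ssml-break`, the class `SsmlSketch`, and the function `to_ssml` are all hypothetical, and Python is used here only for illustration.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr


class SsmlSketch(HTMLParser):
    """Collects SSML fragments from hypothetical data-ssml-* attributes."""

    def __init__(self):
        super().__init__()
        self.parts = []  # SSML fragments in document order
        self.stack = []  # (tag, closing fragment) for each open HTML element

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        closing = ""
        # Hypothetical attribute: wrap the element's text in <emphasis>.
        level = attrs.get("data-ssml-emphasis")
        if level:
            self.parts.append(f"<emphasis level={quoteattr(level)}>")
            closing += "</emphasis>"
        # Hypothetical attribute: add a pause after the element's text.
        pause = attrs.get("data-ssml-break")
        if pause:
            closing += f"<break time={quoteattr(pause)}/>"
        self.stack.append((tag, closing))

    def handle_endtag(self, tag):
        # Ignore stray end tags that match no open element.
        if tag not in [open_tag for open_tag, _ in self.stack]:
            return
        # Pop back to the matching element, emitting pending closers.
        while self.stack:
            open_tag, closing = self.stack.pop()
            self.parts.append(closing)
            if open_tag == tag:
                break

    def handle_data(self, data):
        # Text is escaped so the generated SSML stays well-formed XML.
        self.parts.append(escape(data))


def to_ssml(html: str) -> str:
    """Return an SSML string built from the annotated HTML fragment."""
    parser = SsmlSketch()
    parser.feed(html)
    parser.close()
    body = " ".join("".join(parser.parts).split())  # collapse whitespace
    return f"<speak>{body}</speak>"


if __name__ == "__main__":
    annotated = (
        '<p data-ssml-break="500ms">Hello '
        '<span data-ssml-emphasis="strong">world</span>.</p>'
    )
    print(to_ssml(annotated))
```

Because the speech markup lives in the same file as the visible text, there is no separate SSML document to drift out of sync or to link incorrectly, which is the motivation the abstract gives for annotating HTML5 directly.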

author Johari A.; Ismail A.
title PMLAP: a methodology for annotating SSML elements into HTML5
publishDate 2024
container_title Journal of Supercomputing
container_volume 80
container_issue 4
doi_str_mv 10.1007/s11227-023-05673-7
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85173523177&doi=10.1007%2fs11227-023-05673-7&partnerID=40&md5=44f13bd6d19dc235c733f5090b318f96
publisher Springer
issn 0920-8542
language English
format Article
record_format scopus
collection Scopus