MELex: The construction of Malay-English sentiment lexicon
Currently, sentiment analysis research in the Malaysian context is limited by the lack of available sentiment lexicons. This paper addresses that gap in order to improve the accuracy of sentiment analysis. In this study, a new lexicon for sentiment analysis is constructed. A d...
Published in: | Computers, Materials and Continua |
---|---|
Main Author: | Mahadzir N.H. |
Format: | Article |
Language: | English |
Published: | Tech Science Press 2022 |
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85118626345&doi=10.32604%2fcmc.2022.021131&partnerID=40&md5=dd752ed276ed5af643dc4385e6906a84 |
id |
2-s2.0-85118626345 |
---|---|
author |
Mahadzir N.H.; Omar M.F.; Nawi M.N.M.; Salameh A.A.; Hussin K.C.; Sohail A. |
title |
MELex: The construction of Malay-English sentiment lexicon |
publishDate |
2022 |
container_title |
Computers, Materials and Continua |
container_volume |
71 |
container_issue |
1 |
doi_str_mv |
10.32604/cmc.2022.021131 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85118626345&doi=10.32604%2fcmc.2022.021131&partnerID=40&md5=dd752ed276ed5af643dc4385e6906a84 |
description |
Currently, sentiment analysis research in the Malaysian context is limited by the lack of available sentiment lexicons. This paper addresses that gap in order to improve the accuracy of sentiment analysis. In this study, a new lexicon for sentiment analysis is constructed. A detailed review of existing approaches has been conducted, and a new bilingual sentiment lexicon known as MELex (Malay-English Lexicon) has been generated. Constructing MELex involves three activities: seed word selection, polarity assignment, and synonym expansion. Our approach differs from previous works in that MELex can analyze text in the two most widely used languages in Malaysia, Malay and English, and achieves an accuracy of 90%. It is evaluated through experimentation and a case study in which affordable housing projects in Malaysia are selected as case projects. This finding indicates that MELex is able to analyze public sentiment in the Malaysian context. The novel aspects of this paper are two-fold: first, it introduces a new technique for assigning the polarity score; second, it improves classification performance on mixed-language content. © 2022 Tech Science Press. All rights reserved. |
publisher |
Tech Science Press |
issn |
15462218 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678158265319424 |
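
The abstract names three construction activities: seed word selection, polarity assignment, and synonym expansion. The sketch below is one possible illustration of how such a pipeline can fit together; it is not the authors' method. The seed words, the synonym table, the `decay` factor, and the function names `expand_lexicon` and `classify` are all hypothetical, and the record does not describe MELex's actual polarity-scoring technique.

```python
from collections import deque

# Step 1: seed words with hand-assigned polarity.
# Hypothetical toy data, not taken from MELex.
SEEDS = {"good": 1.0, "bagus": 1.0, "bad": -1.0, "teruk": -1.0}

# Hypothetical synonym table mixing Malay and English entries.
# A real build would draw on a thesaurus or dictionary resource.
SYNONYMS = {
    "good": ["great", "baik"],
    "bagus": ["baik", "hebat"],
    "bad": ["poor", "buruk"],
    "teruk": ["buruk"],
}


def expand_lexicon(seeds, synonyms, decay=0.9, max_depth=2):
    """Steps 2 and 3: propagate seed polarity to synonyms breadth-first.

    Each hop away from a seed scales the inherited score by `decay`
    (an assumed placeholder scheme). A word keeps the first score it
    receives.
    """
    lexicon = dict(seeds)
    queue = deque((word, 0) for word in seeds)
    while queue:
        word, depth = queue.popleft()
        if depth >= max_depth:
            continue
        for syn in synonyms.get(word, []):
            if syn not in lexicon:
                lexicon[syn] = lexicon[word] * decay
                queue.append((syn, depth + 1))
    return lexicon


def classify(text, lexicon):
    """Sum the polarity of known tokens and map the total to a label."""
    score = sum(lexicon.get(tok, 0.0) for tok in text.lower().split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


LEXICON = expand_lexicon(SEEDS, SYNONYMS)
```

Because one lexicon holds both Malay and English entries, a call such as `classify("harga rumah ini buruk", LEXICON)` and a call such as `classify("the house is great", LEXICON)` go through the same lookup, which is the property that matters for mixed-language text. Whitespace tokenization is a simplification; real social media text would need normalization first.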