Construction of Malay abbreviation corpus based on social media data
This study describes a construction of Malay abbreviation corpus by extracting and normalizing selected social media data with multilayer filtration pattern matching technique along with statistical machine translation approach. In this study, one million Malay Lingo user-generated-posts via Twitter...
Published in: | Journal of Engineering and Applied Sciences |
---|---|
Main Author: | Omar N.; Hamsani A.F.; Abdullah N.A.S.; Abidin S.Z.Z. |
Format: | Article |
Language: | English |
Published: |
Medwell Journals
2017
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85017468132&doi=10.3923%2fjeasci.2017.468.474&partnerID=40&md5=5840d75936698a956bbcf8ca6fc858bb |
Similar Items
-
Chinese Monolingual Sentiment Classifier for Social Media Data Using Corpus-Based Approach
by: Sia Abdullah N.A.; Low Cheng Cheng S.C.; Rosli M.M.
Published: (2024) -
MyDAS Corpus: Malay Social Media Texts for Detecting Depression, Anxiety, and Stress on Facebook
by: Ahmad Z.; Mohamed A.; Conway M.; Zakaria R.; Teo N.H.I.; Maskat R.
Published: (2024) -
Constructing a Data-Driven Model of English Language Teaching with a Multidimensional Corpus
by: Chen D.
Published: (2022) -
The impact of social media advertising features on the purchase intention of the Malay millennial consumer
by: Agil H.; Ahmad A.L.; Azlan A.A.
Published: (2022) -
Lexical Verbs in Verb-Noun Collocations: Empirical Evidence from a Malay ESL Learner Corpus
by: Abdullah S.; Aziz R.A.; Kamaruddin R.
Published: (2021)