Construction of Malay abbreviation corpus based on social media data

This study describes a construction of Malay abbreviation corpus by extracting and normalizing selected social media data with multilayer filtration pattern matching technique along with statistical machine translation approach. In this study, one million Malay Lingo user-generated-posts via Twitter...

Full description

Bibliographic Details
Published in:Journal of Engineering and Applied Sciences
Main Author: Omar N.; Hamsani A.F.; Abdullah N.A.S.; Abidin S.Z.Z.
Format: Article
Language:English
Published: Medwell Journals 2017
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85017468132&doi=10.3923%2fjeasci.2017.468.474&partnerID=40&md5=5840d75936698a956bbcf8ca6fc858bb