A bi-annotated Malay-English code-switching (Manglish) dataset of X posts for biological gender identification and authorship attribution
Low -resource languages, like Malay, face the threat of extinction when linguistic resources become scarce. This paper addresses the scarcity issue by contributing to the inventory of low -resource languages, specifically focusing on Malay -English, known as Manglish. Manglish speakers are primarily...
Published in: | DATA IN BRIEF |
---|---|
Main Authors: | Maskat, Ruhaila; Azman, Norazmiera Ayunie; Nulizairos, Nur Shaheera Shastera; Zahidin, Nurul Athirah; Mahadi, Adibah Humairah; Norshamsul, Siti Rubaya; Sharif, Mohd Mukhlis Mohd; Mahdin, Hairulnizam |
Format: | Article; Data Paper; Early Access |
Language: | English |
Published: |
ELSEVIER
2024
|
Subjects: | |
Online Access: | https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001157110000001 |
Similar Items
-
A bi-annotated Malay-English code-switching (Manglish) dataset of X posts for biological gender identification and authorship attribution
by: Maskat R.; Azman N.A.; Nulizairos N.S.S.; Zahidin N.A.; Mahadi A.H.; Norshamsul S.R.; Sharif M.M.M.; Mahdin H.
Published: (2024) -
The research's knowledge transfer through co-authorship collaboration
by: Rahman S.A.; Noordin S.A.; Rahmad F.; Mohamed A.N.; Abdullah H.; Salleh A.A.
Published: (2017) -
‘Know know married’: playfulness of Manglish in social media platforms
by: Serip Mohamad N.H.; Shafie H.
Published: (2024) -
Attire-based photo annotation
by: Jamil N.; Sa'dan S.A.; Narawi A.; Gobil A.R.
Published: (2014) -
Code-Switching and Code-Mixing in the Practice of Judgement Writing in Malaysia
by: Md Zolkapli R.B.; Mohamad H.A.; Mohaini M.L.; Wahab N.H.A.; Nath P.R.
Published: (2022)