The development of malaysian corpus of financial english (MaCFE)

This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE); a specialized corpus containing a wide range of online/internet documents (i.e. communiqué) from various financial institutions in Malaysia. It describes in detail the proce...

Full description

Bibliographic Details
Published in:GEMA Online Journal of Language Studies
Main Author: Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D.
Format: Article
Language:English
Published: Universiti Kebangsaan Malaysia Press 2018
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052728020&doi=10.17576%2fgema-2018-1803-05&partnerID=40&md5=1fd305f2aef09ed9a9bd5e2657f3e79d
Description
Summary:This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE); a specialized corpus containing a wide range of online/internet documents (i.e. communiqué) from various financial institutions in Malaysia. It describes in detail the processes involved in the collection and selection of data and preprocessing of raw data, which includes data digitizing, cleansing and tagging. This paper also introduces the user interface for MaCFE with its built-in linguistic analysis features. MaCFE was designed and developed with the intention of providing corpus linguistic researchers with the avenue to explore the field and for ESP/EAP practitioners in Malaysia, as the resources for the development of local-based ESP/EAP curriculum and teaching and learning materials. It would also serve as a learning avenue for future financial professionals in their training. MaCFE corpus has approximately 4.3 million words from 1472 electronic documents retrieved from banks and financial institutions’ official websites. At present, users can make queries to the MaCFE database using its built-in concordancer. In the future, its language-data-processing facilities will be expanded to include tools for keyword, wordlist and word collocations queries. © 2018, Universiti Kebangsaan Malaysia Press. All rights reserved.
ISSN:16758021
DOI:10.17576/gema-2018-1803-05