The development of malaysian corpus of financial english (MaCFE)
This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE); a specialized corpus containing a wide range of online/internet documents (i.e. communiqué) from various financial institutions in Malaysia. It describes in detail the proce...
Published in: | GEMA Online Journal of Language Studies |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Universiti Kebangsaan Malaysia Press
2018
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052728020&doi=10.17576%2fgema-2018-1803-05&partnerID=40&md5=1fd305f2aef09ed9a9bd5e2657f3e79d |
id |
2-s2.0-85052728020 |
---|---|
spelling |
2-s2.0-85052728020 Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D. The development of malaysian corpus of financial english (MaCFE) 2018 GEMA Online Journal of Language Studies 18 3 10.17576/gema-2018-1803-05 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052728020&doi=10.17576%2fgema-2018-1803-05&partnerID=40&md5=1fd305f2aef09ed9a9bd5e2657f3e79d This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE); a specialized corpus containing a wide range of online/internet documents (i.e. communiqué) from various financial institutions in Malaysia. It describes in detail the processes involved in the collection and selection of data and preprocessing of raw data, which includes data digitizing, cleansing and tagging. This paper also introduces the user interface for MaCFE with its built-in linguistic analysis features. MaCFE was designed and developed with the intention of providing corpus linguistic researchers with the avenue to explore the field and for ESP/EAP practitioners in Malaysia, as the resources for the development of local-based ESP/EAP curriculum and teaching and learning materials. It would also serve as a learning avenue for future financial professionals in their training. MaCFE corpus has approximately 4.3 million words from 1472 electronic documents retrieved from banks and financial institutions’ official websites. At present, users can make queries to the MaCFE database using its built-in concordancer. In the future, its language-data-processing facilities will be expanded to include tools for keyword, wordlist and word collocations queries. © 2018, Universiti Kebangsaan Malaysia Press. All rights reserved. Universiti Kebangsaan Malaysia Press 16758021 English Article All Open Access; Gold Open Access; Green Open Access |
author |
Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D. |
spellingShingle |
Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D. The development of malaysian corpus of financial english (MaCFE) |
author_facet |
Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D. |
author_sort |
Sadjirin R.; Aziz R.A.; Nordin N.M.; Ismail M.R.; Baharum N.D. |
title |
The development of malaysian corpus of financial english (MaCFE) |
title_short |
The development of malaysian corpus of financial english (MaCFE) |
title_full |
The development of malaysian corpus of financial english (MaCFE) |
title_fullStr |
The development of malaysian corpus of financial english (MaCFE) |
title_full_unstemmed |
The development of malaysian corpus of financial english (MaCFE) |
title_sort |
The development of malaysian corpus of financial english (MaCFE) |
publishDate |
2018 |
container_title |
GEMA Online Journal of Language Studies |
container_volume |
18 |
container_issue |
3 |
doi_str_mv |
10.17576/gema-2018-1803-05 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052728020&doi=10.17576%2fgema-2018-1803-05&partnerID=40&md5=1fd305f2aef09ed9a9bd5e2657f3e79d |
description |
This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE); a specialized corpus containing a wide range of online/internet documents (i.e. communiqué) from various financial institutions in Malaysia. It describes in detail the processes involved in the collection and selection of data and preprocessing of raw data, which includes data digitizing, cleansing and tagging. This paper also introduces the user interface for MaCFE with its built-in linguistic analysis features. MaCFE was designed and developed with the intention of providing corpus linguistic researchers with the avenue to explore the field and for ESP/EAP practitioners in Malaysia, as the resources for the development of local-based ESP/EAP curriculum and teaching and learning materials. It would also serve as a learning avenue for future financial professionals in their training. MaCFE corpus has approximately 4.3 million words from 1472 electronic documents retrieved from banks and financial institutions’ official websites. At present, users can make queries to the MaCFE database using its built-in concordancer. In the future, its language-data-processing facilities will be expanded to include tools for keyword, wordlist and word collocations queries. © 2018, Universiti Kebangsaan Malaysia Press. All rights reserved. |
publisher |
Universiti Kebangsaan Malaysia Press |
issn |
16758021 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access; Green Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1823296162337128448 |