Data science technology course: The design, assessment and computing environment perspectives

This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data scienc...

Full description

Bibliographic Details
Published in:Education and Information Technologies
Main Author: Ismail A.; Mutalib S.; Haron H.
Format: Article
Language:English
Published: Springer 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146757690&doi=10.1007%2fs10639-022-11558-8&partnerID=40&md5=df435f3eda10b322fd842bb316ebc2a4
id 2-s2.0-85146757690
spelling 2-s2.0-85146757690
Ismail A.; Mutalib S.; Haron H.
Data science technology course: The design, assessment and computing environment perspectives
2023
Education and Information Technologies
28
8
10.1007/s10639-022-11558-8
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146757690&doi=10.1007%2fs10639-022-11558-8&partnerID=40&md5=df435f3eda10b322fd842bb316ebc2a4
This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data science activities. We tackle the discussion about this course based on three main requirements, which are related to the need to exploit the key skills from two dimensions, namely, Data Science and Big Data, and the need for a cluster-based computing platform and its accessibility. We address these requirements by presenting the course design and its assessments, the configuration of the computing platform, and the strategy to enable flexible accessibility. In terms of course design, the offered course contributes to several innovative elements and has covered multiple key areas of the data science body of knowledge and multiple quadrants of the job and skills matrix. In the case of the computing platform, a stable deployment of a Hadoop cluster with flexible accessibility, triggered by the pandemic situation, has been established. Furthermore, through our experience with the implementation of the cluster, it has shown the ability of the cluster to handle computing problems with a larger dataset than the one used for the semesters within the scope of the study. We also provide some reflections and highlight future improvements. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
Springer
13602357
English
Article
All Open Access; Bronze Open Access; Green Open Access
author Ismail A.; Mutalib S.; Haron H.
spellingShingle Ismail A.; Mutalib S.; Haron H.
Data science technology course: The design, assessment and computing environment perspectives
author_facet Ismail A.; Mutalib S.; Haron H.
author_sort Ismail A.; Mutalib S.; Haron H.
title Data science technology course: The design, assessment and computing environment perspectives
title_short Data science technology course: The design, assessment and computing environment perspectives
title_full Data science technology course: The design, assessment and computing environment perspectives
title_fullStr Data science technology course: The design, assessment and computing environment perspectives
title_full_unstemmed Data science technology course: The design, assessment and computing environment perspectives
title_sort Data science technology course: The design, assessment and computing environment perspectives
publishDate 2023
container_title Education and Information Technologies
container_volume 28
container_issue 8
doi_str_mv 10.1007/s10639-022-11558-8
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146757690&doi=10.1007%2fs10639-022-11558-8&partnerID=40&md5=df435f3eda10b322fd842bb316ebc2a4
description This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data science activities. We tackle the discussion about this course based on three main requirements, which are related to the need to exploit the key skills from two dimensions, namely, Data Science and Big Data, and the need for a cluster-based computing platform and its accessibility. We address these requirements by presenting the course design and its assessments, the configuration of the computing platform, and the strategy to enable flexible accessibility. In terms of course design, the offered course contributes to several innovative elements and has covered multiple key areas of the data science body of knowledge and multiple quadrants of the job and skills matrix. In the case of the computing platform, a stable deployment of a Hadoop cluster with flexible accessibility, triggered by the pandemic situation, has been established. Furthermore, through our experience with the implementation of the cluster, it has shown the ability of the cluster to handle computing problems with a larger dataset than the one used for the semesters within the scope of the study. We also provide some reflections and highlight future improvements. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
publisher Springer
issn 13602357
language English
format Article
accesstype All Open Access; Bronze Open Access; Green Open Access
record_format scopus
collection Scopus
_version_ 1809677582425128960