Evaluation of dataset metamodel for describing the structure of datasets

Datasets essentially contain data and metadata. Data are often misinterpreted because of insufficient metadata, which gives rise to quality issues such as failure to clearly identify the entity being measured and inability to clarify how the metrics were generated. We believe cr...

Bibliographic Details
Published in: Frontiers in Artificial Intelligence and Applications
Main Author: Rosli M.M.
Format: Conference paper
Language: English
Published: IOS Press BV 2018
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031
id 2-s2.0-85063395792
spelling 2-s2.0-85063395792
Rosli M.M.
Evaluation of dataset metamodel for describing the structure of datasets
2018
Frontiers in Artificial Intelligence and Applications
303

10.3233/978-1-61499-900-3-650
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031
Datasets essentially contain data and metadata. Data are often misinterpreted because of insufficient metadata, which gives rise to quality issues such as failure to clearly identify the entity being measured and inability to clarify how the metrics were generated. We believe that creating common agreement about the terminology and concepts in datasets is important to ensure that the meaning of data can be interpreted correctly. We developed a dataset metamodel that describes the structure and concepts in a dataset, and the relationships between those concepts, to gain a shared understanding of the content of datasets. As a preliminary evaluation, we conducted a user study, in the form of an online survey, to evaluate the effectiveness of the dataset metamodel. The survey examined how well participants understood the definitions of dataset category elements in the dataset metamodel and were able to apply them to a range of datasets. We found that participants who had relevant background knowledge and experience in research, particularly in analysing datasets, were able to answer more questions correctly than participants who had less relevant background knowledge and experience in research. The results of our survey provide evidence that researchers can use our dataset metamodel effectively to model datasets for analysis in software engineering. In future work, we need to reproduce the results with more appropriately sized samples of researchers in the relevant areas. © 2018 The authors and IOS Press. All rights reserved.
IOS Press BV
0922-6389
English
Conference paper

author Rosli M.M.
spellingShingle Rosli M.M.
Evaluation of dataset metamodel for describing the structure of datasets
author_facet Rosli M.M.
author_sort Rosli M.M.
title Evaluation of dataset metamodel for describing the structure of datasets
publishDate 2018
container_title Frontiers in Artificial Intelligence and Applications
container_volume 303
doi_str_mv 10.3233/978-1-61499-900-3-650
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031
publisher IOS Press BV
issn 0922-6389
language English
format Conference paper
record_format scopus
collection Scopus