Evaluation of dataset metamodel for describing the structure of datasets
Datasets basically contains data and metadata. Data are often misinterpreted due to insufficient metadata and gives rise to quality issues associated with the datasets such as failure to clearly identify the entity being measured and inability to clarify how the metrics were generated. We believe cr...
Published in: | Frontiers in Artificial Intelligence and Applications |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
IOS Press BV
2018
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031 |
id |
2-s2.0-85063395792 |
---|---|
spelling |
2-s2.0-85063395792 Rosli M.M. Evaluation of dataset metamodel for describing the structure of datasets 2018 Frontiers in Artificial Intelligence and Applications 303 10.3233/978-1-61499-900-3-650 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031 Datasets basically contains data and metadata. Data are often misinterpreted due to insufficient metadata and gives rise to quality issues associated with the datasets such as failure to clearly identify the entity being measured and inability to clarify how the metrics were generated. We believe creating common agreement about the terminology and concepts in datasets is important to ensure the meaning of data able to be interpreted correctly. We developed dataset metamodel that describes the structure and concepts in a dataset, and the relationships between each concept to gain a shared understanding of the content of datasets. As a preliminary evaluation, we conducted a user study to evaluate the effectiveness of dataset metamodel. We used an online survey as our user study method. The survey aims to study how well participants understand the definitions of dataset category elements in the dataset metamodel and able to apply them to a range of data sets. We found that participants who had relevant background knowledge and experience in research, particularly in analysing data sets able to answer more questions correctly than participants who had less relevant background knowledge and experience in research. The results of our survey provide evidence that our dataset metamodel is effective to be used by researchers to model datasets for analysis in software engineering. Future work, we need to reproduce the results with more appropriately sized samples of researchers in the relevant areas. © 2018 The authors and IOS Press. All rights reserved. IOS Press BV 9226389 English Conference paper |
author |
Rosli M.M. |
spellingShingle |
Rosli M.M. Evaluation of dataset metamodel for describing the structure of datasets |
author_facet |
Rosli M.M. |
author_sort |
Rosli M.M. |
title |
Evaluation of dataset metamodel for describing the structure of datasets |
title_short |
Evaluation of dataset metamodel for describing the structure of datasets |
title_full |
Evaluation of dataset metamodel for describing the structure of datasets |
title_fullStr |
Evaluation of dataset metamodel for describing the structure of datasets |
title_full_unstemmed |
Evaluation of dataset metamodel for describing the structure of datasets |
title_sort |
Evaluation of dataset metamodel for describing the structure of datasets |
publishDate |
2018 |
container_title |
Frontiers in Artificial Intelligence and Applications |
container_volume |
303 |
container_issue |
|
doi_str_mv |
10.3233/978-1-61499-900-3-650 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85063395792&doi=10.3233%2f978-1-61499-900-3-650&partnerID=40&md5=27c975ec8ae560bd82a716f365b06031 |
description |
Datasets basically contains data and metadata. Data are often misinterpreted due to insufficient metadata and gives rise to quality issues associated with the datasets such as failure to clearly identify the entity being measured and inability to clarify how the metrics were generated. We believe creating common agreement about the terminology and concepts in datasets is important to ensure the meaning of data able to be interpreted correctly. We developed dataset metamodel that describes the structure and concepts in a dataset, and the relationships between each concept to gain a shared understanding of the content of datasets. As a preliminary evaluation, we conducted a user study to evaluate the effectiveness of dataset metamodel. We used an online survey as our user study method. The survey aims to study how well participants understand the definitions of dataset category elements in the dataset metamodel and able to apply them to a range of data sets. We found that participants who had relevant background knowledge and experience in research, particularly in analysing data sets able to answer more questions correctly than participants who had less relevant background knowledge and experience in research. The results of our survey provide evidence that our dataset metamodel is effective to be used by researchers to model datasets for analysis in software engineering. Future work, we need to reproduce the results with more appropriately sized samples of researchers in the relevant areas. © 2018 The authors and IOS Press. All rights reserved. |
publisher |
IOS Press BV |
issn |
9226389 |
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677603729047552 |