Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation

Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to va...

Full description

Bibliographic Details
Published in:Engineering, Technology and Applied Science Research
Main Author: Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
Format: Article
Language:English
Published: Dr D. Pylarinos 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96
id 2-s2.0-85211465762
spelling 2-s2.0-85211465762
Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
2024
Engineering, Technology and Applied Science Research
14
6
10.48084/etasr.8909
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96
Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to validate the effectiveness of the Multiple Imputation (MI) method used to address missing data issues. A comparison was performed between the results obtained from the multiple-imputed data and those derived from the Complete Case (CC) data, using a diabetes dataset from Hospital Besar Alor Setar. Prior to the application of MI and model building, k-fold cross-validation was employed to partition the dataset, resulting in 90% of the data lacking complete covariates for training and 10% of the data comprising complete covariates for testing. Subsequently, MI was applied to the 90% training dataset. Model M115, derived from the multiple-imputed data, was identified as the optimal model for MR. In the model averaging approach, two models were identified as optimal: Model 1 (without interaction variables) and Model 2 (with interaction variables). The first one, exhibited the lowest values of Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). These results indicate that model averaging, specifically Model 1, is the superior model-building approach for this study, demonstrating improved performance compared to MR and validating the effectiveness of the MI method. © by the authors.
Dr D. Pylarinos
22414487
English
Article
All Open Access; Gold Open Access
author Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
spellingShingle Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
author_facet Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
author_sort Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
title Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_short Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_full Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_fullStr Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_full_unstemmed Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_sort Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
publishDate 2024
container_title Engineering, Technology and Applied Science Research
container_volume 14
container_issue 6
doi_str_mv 10.48084/etasr.8909
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96
description Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to validate the effectiveness of the Multiple Imputation (MI) method used to address missing data issues. A comparison was performed between the results obtained from the multiple-imputed data and those derived from the Complete Case (CC) data, using a diabetes dataset from Hospital Besar Alor Setar. Prior to the application of MI and model building, k-fold cross-validation was employed to partition the dataset, resulting in 90% of the data lacking complete covariates for training and 10% of the data comprising complete covariates for testing. Subsequently, MI was applied to the 90% training dataset. Model M115, derived from the multiple-imputed data, was identified as the optimal model for MR. In the model averaging approach, two models were identified as optimal: Model 1 (without interaction variables) and Model 2 (with interaction variables). The first one, exhibited the lowest values of Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). These results indicate that model averaging, specifically Model 1, is the superior model-building approach for this study, demonstrating improved performance compared to MR and validating the effectiveness of the MI method. © by the authors.
publisher Dr D. Pylarinos
issn 22414487
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1820775428656201728