Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation

Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to va...

Full description

Bibliographic Details
Published in:	Engineering, Technology and Applied Science Research
Main Author:	Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
Format:	Article
Language:	English
Published:	Dr D. Pylarinos 2024
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96

id	2-s2.0-85211465762
spelling	2-s2.0-85211465762 Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R. Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation 2024 Engineering, Technology and Applied Science Research 14 6 10.48084/etasr.8909 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96 Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to validate the effectiveness of the Multiple Imputation (MI) method used to address missing data issues. A comparison was performed between the results obtained from the multiple-imputed data and those derived from the Complete Case (CC) data, using a diabetes dataset from Hospital Besar Alor Setar. Prior to the application of MI and model building, k-fold cross-validation was employed to partition the dataset, resulting in 90% of the data lacking complete covariates for training and 10% of the data comprising complete covariates for testing. Subsequently, MI was applied to the 90% training dataset. Model M115, derived from the multiple-imputed data, was identified as the optimal model for MR. In the model averaging approach, two models were identified as optimal: Model 1 (without interaction variables) and Model 2 (with interaction variables). The first one, exhibited the lowest values of Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). These results indicate that model averaging, specifically Model 1, is the superior model-building approach for this study, demonstrating improved performance compared to MR and validating the effectiveness of the MI method. © by the authors. Dr D. Pylarinos 22414487 English Article All Open Access; Gold Open Access
author	Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
spellingShingle	Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R. Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
author_facet	Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
author_sort	Abdullah M.A.A.; Jessintha L.; Khuneswari G.P.; Jamil S.A.M.; Olaniran O.R.
title	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_short	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_full	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_fullStr	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_full_unstemmed	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
title_sort	Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation
publishDate	2024
container_title	Engineering, Technology and Applied Science Research
container_volume	14
container_issue	6
doi_str_mv	10.48084/etasr.8909
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85211465762&doi=10.48084%2fetasr.8909&partnerID=40&md5=5d6d424469343797eb04c458990d1c96
description	Model construction is of significant importance for the extraction of information from datasets and the prediction of responses based on predictor variables. The objective of this study is to compare the Multiple Regression (MR) and model averaging approaches in the context of missing data and to validate the effectiveness of the Multiple Imputation (MI) method used to address missing data issues. A comparison was performed between the results obtained from the multiple-imputed data and those derived from the Complete Case (CC) data, using a diabetes dataset from Hospital Besar Alor Setar. Prior to the application of MI and model building, k-fold cross-validation was employed to partition the dataset, resulting in 90% of the data lacking complete covariates for training and 10% of the data comprising complete covariates for testing. Subsequently, MI was applied to the 90% training dataset. Model M115, derived from the multiple-imputed data, was identified as the optimal model for MR. In the model averaging approach, two models were identified as optimal: Model 1 (without interaction variables) and Model 2 (with interaction variables). The first one, exhibited the lowest values of Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). These results indicate that model averaging, specifically Model 1, is the superior model-building approach for this study, demonstrating improved performance compared to MR and validating the effectiveness of the MI method. © by the authors.
publisher	Dr D. Pylarinos
issn	22414487
language	English
format	Article
accesstype	All Open Access; Gold Open Access
record_format	scopus
collection	Scopus
_version_	1820775428656201728

Comparison of Multiple Regression and Model Averaging Model-Building Approach for Missing Data with Multiple Imputation

Similar Items