Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers

The objective of using regression is to explain the variation in one or more response variables by associating the variation with proportional variation in one or more explanatory variables. However, if the number of independent variables is multiple, they tend to be highly collinear and this contri...

Full description

Bibliographic Details
Published in:Journal of Physics: Conference Series
Main Author: Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
Format: Conference paper
Language:English
Published: Institute of Physics Publishing 2020
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85087980462&doi=10.1088%2f1742-6596%2f1529%2f2%2f022042&partnerID=40&md5=fbb4969cd7a750019e8467f7e236b1c7
id 2-s2.0-85087980462
spelling 2-s2.0-85087980462
Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
2020
Journal of Physics: Conference Series
1529
2
10.1088/1742-6596/1529/2/022042
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85087980462&doi=10.1088%2f1742-6596%2f1529%2f2%2f022042&partnerID=40&md5=fbb4969cd7a750019e8467f7e236b1c7
The objective of using regression is to explain the variation in one or more response variables by associating the variation with proportional variation in one or more explanatory variables. However, if the number of independent variables is multiple, they tend to be highly collinear and this contributes to multicollinearity problem. For instance, Ridge Regression (RR), Principal Component Regression (PCR), and Partial Least Squares Regression (PLSR) are some of the prediction methods used to handle dataset with multicollinearity. In addition, another problem that arises is the existence of outlying objects in a dataset. The effect of outlying data points in the presence of multicollinearity problem could be reduced with the implementation of robust regression method. A recently studied robust PLSR, which is called Partial Robust M-Regression (PRM) is found to be able in dealing with multicollinearity and outliers simultaneously. This method was employed in this study. Throughout this study, five methods of regression were chosen; OLS, RR, PCR, PLSR, and PRM, to compare which is the best method in their predictive ability. To compare these five regression methods, a simulation study had been conducted, and the mean square error of β estimate (MSE(β)) was calculated. The simulation results show that PRM outperforms other method in the presence of multicollinearity, outliers and leverage points. © 2020 IOP Publishing Ltd. All rights reserved.
Institute of Physics Publishing
17426588
English
Conference paper
All Open Access; Gold Open Access
author Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
spellingShingle Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
author_facet Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
author_sort Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
title Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
title_short Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
title_full Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
title_fullStr Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
title_full_unstemmed Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
title_sort Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers
publishDate 2020
container_title Journal of Physics: Conference Series
container_volume 1529
container_issue 2
doi_str_mv 10.1088/1742-6596/1529/2/022042
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85087980462&doi=10.1088%2f1742-6596%2f1529%2f2%2f022042&partnerID=40&md5=fbb4969cd7a750019e8467f7e236b1c7
description The objective of using regression is to explain the variation in one or more response variables by associating the variation with proportional variation in one or more explanatory variables. However, if the number of independent variables is multiple, they tend to be highly collinear and this contributes to multicollinearity problem. For instance, Ridge Regression (RR), Principal Component Regression (PCR), and Partial Least Squares Regression (PLSR) are some of the prediction methods used to handle dataset with multicollinearity. In addition, another problem that arises is the existence of outlying objects in a dataset. The effect of outlying data points in the presence of multicollinearity problem could be reduced with the implementation of robust regression method. A recently studied robust PLSR, which is called Partial Robust M-Regression (PRM) is found to be able in dealing with multicollinearity and outliers simultaneously. This method was employed in this study. Throughout this study, five methods of regression were chosen; OLS, RR, PCR, PLSR, and PRM, to compare which is the best method in their predictive ability. To compare these five regression methods, a simulation study had been conducted, and the mean square error of β estimate (MSE(β)) was calculated. The simulation results show that PRM outperforms other method in the presence of multicollinearity, outliers and leverage points. © 2020 IOP Publishing Ltd. All rights reserved.
publisher Institute of Physics Publishing
issn 17426588
language English
format Conference paper
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809677896336277504