Partial Robust M-Regression Estimator in the Presence of Multicollinearity and Vertical Outliers

The objective of using regression is to explain the variation in one or more response variables by associating the variation with proportional variation in one or more explanatory variables. However, if the number of independent variables is multiple, they tend to be highly collinear and this contri...

Full description

Bibliographic Details
Published in:Journal of Physics: Conference Series
Main Author: Noh N.H.M.; Moktar B.; Yusoff S.; Majid M.N.A.
Format: Conference paper
Language:English
Published: Institute of Physics Publishing 2020
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85087980462&doi=10.1088%2f1742-6596%2f1529%2f2%2f022042&partnerID=40&md5=fbb4969cd7a750019e8467f7e236b1c7
Description
Summary:The objective of using regression is to explain the variation in one or more response variables by associating the variation with proportional variation in one or more explanatory variables. However, if the number of independent variables is multiple, they tend to be highly collinear and this contributes to multicollinearity problem. For instance, Ridge Regression (RR), Principal Component Regression (PCR), and Partial Least Squares Regression (PLSR) are some of the prediction methods used to handle dataset with multicollinearity. In addition, another problem that arises is the existence of outlying objects in a dataset. The effect of outlying data points in the presence of multicollinearity problem could be reduced with the implementation of robust regression method. A recently studied robust PLSR, which is called Partial Robust M-Regression (PRM) is found to be able in dealing with multicollinearity and outliers simultaneously. This method was employed in this study. Throughout this study, five methods of regression were chosen; OLS, RR, PCR, PLSR, and PRM, to compare which is the best method in their predictive ability. To compare these five regression methods, a simulation study had been conducted, and the mean square error of β estimate (MSE(β)) was calculated. The simulation results show that PRM outperforms other method in the presence of multicollinearity, outliers and leverage points. © 2020 IOP Publishing Ltd. All rights reserved.
ISSN:17426588
DOI:10.1088/1742-6596/1529/2/022042