A Comparative Study between Ridge MM and Ridge Least Trimmed Squares Estimators in Handling Multicollinearity and Outliers

Common problems found in multiple linear regression models are the existence of multicollinearity and outliers. These obstacles usually produce undesirable effects on least squares estimators. Ridge regression estimator is suggested in handling severe multicollinearity while robust estimators such a...

Full description

Bibliographic Details
Published in:Journal of Physics: Conference Series
Main Author: Affindi A.N.; Ahmad S.; Mohamad M.
Format: Conference paper
Language:English
Published: Institute of Physics Publishing 2019
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85076096737&doi=10.1088%2f1742-6596%2f1366%2f1%2f012113&partnerID=40&md5=7f12e416a5d600b755b2aab572574caf
Description
Summary:Common problems found in multiple linear regression models are the existence of multicollinearity and outliers. These obstacles usually produce undesirable effects on least squares estimators. Ridge regression estimator is suggested in handling severe multicollinearity while robust estimators such as MM estimator and Least Trimmed Squares (LTS) estimator are recommended in tackling the outlier issues. An even worse scenario is when these two problems occur simultaneously. Combination of both leads to robust ridge regression methods which can be used to handle both conditions simultaneously. In this study, a comparative investigation is carried out to compare the performance between ridge MM and ridge LTS estimators. The Root Mean Square Error (RMSE) and Bias are obtained for each estimator to compare their performances. By using simulation study, Laplace and Cauchy distributions are used in introducing outliers to the simulated data with high multicollinearity ρ = 0.90, 0.95 and 0.98 for sample sizes n=25, 50 and 100. From the results, it is found that Ridge LTS is the best estimator for many combinations of error distributions and degrees of multicollinearity. Similar results were obtained when using two sets of real data. © Published under licence by IOP Publishing Ltd.
ISSN:17426588
DOI:10.1088/1742-6596/1366/1/012113