Handling Outliers and Missing Data in Regression Models Using R: Simulation Examples

Citation:
Abonazel, M. R., "Handling Outliers and Missing Data in Regression Models Using R: Simulation Examples", Academic Journal of Applied Mathematical Sciences, vol. 6, issue 8, pp. 187-203, 2020.

Abstract:

This paper reviews two important problems in regression analysis, outliers and missing data, along with methods for handling them. Two applications, implemented in R, illustrate these methods and give researchers
practical guidance for dealing with these problems in regression modeling. Finally, a Monte Carlo simulation study
compares different methods for handling missing data in the regression model. The simulation results indicate that,
under the simulation factors considered, the k-nearest neighbors method is the best of the compared methods for
estimating missing values in regression models.
