Please enable JavaScript.
Coggle requires JavaScript to display documents.
Linear Regression (Problems (Collinearity (What, Multi Collinearity…
Linear Regression
Problems
Outliers
- Truth far away from predicted value
- Detection : Studentised residual Vs
predicted values. Standing out point.
- studentised residual = residual value
divided by standard error of residuals.
- Value beyond 3 is outlier
High-leverage Points
- High leverage points impact
regression line to great extent.
- leverage statistic for SLR:
- Leverage statistics always between
1/n and 1. Average value (p+1)/n
-
-
-
-
MLR
-
-
2. Do all or subset of predictors are
useful? "Variable Selection" Best
Model selection using:adj R-square,
Mallows cp,AIC, BIC
Backward
- Start with all p variables
- Remove the variables
for which p values is max
- again fit with p-1 var.
- remove one which has
max p-value
Forward
- start with null model
- Do single regression
with each p predictors?
- Choose the one with
lowest RSS.
- Select next predictor
whom RSS gets lowest?
Mixed
- Start with null model
- Add predictors basis
forward selection.
- if at the time of adding
the p-value goes above
thresold value then
remove that variable.
-
4. Given Predictor values, how to predict response and prediction accuracy ?
-
-
Extension of
Linear Model
-
Linear
Assumption
-
Detection of non Linearity:
Plot Residual vs predicted
values If the trend line has
some non linear pattern, then
it exists.
Solution: Polynomial Regression
Do non linear transformation
of predictors and add them
as new predictors. Like take the
square or log or cube.