Chapter 2. Simple Linear Regression
Regression Analysis
Studies the functional relationship between variables:
- response variable y (dependent variable)
- explanatory variable x (independent variable)
Simple linear regression model
- When E(Y) is a linear function of the parameters, the model is called a linear statistical model.
- Simple linear regression model : E(Y) = β0 + β1x

Method of estimation
- Sum of squares for error : SSE(β0, β1) = Σ_{i=1}^{n} (yi − β0 − β1xi)²
- The least squares estimators β̂0 and β̂1 are the estimators of β0 and β1 that minimize SSE(β0, β1)
- Closed form : β̂1 = Σ (xi − x̄)(yi − ȳ) / Σ (xi − x̄)², β̂0 = ȳ − β̂1x̄
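As a quick illustration (hypothetical data, not from the notes), the closed-form estimates can be computed directly:

```python
import numpy as np

# Hypothetical data for illustration
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Closed-form least squares estimates for E(Y) = beta0 + beta1 * x
b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
print(f"beta0_hat = {b0:.3f}, beta1_hat = {b1:.3f}")
```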

Method of inference
- Under normal errors, (β̂1 − β1)/s.e.(β̂1) ~ t(n − 2), and similarly for β̂0
- This yields t-tests (e.g., H0 : β1 = 0) and confidence intervals for the coefficients
Measuring the quality of fit
Decomposition of Sum of Squares
Σ_{i=1}^{n} (yi − ȳ)² = Σ_{i=1}^{n} (ŷi − ȳ)² + Σ_{i=1}^{n} (yi − ŷi)², i.e., SST = SSR + SSE
Coefficient of determination
R² = SSR/SST = 1 − SSE/SST
- R² : proportion of the variation of y explained by x
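A minimal sketch (hypothetical data) of the decomposition and R²:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # hypothetical data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

b1, b0 = np.polyfit(x, y, 1)               # least squares line
y_hat = b0 + b1 * x

sse = np.sum((y - y_hat) ** 2)             # unexplained (error) sum of squares
sst = np.sum((y - y.mean()) ** 2)          # total sum of squares
ssr = sst - sse                            # explained (regression) sum of squares
print(f"R^2 = {ssr / sst:.3f}")            # proportion of variation explained by x
```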

Chapter 3. Multiple Linear Regression
- Multiple linear regression model : E(Y) = β0 + β1x1 + ... + βpxp
Least squares estimates
- minimize SSE(β0, ..., βp) = Σ_{i=1}^{n} (yi − β0 − β1xi1 − ... − βpxip)²
- residual : ei = yi − (β̂0 + β̂1xi1 + ... + β̂pxip) = yi − ŷi, where the β̂j solve the normal equations
- estimate of σ² : σ̂² = (1/(n − p − 1)) Σ_{i=1}^{n} (yi − ŷi)² = SSE/(n − p − 1)
Matrix approach
- Model : y = Xβ + ε, with X the n × (p + 1) design matrix
- Least squares estimator : β̂ = (X'X)⁻¹X'y, fitted values ŷ = Xβ̂
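A minimal numpy sketch (hypothetical data) of the matrix formulas above:

```python
import numpy as np

# Hypothetical design matrix: intercept column plus p = 2 predictors
X = np.column_stack([np.ones(6),
                     [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
                     [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]])
y = np.array([3.0, 3.5, 6.1, 6.4, 9.2, 9.0])

# beta_hat = (X'X)^{-1} X'y, computed by solving the normal equations
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Unbiased estimate of sigma^2: SSE / (n - p - 1)
n, p = X.shape[0], X.shape[1] - 1
e = y - X @ beta_hat
sigma2_hat = (e @ e) / (n - p - 1)
```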
Method of inference
Properties of estimates
Recall that
- E(β̂) = β (unbiasedness) and Var(β̂) = σ²(X'X)⁻¹
- Under normal errors, β̂ ~ N(β, σ²(X'X)⁻¹), which justifies t-tests and confidence intervals for each βj
Measuring the quality of fit
Decomposition of sum of squares
SST = SSR + SSE, with SST = Σ (yi − ȳ)², SSR = Σ (ŷi − ȳ)², SSE = Σ (yi − ŷi)²
Multiple correlation coefficient (MCC) & adjusted MCC
R² = SSR/SST = 1 − SSE/SST (the square of the multiple correlation coefficient)
- R² ↑ 1 means that y is determined more completely by the linear combination of x1, ..., xp, i.e., the proportion of the variation of y explained by x1, ..., xp grows
- Adding an explanatory variable never decreases R², because SSE unconditionally decreases
- R² is therefore inappropriate for comparing the fit of models with different numbers of explanatory variables ; consider instead the following adjusted R² :
R²a = 1 − (SSE/(n − p − 1)) / (SST/(n − 1)) = 1 − (n − 1)/(n − p − 1) · (1 − R²)
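A small helper (illustrative only) making the penalty for extra variables explicit:

```python
def adjusted_r2(sse: float, sst: float, n: int, p: int) -> float:
    """Adjusted R^2 for a model with p explanatory variables and n observations.

    Unlike R^2, this can decrease when a useless variable is added,
    because SSE is divided by n - p - 1 rather than compared raw.
    """
    return 1.0 - (sse / (n - p - 1)) / (sst / (n - 1))
```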
Interpretations of regression coefficients
yi = β0 + β1xi1 + ... + βpxip + εi
- β0 (constant coef.) : the value of E(y) when x1 = x2 = ... = xp = 0
- βj (regression coef.) : the change in E(y) per unit change in xj when the other xi's are held constant (fixed)
Chapter 4. Regression Diagnostics: Detection of Model Violations
Validity of model assumption
yi = β0 + β1xi1 + ... + βpxip + εi, εi ~ iid N(0, σ²)
Linearity assumption
E(Y) is a linear function of the parameters : E(Y) = β0 + β1x1 + ... + βpxp
⇒ graphical methods(scatter plot for simple linear regression)
Error distribution assumption
εi ~ iid N(0, σ²) : normality, zero mean, constant variance, independence
⇒ graphical methods based on residuals
Assumptions about explanatory variables
Explanatory variables are measured without error and are not strongly interrelated
⇒ graphical methods or correlation matrices
Residuals
- If the regression equation is obtained from the population, the difference between the observed value and the value predicted by the equation is the error
- If the regression equation is instead estimated from a sample, the difference between the observed value and the predicted value is the residual
ri = yi − ŷi
Residual plot
(x1, r), ..., (xp, r) plots
- If the assumptions hold, this should be a random scatter plot
- Tools for checking non-linearity / non-homogeneous variance
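A sketch of such a plot (hypothetical data; assumes matplotlib is available):

```python
import numpy as np
import matplotlib.pyplot as plt

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])    # hypothetical predictor
y = np.array([1.8, 4.1, 6.0, 8.2, 9.9, 12.1])   # hypothetical response

b1, b0 = np.polyfit(x, y, 1)
r = y - (b0 + b1 * x)                            # residuals

plt.scatter(x, r)                                # (x_i, r_i) residual plot
plt.axhline(0.0, linestyle="--")                 # reference line at zero
plt.xlabel("x")
plt.ylabel("residual")
plt.show()                                       # should look like a random scatter
```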

Scatter plot
- (xi1, yi), ..., (xip, yi) for the linearity assumption
- (xil, xim) (l ≠ m) for checking linear independence (multicollinearity)
Leverage, Influence and Outliers
- Leverage : checking for outliers in the explanatory variables
- Measures of influence : Cook's distance, Difference in Fits, Hadi's measure & the Potential-Residual plot
- Outliers : leverage (outliers in the predictors), standardized (studentized) residuals (outliers in the response variable)
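A numpy sketch (hypothetical data) of leverages, standardized residuals, and Cook's distance:

```python
import numpy as np

# Hypothetical data; the last observation has an extreme x value
X = np.column_stack([np.ones(6), [1.0, 2.0, 3.0, 4.0, 5.0, 20.0]])
y = np.array([1.1, 2.0, 2.9, 4.2, 5.1, 19.5])

H = X @ np.linalg.inv(X.T @ X) @ X.T       # hat matrix
h = np.diag(H)                             # leverages: flag outliers in the predictors

beta = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta                           # residuals
n, q = X.shape                             # q = p + 1 parameters
s2 = (e @ e) / (n - q)

r_std = e / np.sqrt(s2 * (1 - h))          # standardized (internally studentized) residuals
cooks_d = r_std**2 * h / (q * (1 - h))     # Cook's distance for each observation
```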
Chapter 5. Qualitative Variable as Predictors
- Sometimes it is necessary to use a qualitative (or categorical) variable in a regression, through indicator (dummy) variables
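A minimal sketch (hypothetical 3-level factor) of indicator coding; one level serves as the baseline to avoid exact collinearity with the intercept:

```python
import numpy as np

region = np.array(["A", "B", "C", "B", "A", "C"])   # hypothetical categorical predictor

# Two dummies for three levels; level "A" is the baseline
d_B = (region == "B").astype(float)
d_C = (region == "C").astype(float)

# These columns enter the model as ordinary predictors:
# E(Y) = beta0 + beta1 * x + beta2 * d_B + beta3 * d_C
```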
Chapter 6. Transformation of Variables
- Use transformations to achieve linearity and/or homoscedasticity



- The distribution of Y∣x may not be a normal distribution.
- In that case E(Y∣x) and V(Y∣x) may be functionally related. Examples : Poisson, binomial, negative binomial distributions
- When the distribution of Y∣x, or the functional relationship between E(Y∣x) and V(Y∣x), is known, a suitable transformation can make the normality assumption plausible and remove that functional relationship
- The log transformation is commonly used to stabilize (reduce) the variance
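A small simulation sketch (hypothetical data) of variance-stabilizing transformations; the square root is the classical choice for Poisson counts, the log when the standard deviation grows with the mean:

```python
import numpy as np

rng = np.random.default_rng(0)
lam = np.linspace(1.0, 50.0, 200)
y = rng.poisson(lam)        # Poisson: V(Y|x) = E(Y|x), so variance grows with the mean

y_sqrt = np.sqrt(y)         # square-root transform roughly stabilizes Poisson variance
y_log = np.log1p(y)         # log transform (log(1 + y) to handle zero counts)
```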
Chapter 7. Weighted Least Squares(WLS)
Heteroscedasticity : V(εi) = σi² is not constant across observations
⇒ The residual plot provides empirical evidence of heteroscedasticity
Strategies for treating heteroscedasticity
- Transformation of variables

- WLS

- Transformation (b) above gives the same result as WLS, but the transformed results are harder to interpret
Weighted Least Squares(WLS)
- We use WLS when we suspect that the equal-variance assumption on the errors is violated
- It is also used when you want a regression model that is less affected by outliers
minimize SSEw(β0, ..., βp) = Σ_{i=1}^{n} wi (yi − β0 − β1xi1 − ... − βpxip)²
- Idea (see the sketch below)
- Suspect observations are given smaller weights so that they have less effect on the minimized weighted sum of squares
- If wi = 0, the observation is excluded from the estimation ; if all wi are equal, WLS coincides with OLS
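A numpy sketch (hypothetical data and weights) of the weighted normal equations:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.1, 5.8, 8.4, 9.7])
w = np.array([1.0, 1.0, 1.0, 0.2, 1.0])   # down-weight a suspect observation

X = np.column_stack([np.ones_like(x), x])
W = np.diag(w)

# WLS: minimize sum_i w_i * (y_i - b0 - b1 * x_i)^2
beta_wls = np.linalg.solve(X.T @ W @ X, X.T @ W @ y)
```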
Sums of Squares Decomposition in WLS
Σ wi (yi − ȳw)² = Σ wi (ŷi − ȳw)² + Σ wi (yi − ŷi)²
where ȳw = Σ wiyi / Σ wi is the weighted mean of y
Chapter 8. The Problem of Correlated Errors
- Assumption of independence in the regression model : the error terms εi and εj are uncorrelated, Cov(εi, εj) = 0 for i ≠ j
- Autocorrelation
- The correlation when the observations have a natural sequential order
- Adjacent residuals tend to be similar in both temporal and spatial dimensions (e.g., economic time series)
Effect of Autocorrelation of Errors on Regression Analysis
- LSE of the regression coefficients loses efficiency (still unbiased, but no longer minimum variance)
- σ² and the s.e. of the regression coefficients may be underestimated ; in other words, the significance of the regression coefficients is overstated
- Commonly used confidence intervals or significance tests are no longer valid
Two types of the autocorrelation problem
- Type 1 : autocorrelation in appearance (omission of a variable that should be in the model)
→ Once this variable is uncovered, the problem is resolved
- Type 2: pure autocorrelation
→ involving a transformation of the data
Detection
- residual plot (index plot) : shows a systematic pattern
- runs test, Durbin-Watson test
Remedies
- Type 1 : consider additional variables if possible
- Type 2 : fit an AR model to the errors → reduces to a model with uncorrelated errors
Runs test
- Uses the signs (+, −) of the residuals
- Run : a maximal streak of consecutive residuals with the same sign
- NR: # of runs
- Idea: NR ↑ if negative corr, NR ↓ if positive corr
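A tiny sketch (hypothetical residuals) of counting runs:

```python
import numpy as np

e = np.array([0.5, 0.3, -0.2, -0.4, -0.1, 0.6, 0.2, -0.3])  # residuals in time order

signs = np.sign(e)
n_runs = 1 + int(np.sum(signs[1:] != signs[:-1]))  # one run plus one per sign change
# Too few runs -> positive autocorrelation; too many runs -> negative autocorrelation
```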
Durbin-Watson test (a popular test for autocorrelation in regression analysis)
- Used under the assumption that the errors follow a first-order autoregressive model, AR(1) :
εt = ρεt−1 + at, at ~ iid N(0, σa²), |ρ| < 1
- Durbin-Watson statistic & estimator of the autocorrelation :
d = Σ_{t=2}^{n} (et − et−1)² / Σ_{t=1}^{n} et², ρ̂ = Σ_{t=2}^{n} et et−1 / Σ_{t=1}^{n} et², so d ≈ 2(1 − ρ̂)
- Idea : small d (near 0) indicates positive correlation, large d (near 4) indicates negative correlation, d ≈ 2 indicates no autocorrelation
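A one-liner sketch (hypothetical residuals) of the statistic:

```python
import numpy as np

e = np.array([0.5, 0.3, -0.2, -0.4, -0.1, 0.6, 0.2, -0.3])  # residuals in time order

d = np.sum(np.diff(e) ** 2) / np.sum(e ** 2)   # Durbin-Watson statistic
rho_hat = 1.0 - d / 2.0                        # implied estimate of the autocorrelation
```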
Chapter 9. Analysis of Collinear Data
- Interpretation of the multiple regression equation depends implicitly on the assumption that the predictor variables are not strongly interrelated
- If the predictors are strongly interrelated, the regression results are ambiguous : the problem of collinear data, or multicollinearity
Multicollinearity
- Exact or near-exact linear dependence among the explanatory variables : c1x1 + c2x2 + ... + cpxp ≈ c0 for some constants cj
- Regression assumption: rank(X)=p+1
- Multicollinearity cannot be detected through residual analysis
- Its cause may be a lack of observations or intrinsic characteristics of the independent variables being analyzed
- The multicollinearity problem is considered after regression diagnostics, including residual analysis
Symptom of multicollinearity
- The model is significant overall, but some of the xi are individually insignificant
- The estimates β̂i are unstable : they change drastically when a variable is added or deleted
- Estimation results contrary to common sense (e.g., coefficients with unexpected signs)
Numerical measure of multicollinearity
Correlation coefficients of xi and xj (i ≠ j)
- Detect pairwise linear relations, but cannot detect a linear relation among three or more variables
Variance Inflation Factor(VIF)
VIFj = 1/(1 − Rj²), where Rj² is the R² from regressing xj on the other explanatory variables
- VIFj > 10 is taken as evidence of multicollinearity
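A numpy sketch of the VIF definition above (the helper name vif is hypothetical):

```python
import numpy as np

def vif(X: np.ndarray, j: int) -> float:
    """VIF of predictor j: 1 / (1 - R_j^2), where R_j^2 comes from
    regressing column j of X on the remaining predictor columns.

    X is an (n, p) matrix of predictors without an intercept column.
    """
    xj = X[:, j]
    Z = np.column_stack([np.ones(X.shape[0]), np.delete(X, j, axis=1)])
    beta = np.linalg.lstsq(Z, xj, rcond=None)[0]
    resid = xj - Z @ beta
    r2_j = 1.0 - (resid @ resid) / np.sum((xj - xj.mean()) ** 2)
    return 1.0 / (1.0 - r2_j)
```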
Principal components
Eigenvalues λ1 ≥ λ2 ≥ ... ≥ λp of the correlation matrix of the predictors ; eigenvalues near 0 indicate near-linear dependences
- Overall measure of multicollinearity :
condition number κ = √(λ1/λp) ; a large κ signals severe multicollinearity
What to do with multicollinearity data
- (Experimental situation) : design the experiment so that multicollinearity does not occur
- (Observational situation) : reduce the model (essentially, reduce the variables) using information from the PCs, or use ridge regression
Chapter 11. Variable Selection
- Goal: to explain the response with the smallest number of explanatory variables
- Balancing between goodness of fit and simplicity

Statistics used in Variable Selection
- To decide that one subset is better than another, we need criteria for subset selection
- The criterion is to minimize a modified SSEp (the SSE of the model with p explanatory variables)
Adjusted multiple correlation coefficient
- For fixed p, maximize over the possible choices of p variables :
Rp² = 1 − SSEp/SST
- For different p's, maximize the adjusted version :
R²a,p = 1 − (n − 1)/(n − p − 1) · SSEp/SST
Mallows' Cp
Cp = SSEp/σ̂² + 2(p + 1) − n, where σ̂² is estimated from the full model ; adequate models have Cp ≈ p + 1
AIC
AICp = n ln(SSEp/n) + 2(p + 1)
BIC
BICp = n ln(SSEp/n) + (p + 1) ln n
Partial F-test statistic for testing a reduced model (RM) against the full model (FM)
F = [(SSE(RM) − SSE(FM)) / (dfRM − dfFM)] / [SSE(FM) / dfFM]
Variable Selection
- Evaluating all possible equations : with q candidate variables there are 2^q possible subsets, so exhaustive search is feasible only for small q
- Variable selection procedures (partial F-test), sketched below :

- Forward selection
- Backward elimination
- Stepwise selection
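A sketch of forward selection under the partial F-test criterion (the function and the entry threshold f_in are hypothetical, for illustration only):

```python
import numpy as np

def forward_select(X: np.ndarray, y: np.ndarray, f_in: float = 4.0) -> list:
    """Greedy forward selection: at each step, add the variable with the
    largest partial F-statistic, stopping when none exceeds f_in."""
    n, p = X.shape
    selected = []

    def sse(cols):
        Z = np.column_stack([np.ones(n)] + [X[:, c] for c in cols])
        beta = np.linalg.lstsq(Z, y, rcond=None)[0]
        r = y - Z @ beta
        return float(r @ r)

    while len(selected) < p:
        sse_cur = sse(selected)
        best_j, best_f = None, 0.0
        for j in range(p):
            if j in selected:
                continue
            sse_new = sse(selected + [j])
            df = n - len(selected) - 2            # residual df of the larger model
            f = (sse_cur - sse_new) / (sse_new / df)
            if f > best_f:
                best_j, best_f = j, f
        if best_j is None or best_f < f_in:       # no variable clears F-to-enter
            break
        selected.append(best_j)
    return selected
```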
Chapter 12. Logistic Regression
- Dependent variable : qualitative & independent variables : quantitative or qualitative
Modeling Qualitative Data
- Rather than predicting these two values of the binary response variable, try to model the probabilities that the response takes one of these two values
- Let π denote the probability that Y=1 when X=x
- Logistic model
π = P(Y = 1 ∣ X = x) = exp(β0 + β1x) / (1 + exp(β0 + β1x))
- Logistic regression function(logistic model for multiple regression)
π = exp(β0 + β1x1 + ... + βpxp) / (1 + exp(β0 + β1x1 + ... + βpxp))
- Nonlinear in the parameters, but it can be linearized by the logit transformation :
log(π / (1 − π)) = β0 + β1x1 + ... + βpxp
- Odds : indicates how many times larger the probability of success is than that of failure
odds = π / (1 − π)
- Logit
logit(π) = log(π / (1 − π)) = log(odds)
- Modeling and estimating the logistic regression model
- Maximum likelihood estimation
- No closed-form expression exists for the parameter estimates, so fitting a logistic regression in practice requires a computer program (iterative numerical optimization)
- Information criteria such as AIC and BIC can be used for model selection
- Instead of SSE, the log-likelihood of the fitted model is used
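A minimal fit sketch (hypothetical data; uses scikit-learn, with a large C to approximate plain maximum likelihood since sklearn regularizes by default):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[0.5], [1.0], [1.5], [2.0], [2.5], [3.0]])  # hypothetical predictor
y = np.array([0, 0, 1, 0, 1, 1])                          # binary response

model = LogisticRegression(C=1e6).fit(X, y)   # C large -> nearly unpenalized MLE
b0, b1 = model.intercept_[0], model.coef_[0, 0]

# Fitted probabilities via the logistic function pi = e^(b0+b1x) / (1 + e^(b0+b1x))
pi_hat = 1.0 / (1.0 + np.exp(-(b0 + b1 * X[:, 0])))
```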
Diagnostics in logistic regression
- Diagnostic measures
Pearson residuals, deviance residuals, and leverage values
- How to use the measures: same way as the corresponding ones from a linear regression
