Title: 12-1 Multiple Linear Regression Models
1(No Transcript)
2(No Transcript)
3(No Transcript)
412-1 Multiple Linear Regression Models
12-1.1 Introduction
- Many applications of regression analysis involve
situations in which there are more than one
regressor variable. - A regression model that contains more than one
regressor variable is called a multiple
regression model.
512-1 Multiple Linear Regression Models
12-1.1 Introduction
- For example, suppose that the effective life of
a cutting tool depends on the cutting speed and
the tool angle. A possible multiple regression
model could be
where Y tool life x1 cutting speed x2 tool
angle
612-1 Multiple Linear Regression Models
12-1.1 Introduction
Figure 12-1 (a) The regression plane for the
model E(Y) 50 10x1 7x2. (b) The contour
plot
712-1 Multiple Linear Regression Models
12-1.1 Introduction
812-1 Multiple Linear Regression Models
12-1.1 Introduction
Figure 12-2 (a) Three-dimensional plot of the
regression model E(Y) 50 10x1 7x2 5x1x2.
(b) The contour plot
912-1 Multiple Linear Regression Models
12-1.1 Introduction
Figure 12-3 (a) Three-dimensional plot of the
regression model E(Y) 800 10x1 7x2 8.5x12
5x22 4x1x2. (b) The contour plot
1012-1 Multiple Linear Regression Models
12-1.2 Least Squares Estimation of the Parameters
1112-1 Multiple Linear Regression Models
12-1.2 Least Squares Estimation of the Parameters
- The least squares function is given by
- The least squares estimates must satisfy
1212-1 Multiple Linear Regression Models
12-1.2 Least Squares Estimation of the Parameters
- The least squares normal Equations are
- The solution to the normal Equations are the
least squares estimators of the regression
coefficients.
1312-1 Multiple Linear Regression Models
Example 12-1
1412-1 Multiple Linear Regression Models
Example 12-1
1512-1 Multiple Linear Regression Models
Figure 12-4 Matrix of scatter plots (from
Minitab) for the wire bond pull strength data in
Table 12-2.
1612-1 Multiple Linear Regression Models
Example 12-1
1712-1 Multiple Linear Regression Models
Example 12-1
1812-1 Multiple Linear Regression Models
Example 12-1
1912-1 Multiple Linear Regression Models
12-1.3 Matrix Approach to Multiple Linear
Regression
Suppose the model relating the regressors to the
response is
In matrix notation this model can be written as
2012-1 Multiple Linear Regression Models
12-1.3 Matrix Approach to Multiple Linear
Regression
where
2112-1 Multiple Linear Regression Models
12-1.3 Matrix Approach to Multiple Linear
Regression
We wish to find the vector of least squares
estimators that minimizes
The resulting least squares estimate is
2212-1 Multiple Linear Regression Models
12-1.3 Matrix Approach to Multiple Linear
Regression
2312-1 Multiple Linear Regression Models
Example 12-2
24Example 12-2
2512-1 Multiple Linear Regression Models
Example 12-2
2612-1 Multiple Linear Regression Models
Example 12-2
2712-1 Multiple Linear Regression Models
Example 12-2
2812-1 Multiple Linear Regression Models
Example 12-2
29(No Transcript)
3012-1 Multiple Linear Regression Models
Estimating ?2
An unbiased estimator of ?2 is
3112-1 Multiple Linear Regression Models
12-1.4 Properties of the Least Squares Estimators
Unbiased estimators
Covariance Matrix
3212-1 Multiple Linear Regression Models
12-1.4 Properties of the Least Squares Estimators
Individual variances and covariances
In general,
3312-2 Hypothesis Tests in Multiple Linear
Regression
12-2.1 Test for Significance of Regression
The appropriate hypotheses are
The test statistic is
3412-2 Hypothesis Tests in Multiple Linear
Regression
12-2.1 Test for Significance of Regression
3512-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-3
3612-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-3
3712-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-3
3812-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-3
3912-2 Hypothesis Tests in Multiple Linear
Regression
R2 and Adjusted R2
The coefficient of multiple determination
- For the wire bond pull strength data, we find
that R2 SSR/SST 5990.7712/6105.9447 0.9811. - Thus, the model accounts for about 98 of the
variability in the pull strength response.
4012-2 Hypothesis Tests in Multiple Linear
Regression
R2 and Adjusted R2
The adjusted R2 is
- The adjusted R2 statistic penalizes the analyst
for adding terms to the model. - It can help guard against overfitting
(including regressors that are not really useful)
4112-2 Hypothesis Tests in Multiple Linear
Regression
12-2.2 Tests on Individual Regression
Coefficients and Subsets of Coefficients
The hypotheses for testing the significance of
any individual regression coefficient
4212-2 Hypothesis Tests in Multiple Linear
Regression
12-2.2 Tests on Individual Regression
Coefficients and Subsets of Coefficients
The test statistic is
- Reject H0 if t0 gt t?/2,n-p.
- This is called a partial or marginal test
4312-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-4
4412-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-4
4512-2 Hypothesis Tests in Multiple Linear
Regression
The general regression significance test or the
extra sum of squares method
We wish to test the hypotheses
4612-2 Hypothesis Tests in Multiple Linear
Regression
A general form of the model can be written
where X1 represents the columns of X associated
with ?1 and X2 represents the columns of X
associated with ?2
4712-2 Hypothesis Tests in Multiple Linear
Regression
For the full model
If H0 is true, the reduced model is
4812-2 Hypothesis Tests in Multiple Linear
Regression
The test statistic is
Reject H0 if f0 gt f?,r,n-p The test in Equation
(12-32) is often referred to as a partial F-test
4912-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-5
5012-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-5
5112-2 Hypothesis Tests in Multiple Linear
Regression
Example 12-5
5212-3 Confidence Intervals in Multiple Linear
Regression
12-3.1 Confidence Intervals on Individual
Regression Coefficients
Definition
5312-3 Confidence Intervals in Multiple Linear
Regression
Example 12-6
5412-3 Confidence Intervals in Multiple Linear
Regression
12-3.2 Confidence Interval on the Mean Response
The mean response at a point x0 is estimated by
The variance of the estimated mean response is
5512-3 Confidence Intervals in Multiple Linear
Regression
12-3.2 Confidence Interval on the Mean Response
Definition
5612-3 Confidence Intervals in Multiple Linear
Regression
Example 12-7
5712-3 Confidence Intervals in Multiple Linear
Regression
Example 12-7
5812-4 Prediction of New Observations
A point estimate of the future observation Y0 is
A 100(1-?) prediction interval for this future
observation is
5912-4 Prediction of New Observations
Figure 12-5 An example of extrapolation in
multiple regression
6012-4 Prediction of New Observations
Example 12-8
6112-5 Model Adequacy Checking
12-5.1 Residual Analysis
Example 12-9
Figure 12-6 Normal probability plot of residuals
6212-5 Model Adequacy Checking
12-5.1 Residual Analysis
Example 12-9
6312-5 Model Adequacy Checking
12-5.1 Residual Analysis
Example 12-9
Figure 12-7 Plot of residuals against yi.
6412-5 Model Adequacy Checking
12-5.1 Residual Analysis
Example 12-9
Figure 12-8 Plot of residuals against x1.
6512-5 Model Adequacy Checking
12-5.1 Residual Analysis
Example 12-9
Figure 12-9 Plot of residuals against x2.
6612-5 Model Adequacy Checking
12-5.1 Residual Analysis
6712-5 Model Adequacy Checking
12-5.1 Residual Analysis
The variance of the ith residual is
6812-5 Model Adequacy Checking
12-5.1 Residual Analysis
6912-5 Model Adequacy Checking
12-5.2 Influential Observations
Figure 12-10 A point that is remote in x-space.
7012-5 Model Adequacy Checking
12-5.2 Influential Observations
Cooks distance measure
7112-5 Model Adequacy Checking
Example 12-10
7212-5 Model Adequacy Checking
Example 12-11
7312-6 Aspects of Multiple Regression Modeling
12-6.1 Polynomial Regression Models
7412-6 Aspects of Multiple Regression Modeling
Example 12-12
7512-6 Aspects of Multiple Regression Modeling
Example 12-11
Figure 12-11 Data for Example 12-11.
76Example 12-12
7712-6 Aspects of Multiple Regression Modeling
Example 12-12
7812-6 Aspects of Multiple Regression Modeling
12-6.2 Categorical Regressors and Indicator
Variables
- Many problems may involve qualitative or
categorical variables. - The usual method for the different levels of a
qualitative variable is to use indicator
variables. - For example, to introduce the effect of two
different operators into a regression model, we
could define an indicator variable as follows
7912-6 Aspects of Multiple Regression Modeling
Example 12-13
8012-6 Aspects of Multiple Regression Modeling
Example 12-13
8112-6 Aspects of Multiple Regression Modeling
Example 12-13
82Example 12-12
8312-6 Aspects of Multiple Regression Modeling
Example 12-13
8412-6 Aspects of Multiple Regression Modeling
Example 12-13
8512-6 Aspects of Multiple Regression Modeling
12-6.3 Selection of Variables and Model Building
8612-6 Aspects of Multiple Regression Modeling
12-6.3 Selection of Variables and Model
Building All Possible Regressions Example 12-15
8712-6 Aspects of Multiple Regression Modeling
12-6.3 Selection of Variables and Model
Building All Possible Regressions Example 12-15
8812-6 Aspects of Multiple Regression Modeling
12-6.3 Selection of Variables and Model
Building All Possible Regressions Example 12-15
Figure 12-12 A matrix of Scatter plots from
Minitab for the Wine Quality Data.
89(No Transcript)
9012-6.3 Selection of Variables and Model
Building Stepwise Regression Example 12-15
9112-6.3 Selection of Variables and Model
Building Backward Regression Example 12-15
9212-6 Aspects of Multiple Regression Modeling
12-6.4 Multicollinearity
Variance Inflation Factor (VIF)
9312-6 Aspects of Multiple Regression Modeling
12-6.4 Multicollinearity
The presence of multicollinearity can be detected
in several ways. Two of the more easily
understood of these are
94(No Transcript)