created by erin hodgess, houston, texas section 9-5 multiple regression

13
Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Upload: winfred-paul

Post on 05-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Created by Erin Hodgess, Houston, Texas

Section 9-5 Multiple Regression

Page 2: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Multiple Regression

Definition Multiple Regression Equation

A linear relationship between a dependent

variable y and two or more independent

variables (x1, x2, x3 . . . , xk)

Page 3: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Definition Multiple Regression Equation

A linear relationship between a dependent

variable y and two or more independent

variables (x1, x2, x3 . . . , xk)

y = b0 + b1x1 + b2x2 + . . . + bkxk ^

Multiple Regression

Page 4: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

y = b0 + b1 x1+ b2 x2+ b3 x3 +. . .+ bk xk(General form of the estimated multiple regression equation)

n = sample size

k = number of independent variables

y = predicted value of the dependent variable y

x1, x2, x3 . . . , xk are the independent

variables

^

^

Notation

Page 5: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

ß0 = the y-intercept, or the value of y when all of the predictor variables are 0

b0 = estimate of ß0 based on the sample data

ß1, ß2, ß3 . . . , ßk are the coefficients of

the independent variables x1, x2, x3 . . . , xk

b1, b2, b3 . . . , bk are the sample estimates of the

coefficients ß1, ß2, ß3 . . . , ßk

Notation

Page 6: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

AssumptionUse a statistical software package such as

STATDISK

Minitab

Excel

Page 7: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Example: Bears

For reasons of safety, a study of bears involved the collection of various measurements that were taken after the bears were anesthetized. Using the data in Table 9-3, find the multiple regression equation in which the dependent variable is weight and the independent variables are head length and total overall length.

Page 8: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Example: Bears

Page 9: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Example: Bears

Page 10: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Example: Bears

The regression equation is:

WEIGHT = –374 + 18.8 HEADLEN + 5.87 LENGTH

y = –374 + 18.8x3 + 5.87x6

Page 11: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Adjusted R2

The multiple coefficient of determination is a measure of how well the multiple regression equation fits the sample data.

The Adjusted coefficient of determination

R2 is modified to account for the number of variables and the sample size.

Definitions

Page 12: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Adjusted R2

Adjusted R2 = 1 – (n – 1)

[n – (k + 1)](1– R2)

Formula 9-6

where n = sample size

k = number of independent (x) variables

Page 13: Created by Erin Hodgess, Houston, Texas Section 9-5 Multiple Regression

Finding the Best MultipleRegression Equation

1. Use common sense and practical considerations to include or exclude variables.

2. Instead of including almost every available variable, include relatively few independent (x) variables, weeding out independent variables that don’t have an effect on the dependent variable.

3. Select an equation having a value of adjusted R2 with this property: If an additional independent variable is included, the value of adjusted R2 does not increase by a substantial amount.

4. For a given number of independent (x) variables, select the equation with the largest value of adjusted R2.

5. Select an equation having overall significance, as determined by the P-value in the computer display.