slide slide 1 copyright 2007 pearson education, inc publishing as pearson addison-wesley. lecture...

48
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola

Upload: annabelle-anderson

Post on 19-Jan-2018

219 views

Category:

Documents


0 download

DESCRIPTION

Slide Slide 3 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 10-1 Overview Created by Erin Hodgess, Houston, Texas Revised to accompany 10 th Edition, Tom Wegleitner, Centreville, VA

TRANSCRIPT

Page 1: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 1Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Lecture Slides

Elementary Statistics Tenth Edition

and the Triola Statistics Series

by Mario F. Triola

Page 2: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 2Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Chapter 10Correlation and Regression

10-1 Overview

10-2 Correlation

10-3 Regression

10-4 Variation and Prediction Intervals

10-5 Multiple Regression

10-6 Modeling

Page 3: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 3Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Section 10-1Overview

Created by Erin Hodgess, Houston, TexasRevised to accompany 10th Edition, Tom Wegleitner, Centreville, VA

Page 4: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 4Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Overview

This chapter introduces important methods for making inferences about a correlation (or relationship) between two variables, and describing such a relationship with an equation that can be used for predicting the value of one variable given the value of the other variable.

We consider sample data that come in pairs.

Page 5: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 5Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Section 10-2 Correlation

Created by Erin Hodgess, Houston, TexasRevised to accompany 10th Edition, Tom Wegleitner, Centreville, VA

Page 6: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 6Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Key Concept

This section introduces the linear correlation coefficient r, which is a numerical measure of the strength of the relationship between two variables representing quantitative data.

Because technology can be used to find the value of r, it is important to focus on the concepts in this section, without becoming overly involved with tedious arithmetic calculations.

Page 7: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 7Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Part 1: Basic Concepts of Correlation

Page 8: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 8Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Definition

A correlation exists between two variables when one of them is related to the other in some way.

Page 9: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 9Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Definition

The linear correlation coefficient r measures the strength of the linear relationship between paired x- and y- quantitative values in a sample.

Page 10: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 10Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Exploring the Data

We can often see a relationship between two variables by constructing a scatterplot.

Figure 10-2 following shows scatterplots with different characteristics.

Page 11: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 11Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Scatterplots of Paired Data

Figure 10-2

Page 12: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 12Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Scatterplots of Paired Data

Figure 10-2

Page 13: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 13Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Requirements

1. The sample of paired (x, y) data is a random sample of independent quantitative data.

2. Visual examination of the scatterplot must confirm that the points approximate a straight-line pattern.

3. The outliers must be removed if they are known to be errors. The effects of any other outliers should be considered by calculating r with and without the outliers included.

Page 14: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 14Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Notation for the Linear Correlation Coefficient

n represents the number of pairs of data present.

denotes the addition of the items indicated.

x denotes the sum of all x-values.

x2 indicates that each x-value should be squared and then those squares added.

(x)2 indicates that the x-values should be added and the total then squared.

xy indicates that each x-value should be first multiplied by its corresponding y-value. After obtaining all such products, find their sum.

r represents linear correlation coefficient for a sample.

represents linear correlation coefficient for a population.

Page 15: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 15Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Formula 10-1

nxy – (x)(y)n(x2) – (x)2 n(y2) – (y)2

r =

The linear correlation coefficient r measures the strength of a linear relationship between the paired values in a sample.

Calculators can compute r

Formula

Page 16: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 16Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Interpreting r

Using Table A-6: If the absolute value of the computed value of r exceeds the value in Table A-6, conclude that there is a linear correlation. Otherwise, there is not sufficient evidence to support the conclusion of a linear correlation.

Using Software: If the computed P-value is less than or equal to the significance level, conclude that there is a linear correlation. Otherwise, there is not sufficient evidence to support the conclusion of a linear correlation.

Page 17: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 17Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Round to three decimal places so that it can be compared to critical values in Table A-6.

Use calculator or computer if possible.

Rounding the Linear Correlation Coefficient r

Page 18: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 18Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

35

18

36

54

Datax

y

Example: Calculating rUsing the simple random sample of data below, find the value of r.

Page 19: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 19Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Example: Calculating r - cont

Page 20: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 20Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

nxy – (x)(y)n(x2) – (x)2 n(y2) – (y)2

r =

61 – (12)(23)4(44) – (12)2 4(141) – (23)2

r =

-3233.466

r = = -0.956

35

18

36

54

Datax

y

Example: Calculating r - cont

Page 21: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 21Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Example: Calculating r - cont

Given r = - 0.956, if we use a 0.05 significance level we conclude that there is a linear correlation between x and y since the absolute value of r exceeds the critical value of 0.950. However, if we use a 0.01 significance level, we do not conclude that there is a linear correlation because the absolute value of r does not exceed the critical value of 0.999.

Page 22: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 22Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 23: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 23Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 24: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 24Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 25: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 25Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 26: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 26Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 27: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 27

Correlation in Excel

• Excel>File>Option>Adds- In> Analysis Toolpack> Go

• Data> Data Analysis> Correlation• Data> Data Analysis> RegresssionScatter PlotInsert> scatter plotYou can add trendline by: put cursor on the dot

and right click > add trendline> linear

Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 28: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 28Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 29: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 29Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Properties of the Linear Correlation Coefficient r

1. –1 r 1

2. The value of r does not change if all values of either variable are converted to a different scale.

3. The value of r is not affected by the choice of x and y. Interchange all x- and y-values and the value of r will not change.

4. r measures strength of a linear relationship.

Page 30: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 30Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Interpreting r : Explained Variation

The value of r2 is the proportion of the variation in y that is explained by the linear relationship between x and y.

Page 31: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 31Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 32: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 32Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 33: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 33Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 34: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 34Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Common Errors Involving Correlation

1. Causation: It is wrong to conclude that correlation implies causality.

2. Averages: Averages suppress individual variation and may inflate the correlation coefficient.

3. Linearity: There may be some relationship between x and y even when there is no linear correlation.

Page 35: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 35Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 36: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 36Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 37: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 37Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Part 2: Formal Hypothesis Test

Page 38: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 38Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Formal Hypothesis Test

We wish to determine whether there is a significant linear correlation between two variables.

We present two methods.In both methods let H0: = (no significant linear correlation)

H1: (significant linear correlation)

Page 39: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 39Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Figure 10-3

Hypothesis Test for a Linear Correlation

Page 40: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 40Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Test statistic:

Critical values:

Use Table A-3 with degrees of freedom = n – 2

1 – r 2

n – 2

rt =

Method 1: Test Statistic is t

Page 41: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 41Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Method 1 - cont P-value: Use Table A-3 with degrees of freedom = n – 2

Conclusion: If the absolute value of t is > critical value from Table A-3, reject H0 and conclude that there is a linear correlation. If the absolute value of t ≤ critical value, fail to reject H0; there is not sufficient evidence to conclude that there is a linear correlation.

Page 42: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 42Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Method 2: Test Statistic is r

Test statistic: rCritical values: Refer to Table A-6Conclusion: If the absolute value of r is > critical value from Table A-6, reject H0 and conclude that there is a linear correlation. If the absolute value of r ≤ critical value, fail to reject H0; there is not sufficient evidence to conclude that there is a linear correlation.

Page 43: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 43Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 44: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 44Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 45: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 45Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 46: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 46

HW

• 9.2:24

Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Page 47: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 47Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Justification for r Formula

The text presents a detailed rationale for the use of Formula 9-1. The student should study it carefully.

Page 48: Slide Slide 1 Copyright  2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the

SlideSlide 48Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

Recap

In this section, we have discussed: Correlation. The linear correlation coefficient r. Requirements, notation and formula for r. Interpreting r. Formal hypothesis testing (two methods). Rationale for calculating r.