Warm-up: Do the work on the slip of paper (handout)

Upload: clinton-dean

Post on 14-Dec-2015


TRANSCRIPT

  • Slide 1

WARM-UP
Do the work on the slip of paper (handout).

  • Slide 2

HOMEWORK QUESTIONS

  • Slide 3

LEAST SQUARES REGRESSION LINES
Section 3.2

  • Slide 4

REGRESSION LINE
A regression line is a straight line that describes how a response variable (y) changes as an explanatory variable (x) changes. You can use a regression line to predict the value of y for any value of x by substituting that x into the equation of the line.

  • Slide 5

INTERPRETING REGRESSION LINES

  • Slide 6

INFLUENTIAL POINT
An observation is influential if removing it would markedly change the position of the regression line. Points that are outliers in the x direction are often influential.

  • Slide 7

EXTRAPOLATION
Extrapolation is the use of a regression line for prediction using values of the explanatory variable (x) outside the range of the data from which the line was calculated. Avoid it: such predictions are often inaccurate.

See the warm-up. What if I told you that the x's were supposed to represent months and that the y's were supposed to represent low temperatures? Are your predictions still correct?

  • Slide 8

RESIDUALS

  • Slide 9

RESIDUAL PLOTS
A residual plot is a scatterplot that uses the explanatory variable as the x and the residuals as the y. We can use the residual plot to judge whether a straight line is an appropriate model for the data in a scatterplot.

  • Slide 10

TWO IMPORTANT THINGS
The residual plot should show no obvious pattern. A curved pattern shows that the relationship is not linear, so a straight line may not be the best model for such data. Increasing (or decreasing) spread about the line as x increases indicates that predictions of y will be less accurate for larger x (or, respectively, for smaller x).

The residuals should be relatively small in size. A regression line in a model that fits the data well should come close to most of the points. That is, the residuals should be fairly small. How do we decide whether the residuals are small enough? We consider the size of a typical prediction error.
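To make "the size of a typical prediction error" concrete, here is a minimal Python sketch. It assumes that the slide means the standard deviation of the residuals, computed as the square root of the sum of squared residuals divided by n - 2. The data, the line's coefficients, and the function names are all hypothetical, made up for illustration.

```python
import math

# Hypothetical data, invented for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

# Hypothetical line y-hat = intercept + slope * x.
# These values happen to be the least-squares coefficients for the data above.
intercept = 0.05
slope = 1.99


def predict(x):
    """Predicted y for a given x, by substituting x into the line's equation."""
    return intercept + slope * x


def residuals(x_values, y_values):
    """Residual = observed y minus predicted y, one per data point."""
    return [y - predict(x) for x, y in zip(x_values, y_values)]


def typical_prediction_error(res):
    """Standard deviation of the residuals, using n - 2 in the denominator."""
    n = len(res)
    return math.sqrt(sum(r * r for r in res) / (n - 2))


res = residuals(xs, ys)
s = typical_prediction_error(res)

# Pair each x with its residual: these pairs are the points of a residual plot.
for x, r in zip(xs, res):
    print(f"x = {x}: residual = {r:+.2f}")
print(f"typical prediction error: {s:.3f}")
```

Comparing `s` with the spread of the observed y values gives a rough sense of whether the residuals are "small enough" for the predictions to be useful.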
  • Slide 11

EXAMPLE: FAT GAIN
Almost all of the residuals are between -0.7 and 0.7. For these individuals, the predicted fat gain from the least-squares line is within 0.7 kg of their actual fat gain during the study. That sounds pretty good. But the subjects gained only between 0.4 kg and 4.2 kg, so a prediction error of 0.7 kg is relatively large compared with the actual fat gain for an individual.

The largest residual, 1.64, corresponds to a prediction error of 1.64 kg. This subject's actual fat gain was 3.8 kg, but the regression line predicted a fat gain of only 2.16 kg. That's a pretty large error, especially from the subject's perspective!

  • Slide 12

SOMETHING UNUSUAL
Residuals from the least squares regression line have an unusual property: the mean of the residuals is always zero. Why does this make sense?

  • Slide 13

LEAST SQUARES REGRESSION LINE

  • Slide 14

CALCULATING THE LSRL

  • Slide 15

COEFFICIENT OF DETERMINATION

  • Slide 16

CAUTION!
Correlation and regression must be interpreted with caution. Plot the data to be sure that the relationship is roughly linear and to detect outliers. Also, correlation and the regression line are nonresistant: outliers in x will often greatly influence the regression line. Most of all, be careful not to conclude that there is a cause-and-effect relationship between two variables just because they have a strong linear relationship. (Don't mistake correlation for causation!)

  • Slide 17

HOMEWORK
Page 191 (35-42, 44-46)
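The formulas for Slides 13 to 15 did not survive in this transcript, so the following Python sketch is only one standard way to fill in the idea, not necessarily what the slides showed. It assumes the usual textbook formulas: slope b = r · (s_y / s_x), intercept a = ȳ - b · x̄, and coefficient of determination r². It also checks the property from Slide 12: because the line passes through (x̄, ȳ), the residuals sum to zero. The data and all function names are hypothetical.

```python
import math


def mean(values):
    return sum(values) / len(values)


def sample_sd(values):
    """Sample standard deviation (n - 1 in the denominator)."""
    m = mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))


def correlation(xs, ys):
    """Correlation r: average product of standardized x and y values."""
    mx, my = mean(xs), mean(ys)
    sx, sy = sample_sd(xs), sample_sd(ys)
    products = (((x - mx) / sx) * ((y - my) / sy) for x, y in zip(xs, ys))
    return sum(products) / (len(xs) - 1)


def lsrl(xs, ys):
    """Return (intercept, slope, r) of the least-squares regression line."""
    r = correlation(xs, ys)
    slope = r * sample_sd(ys) / sample_sd(xs)
    intercept = mean(ys) - slope * mean(xs)
    return intercept, slope, r


# Hypothetical data, invented for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

a, b, r = lsrl(xs, ys)
res = [y - (a + b * x) for x, y in zip(xs, ys)]

print(f"y-hat = {a:.2f} + {b:.2f} x")
print(f"r^2 = {r ** 2:.2f}")
print(f"mean of residuals = {mean(res):.10f}")
```

For these made-up numbers, r² says that 60% of the variation in y is accounted for by the straight-line relationship with x.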