week 10 comparing two means or proportions. generalising from sample...

40
Week 10 Comparing Two Means or Proportions

Upload: russell-randall

Post on 18-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Week 10

Comparing Two Means or Proportions

Page 2: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Generalising from sample

Individuals Measurement Groups Question

Children aged 10

Mark in maths test

Boys & girls

Are male marks higher on average?

Plots in field Yield of wheat Varieties A & B

Which gives higher yields?

Cars leaving production line

CO emissions from exhaust

Production lines 1 & 2

Are both lines same?

Page 3: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Generalising from sample

Individuals Measurement Groups Question

Children aged 10

Pass/fail in maths test

Boys & girls Are males more likely to pass?

Cabbages in field

Infected by cabbage butterfly

Varieties A & B

Which is less likely to be infected?

Cars leaving production line

Rattle in exhaust

Production lines 1 & 2

Do both lines have same chance of rattle?

Page 4: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Numerical measurements: means

Difference in average weight loss for those who diet compared to those who exercise to lose weight?

Difference is there between the mean foot lengths of men and women?

Population parameter 2 – 1 = difference between population means

Sample estimate x2 – x1 = difference between sample means

Page 5: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Categorical measurements: propns

Difference between the proportions that would quit smoking if taking the antidepressant buproprion (Zyban) versus wearing a nicotine patch?

Difference between proportion who have heart disease of men who snore and men who don’t snore?

Population parameter 2 – 1 = difference between population

proportions

Sample estimate p2 – p1 = difference between sample proportions

Page 6: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Requirement: independent samples

Random samples taken separately from 2 populations

Randomised experiment with 2 treatments

One random sample, but a categorical variable splits individuals into 2 groups.

Two samples are called independent samples when the measurements in one sample are not related to the measurements in the other sample.

Page 7: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Model for numerical data

Sample 1 ~ population (mean 1, s.d. 1)Sample 2 ~ population (mean 2, s.d. 2)

Estimation: estimate (2 – 1) with Standard error? Confidence interval?

Testing: is (2 – 1) zero? p-value

x2 − x1( )

Page 8: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Model for categorical data

Sample 1 ~ population (proportion 1)Sample 2 ~ population (proportion 2)

Estimation: estimate (2 – 1) with (p2 – p1) Standard error? Confidence interval?

Testing: is (2 – 1) zero? p-value

Page 9: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Distribution of difference

In both cases, we need to find distribution of difference (p2 – p1) or

Independent samples >> difference of independent random variables.

We already know distns of the two parts — what is distn of their difference?

x2 − x1( )

Page 10: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Sum of 2 variables

Sample mean:

Sample total:

X1 + X22

~ distn, 2

⎝ ⎜

⎠ ⎟

X1 + X2 ~ distn2, 2( )

Same distns

Different distns

X1 + X2 ~ distn1 +2 , 12 +2

2 ⎛ ⎝ ⎜

⎞ ⎠ ⎟

Page 11: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Difference between 2 variables

Same standard devn as sum

X2 − X1 ~ distn2 −1, 12 +2

2 ⎛ ⎝ ⎜

⎞ ⎠ ⎟

If X1 and X2 are normal

X2 − X1 ~ normal2 −1, 12 +2

2 ⎛ ⎝ ⎜

⎞ ⎠ ⎟

Remember that X1 and X2 must be independent

Page 12: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Example

Husband height ~ normal(1.85, 0.1)Wife height ~ normal(1.7, 0.08)

Assume independent. (Probably not!!)

Prob that wife is taller than husband?

(Husband - Wife) ~

normal0.15, 0.12 + 0.082 ⎛ ⎝ ⎜

⎞ ⎠ ⎟ = normal0.15, 0.1281( )

Page 13: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Example

Husband height ~ normal(1.85, 0.1)Wife height ~ normal(1.7, 0.08)

Husband - Wife ~ normal(0.15, 0.1281)

P (diff ≤ 0) = area

0.15 0.28 0.410.02-0.11

z = −0.150.1281

= −0.534 Prob = 0.297

Page 14: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Difference between proportions

If X1 and X2 are independent,

X2 − X1 ~ distn2 −1, 12 +2

2 ⎛ ⎝ ⎜

⎞ ⎠ ⎟

If p1 and p2 are independent,

p2 − p1 ~ distn2 −1, 1 1−1( )

n1+2 1−2( )

n2

⎜ ⎜

⎟ ⎟

For large samples, p1 and p2 are approx normal, so their difference is too.

Page 15: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

n1 = n2 = 244 randomly assigned to each treatment

Std error for difference in propns

Nicotine patches vs Antidepressant (Zyban)?

Zyban: 85 out of 244 quit smokingPatch: 52 out of 244 quit smoking

s.e.(p1−p2 ) =p1 1−p1( )

n1+p2 1−p2( )

n2

So,

p1−p2 =.348 −.213 =.135

and s.e.(p1−p2 ) =.348 1−.348( )

244+.213 1−.213( )

244=.040

Page 16: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Approximate 95% C.I.

Best you can do for difference between proportions

For means, CI can be improved by replacing ‘2’ by a different value.

For sufficiently large samples, the interval

Estimate 2 Standard error

is an approximate 95% C.I.

Page 17: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Patch vs Antidepressant

Approx 95% C.I. .135 2(.040) => .135 .080 => .055 to .215

Study: n1 = n2 = 244 randomly assigned to each group

Zyban:85 of the 244 Zyban users quit smoking = .348

Patch: 52 of the 244 patch users quit smoking = .2131p̂2p̂

So, 135.213.348.ˆˆ 21 =−=−pp 040.)ˆˆ.(. and 21 =−ppes

We are 95% confident that Zyban gives an improvement of between 5.5% and 21.5% of the probability of quitting smoking.

Page 18: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Difference between means

If X1 and X2 are independent,

X2 − X1 ~ distn2 −1, 12 +2

2 ⎛ ⎝ ⎜

⎞ ⎠ ⎟

If X1 and X2 are independent,

X2 − X1 ~ distn2 −1, 12

n1+22

n2

⎜ ⎜

⎟ ⎟

If both populations are normal, so is the difference.

Page 19: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

n1 = 42 men on diet, n2 = 27 men on exercise routine

Std error for difference in means

Lose More Weight by Diet or Exercise?

2

22

1

21

21 ).(.n

s

n

sxxes +=−

Diet: Lost an average of 7.2 kg with std dev of 3.7 kgExercise: Lost an average of 4.0 kg with std dev of 3.9 kg

So, kg 2.30.42.721 =−=−xx

( ) ( )81.0

47

9.3

42

7.3).(. and

22

21 =+=− xxes

Page 20: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

We are 95% confident that those who diet lose on average 1.58 to 4.82 kg more than those who exercised.

Approximate 95% Confidence Interval: 3.2 2(.81) => 3.2 1.62 => 1.58 to 4.82 kg

Study: n1 = 42 men on diet, n2 = 27 men exercise

Diet: Lost an average of 7.2 kg with std dev of 3.7 kgExercise: Lost an average of 4.0 kg with std dev of 3.9 kg

So, kg 2.30.42.721 =−=−xx kg 81.0).(. and 21 =−xxes

Diet vs Exercise

Page 21: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

A CI for the Difference Between Two Means(Independent Samples):

where t* is a value from t-tables.

2

22

1

21*

21 n

s

n

stxx +±−

Better C.I. for mean

d.f. = min(n1–1, n2–1) Welch’s approx gives a different d.f. (higher)

but is a complicated formula t* is approx 1.96 if d.f. is high

Page 22: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Randomized experiment: Researchers either stared or did not stare at drivers stopped at a campus stop sign; Timed how long (sec) it took driver to proceed from sign to a mark on other side of the intersection.

Estimate difference between the mean crossing times.

No Stare Group (n = 14): 8.3, 5.5, 6.0, 8.1, 8.8, 7.5, 7.8, 7.1, 5.7, 6.5, 4.7, 6.9, 5.2, 4.7

Stare Group (n = 13): 5.6, 5.0, 5.7, 6.3, 6.5, 5.8, 4.5, 6.1, 4.8, 4.9, 4.5, 7.2, 5.8

Effect of a stare on driving

Page 23: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

No outliers; no strong skewness.

Crossing times in stare group seem faster & less variable.

Checking data

Page 24: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

A 95% CI for 2–1 is

Effect of stare on driving

Using df = min(n1–1, n2–1) = 12, gives t* = 2.179

x2 − x1 = 6.63 −5.59 = 1.04 sec

s.e.(x 2 − x 1) =1.36( )

2

14+

0.822( )2

13= 0.43

1.04 ± 2.179 × 0.43 = 0.10 to 1.98 sec

Page 25: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Slightly narrower C.I. that we got with d.f. = 12.

N.B. C.I. is based on df = 21 (Welch’s approx)

Effect of stare on driving

Minitab

Page 26: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Interpretation

We are 95% confident that it takes drivers between 0.17 and 1.91 seconds less on average to cross intersection if someone stares at them.

A 95% CI for 2–1 is 0.17 to 1.91 sec

Page 27: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Testing two proportions

Hypotheses

H0: 1 – 2 = 0 HA: 1 – 2 ≠ 0

or 1 – 2 < 0or 1 – 2 > 0

Watch how Population 1 and 2 are defined. Data requirements

Independent samples n1 p1, n1(1-p1), n2 p2, n2(1-p2) all at least 5, preferably ≥10

Page 28: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Test statistic

Based on p1 – p2

Standardise:

z = p1 − p2( ) − 0

se p1 − p2( )

se p1 − p2( ) = π1 1− π1( )

n1

+π 2 1− π 2( )

n2

=1

n1

+1

n2

⎝ ⎜

⎠ ⎟π 1− π( ) if H0 is true

Page 29: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Test statistic

If H0 is true, best estimate of is

z = p1 − p2( ) − 0

1

n1

+1

n2

⎝ ⎜

⎠ ⎟p 1− p( )

p = x1 + x2( )n1 + n2( )

So we use test statistic

If H0 is true, this has standard normal distn p-value from normal distn

Page 30: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Prevention of Ear Infections

Does the use of sweetener xylitol reduce the incidence of ear infections?

Randomized Experiment:Of 165 children on placebo, 68 got ear infection.Of 159 children on xylitol, 46 got ear infection.

Hypotheses: H0: 1 – 2 = 0 Ha: 1 – 2 > 0 Data check: At least 5 success & failure in each group

123.ˆˆ and ,289.159

46ˆ ,412.

165

68ˆ 2121 =−==== pppp

Page 31: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Prevention of Ear Infections

Overall propn getting infection

35.324

114

159165

4668ˆˆˆ

21

2211 ==++

=++

=nnpnpn

p

( ) ( )32.2

1591

1651

35.135.

0123.

11ˆ1ˆ

0ˆˆ

21

21 =

⎟⎠⎞

⎜⎝⎛ +−

−=

⎟⎟⎠

⎞⎜⎜⎝

⎛+−

−−=

nnpp

ppz

Test statistic

p-value = 0.01

Conclusion: Strong evidence xylitol reduces

chance of ear infection

Page 32: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Testing two means

Hypotheses

H0: 1 – 2 = 0 HA: 1 – 2 ≠ 0

or 1 – 2 < 0or 1 – 2 > 0

Watch how Population 1 and 2 are defined. Data requirements

Fairly large n1 and n2 (say 30 or more), or Not much skewness & no outliers (normal model reasonable)

Page 33: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Test statistic

Based on

Standardise:

t = x 1 − x 2s1

2

n1

+s2

2

n2€

se x 1 − x 2( ) = σ 1

2

n1

+σ 2

2

n2

≈s1

2

n1

+s2

2

n2

x 1 − x 2

t = x 1 − x 2( ) − 0

se x 1 − x 2( )

Page 34: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Test

Test statistic:

If H0 is true, this has approx t-distn with

d.f. = min(n1–1, n2–1) Same d.f. as CI for 1 – 2

p-value from t distn Minitab or Excel

t = x 1 − x 2s1

2

n1

+s2

2

n2

n1 and n2 ≥ 30 Use normal tables

Page 35: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Randomized experiment: Researchers either stared or did not stare at drivers stopped at a campus stop sign; Timed how long (sec) it took driver to proceed from sign to a mark on other side of the intersection.

Test whether stare speeds up crossing times.

No Stare Group (n = 14): 8.3, 5.5, 6.0, 8.1, 8.8, 7.5, 7.8, 7.1, 5.7, 6.5, 4.7, 6.9, 5.2, 4.7

Stare Group (n = 13): 5.6, 5.0, 5.7, 6.3, 6.5, 5.8, 4.5, 6.1, 4.8, 4.9, 4.5, 7.2, 5.8

Effect of a stare on driving

Page 36: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Small sample sizes, but

No outliers; no strong skewness.

Checking data

Page 37: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Effect of stare on driving

x 1 − x 2 = 6.63 − 5.59 = 1.04 sec

HypothesesH0: 1 – 2 = 0 HA: 1 – 2 > 0

where 1 = no-stare, 2 = stare

t = x 1 − x 2

se x 1 − x 2( ) =

1.04

0.429 = 2.42

s.e.(x 2 − x 1) =1.36( )

2

14+

0.822( )2

13= 0.429

Page 38: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Effect of stare on driving

Test statistic

df = min(n1–1, n2–1) = 12

Upper tail area of t-distn (12 d.f.)

p = 0.016

P-value

t = x 1 − x 2

se x 1 − x 2( ) =

1.04

0.429 = 2.42

Strong evidence that stare speeds up crossing

Page 39: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Very similar p-value and same conclusion

N.B. Test is based on df = 21 (Welch’s approx)

Effect of stare on driving

Minitab

Strong evidence that stare speeds up crossing

Page 40: Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls

Paired data and 2-sample data

Make sure you distinguish between: 2 measurements on each individual (e.g. before &

after)

Measurements from 2 independent groups

Different cars assessed for insurance claims in garages A and B

Same cars assessed by both garages

2 independent samples

Paired data