1 type ii error, power and sample size calculations chapter 23 inference for means: part 2

1

Type II Error, Power and Sample Size

Calculations

Type II Error, Power and Sample Size

Calculations

Chapter 23 Inference for Means: Part 2

2

Hypothesis Testing for , Type II Error Probabilities (Right-tail example)

• Example– A new billing system for a department store will be cost-

effective only if the mean monthly account is more than $170.

– A sample of 400 accounts has a mean of $174 and s = $65.

– Can we conclude that the new system will be cost effective?

3

• Hypotheses– The population of interest is the credit accounts at

the store.– We want to know whether the mean account for all

customers is greater than $170.

HA : m > 170

– Where m is the mean account value for all customers

Example (cont.)

H0 : m = 170

4

–Test statistic:H0 : m = 170HA : m > 170

174 1701.23

65 400

xts n

Example (cont.)

174, 65x s

5

399-value ( 1.23) .1097P P t

01.23t

P-value: The probability of observing a value of the test statistic as extreme or more extreme then t = 1.23, given that m = 170 is…

Example (cont.)

t399

Since the P-value > .05, we conclude that there is not sufficient evidence to reject H0 : =170.

Type II error is possible

6

Calculating , the Probability of aType II Error

• Calculating for the t test is not at all straightforward and is beyond the level of this course– The distribution of the test statistic t is quite

complicated when H0 is false and HA is true– However, we can obtain very good approximate

values for using z (the standard normal) in place of t.

7

Calculating , the Probability of aType II Error (cont.)

• We need to1. specify an appropriate significance level ;2. Determine the rejection region in terms of z3. Then calculate the probability of not being in the

rejection when = 1, where 1 is a value of that makes HA true.

8

–Test statistic:H0 : m = 170HA : m > 170

Choose = .05Rejection region in terms of z: z > z.05 = 1.645

Example (cont.) calculating

rejection region in terms of :

1701.645

65

40065

170 1.645 175.34.400

x

xz

x

a= 0.05

175.34170

9

Express the rejection region directly, not in standardized terms

175.34

a=.05

m= 170


– The rejection region with a = .05.

34.175x

Do not reject H0

=m 180

HA: m = 180

H0: m = 170

Specify the alternative value

under HA.

– Let the alternative value be m = 180 (rather than just m>170)

10

175.34

a=.05

m= 170


34.175x =m 180

H1: m = 180

H0: m = 170

–A Type II error occurs when a false H0 is not rejected. Suppose =180, that is, H0 is false.

A false H0……is not rejected

11

175.34m= 170


=m 180

H1: m = 180

H0: m = 170

0(180) ( 175.34 )

( 175.34 180)

P x given that H is false

P x given that

0764.)40065

18034.175z(P

Power when =180 =

1-(180)=.9236

12

• Increasing the significance level ,a decreases the value of , b and vice versa.

Effects on b of changing a

m= 170 =m 180

a1b1a2 >b2 <

13

• A hypothesis test is effectively defined by the significance level a and by the sample size n.

• If the probability of a Type II error b is judged to be too large, we can reduce it by– increasing a, and/or– increasing the sample size.

Judging the Test

14

• Increasing the sample size reduces b

Judging the Test

By increasing the sample size the standard deviation of the sampling distribution of the mean decreases. Thus, the cutoff value of for the rejection region decreases.

Recall : ,x s

RR z z or x zs n n

15Lx =m 180m= 170

Judging the Test

Lx

Note what happens when n increases:

Lx LxLx Lx

a does not change,but b becomes smaller

• Increasing the sample size reduces b

Recall : ,x s

RR z z or x zs n n

16

• Increasing the sample size reduces b• In the example, suppose n increases from 400 to

1000.65

170 1.645 173.381000

173.38 180( ) ( 3.22) 0

65 1000

sx z

n

P Z P Z

Judging the Test

• a remains 5%, but the probability of a Type II error decreases dramatically.

17

A Left - Tail Test• Self-Addressed Stamped Envelopes.

– The chief financial officer in FedEx believes that including a stamped self-addressed (SSA) envelope in the monthly invoice sent to customers will decrease the amount of time it take for customers to pay their monthly bills.

– Currently, customers return their payments in 24 days on the average, with a standard deviation of 6 days.

– Stamped self-addressed envelopes are included with the bills for 75 randomly selected customers. The number of days until they return their payment is recorded.

18

• The parameter tested is the population mean payment period (m) for customers who receive self-addressed stamped envelopes with their bill.

• The hypotheses are:H0: m = 24H1: m < 24

• Use = .05; n = 75.

A Left - Tail Test: Hypotheses

19

• The rejection region: • t < t.05,74 = 1.666• Results from the 75 randomly selected

customers:

A Left - Tail Test: Rejection Region

22.95 days, 6 daysx s

20

• The test statistic is:

A Left -Tail Test: Test Statistic

.05 1.666t t t

22.95 241.52

6 75

xts n

We do not reject the null hypothesis. Note that the P-value = P(t74 < -1.52) = .066.

Since our decision is to not reject the null hypothesis,A Type II error is possible.

Since the rejection region is

21

• The CFO thinks that a decrease of one day in the average payment return time will cover the costs of the envelopes since customer checks can be deposited earlier.

• What is (23), the probability of a Type II error when the true mean payment return time is 23 days?

Left-Tail Test: Calculating , the Probability of a Type II Error

22

–Test statistic:H0 : m = 24HA : m < 24

Choose = .05Rejection region in terms of z: z < -z.05 = -1.645

Left-tail test: calculating (cont.)


241.645

6

756

24 1.645 22.86.75

x

xz

x

a = 0.05

22.86 24

23

Express the rejection region directly, not in standardized terms

a=.05


– The rejection region with a = .05.

22.86x

Do not reject H0

22.86 =m 24

H0: m = 24

m= 23

HA: m = 23Specify the alternative value

under HA.

– Let the alternative value be m = 23 (rather than just m < 24)

24

22.86 m= 23


=m 24

HA: m = 23

H0: m = 24

0(23) (fail to reject : 24 when 23)

( 22.86 when 23)

P H

P x

22.86 23.58

6 75P z

Power when =23 is 1-(23)=1-.58=.42

a=.05

25

A Two - Tail Test for • The Federal Communications Commission

(FCC) wants competition between phone companies. The FCC wants to investigate if AT&T rates differ from their competitor’s rates.

• According to data from the (FCC) the mean monthly long-distance bills for all AT&T residential customers is $17.09.

26

A Two - Tail Test (cont.)• A random sample of 100 AT&T customers is

selected and their bills are recalculated using a leading competitor’s rates.

• The mean and standard deviation of the bills using the competitor’s rates are

• Can we infer that there is a difference between AT&T’s bills and the competitor’s bills (on the average)?

$17.55, $3.87x s

27

• Is the mean different from 17.09?

• n = 100; use = .05

H0: m = 17.09

: 17.09AH

A Two - Tail Test (cont.)

28

0

/a 2 = 0.025 /a 2 = 0.025

17.55 17.09

3.87.1

1001 9

xts n

-t /2a = -1.9842 t /2a = 1.9842

Rejection region

Rejection region

A Two – Tail Test (cont.)

.025,99 .025,99

1.9842 1.9842

t t or t t

t or t

t99

29

/a 2 = 0.025 /a 2 = 0.02517.55 17.09

1.193.87 100

xts n

-t /2a = -1.9842 t /2a = 1.9842

There is insufficient evidence to conclude that there is a difference between the bills of AT&T and the competitor.

-1.19

Also, by the P-value approach:The P-value = P(t < -1.19) + P(t > 1.19) = 2(.1184) = .2368 > .05

1.190

A Two – Tail Test: Conclusion

A Type II error is possible

30

• The FCC would like to detect a decrease of $1.50 in the average competitor’s bill. (17.09-1.50=15.59)

• What is (15.59), the probability of a Type II error when the true mean competitor’s bill is $15.59?

Two-Tail Test: Calculating , the Probability of a Type II Error

31

/a 2 = 0.025

17.0916.33 17.85

/a 2 = 0.025

Rejection regionTwo – Tail Test: Calculating (cont.)

.025 .025

1.96 1.96

z z or z z

z or z


17.091.96

3.87100

3.8717.09 1.96

10016.33

17.091.96

3.87100

3.8717.09 1.96

10017.85

x

xz

x

x

xz

x

x

Do not reject H0

Reject H0

3216.63

m= 15.59

Two – Tail Test: Calculating (cont.)

=m 17.09

HA: m = 15.59

H0: m = 17.09

(15.59) (16.33 17.85 given that 15.59)P x 16.33 15.59 15.59 17.85 15.59

3.87 100 3.87 100 3.87 100

(1.912 5.84) .028

xP

P z

Power when =15.59 is 1-(15.59)=.972

17.85

General formula: Type II Error Probability (A) for a Level Test

33

00

00

0 00 /2 /2

:

: 1

:

AA

AA

A AA

H P z zn

H P z zn

H P z z P z zn n

Sample Size n for which a level test also has (A) =

34

2

0

2

/2

0

( )for a 1-tailed (right or left) test

( ) for a 2-tailed test (approx. solution)

A

A

z z

nz z

1 type ii error, power and sample size calculations chapter 23 inference for means: part 2

Documents

false h

error slide

error calculating

possible slide

h0 slide

t test

tail example example

test statistic t