chapter7_printable.pdf

25
Dr. Janet Winter, [email protected] Stat 200 Page 1 Chapter 7: Confidence Interval and Sample Size Learning Objectives Upon successful completion of Chapter 7, you will be able to: β€’ Find the confidence interval for the mean, proportion, and variance. β€’ Determine the minimum sample size when determining a confidence interval for the mean and for a proportion. β€’ Level of confidence, maximum error of Estimate (E) and the sample size are inter-related. I. Inference Includes: 1. Estimation of a population parameter (ΞΌ, ρ, or ) using data from a sample. 2. Hypothesis Testing or using sample data to test a conjecture about the population mean (ΞΌ), population proportion (ρ), or population standard deviation ( ). II. Two Kinds of Estimate for Parameters 1. A point estimate of the population parameter is the sample statistic, i.e., the point estimate for the population mean ΞΌ is the sample mean of , the point estimate for the population proportion is the sample proportion, and the point estimate for the population standard deviation is the sample standard deviation s. 2. An interval estimate of a parameter is a range of values determined from the point estimate.

Upload: qq

Post on 27-Jan-2016

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 1

Chapter 7: Confidence Interval and Sample Size

Learning Objectives Upon successful completion of Chapter 7, you will be able to:

β€’ Find the confidence interval for the mean, proportion, and variance. β€’ Determine the minimum sample size when determining a confidence interval for the mean

and for a proportion.

β€’ Level of confidence, maximum error of Estimate (E) and the sample size are inter-related.

I. Inference Includes: 1. Estimation of a population parameter (μ, ρ, or ) using data from a sample.

2. Hypothesis Testing or using sample data to test a conjecture about the population mean (μ), population proportion (ρ), or population standard deviation ( ).

II. Two Kinds of Estimate for Parameters 1. A point estimate of the population parameter is the sample statistic, i.e., the point estimate

for the population mean ΞΌ is the sample mean of , the point estimate for the population proportion is the sample proportion, and the point estimate for the population standard deviation is the sample standard deviation s.

2. An interval estimate of a parameter is a range of values determined from the point estimate.

Page 2: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 2

III. Confidence Interval Estimates for Population Parameters The confidence level is the probability that intervals determined by these methods will contain the parameter. A confidence interval is the range of values determined from a sample statistic and the specified confidence level. The common confidence intervals use 90%, 95%, or 99% confidence levels.

IV.Confidence Interval Estimates for the Population Mean ΞΌ A. When to use the Normal Distribution (z) and when to use the t Distribution

for Confidence Interval Estimates of the Population Mean β€œElementary Statistics: Using the Graphing Calculator for the TI-83/84

”, Triola, Mario F.

Is Οƒ known?

Yes

Yes

Yes

Yes

Yes

No

No

No

No

No

Start

Is n > 30?

t

Use the t distribution

Is the

population normally distributed?

Is n > 30?

Use nonparametric or bootstrapping methods.

z Use the normal distribution

Use nonparametric or bootstrapping methods.

Is the population normally

distributed?

Page 3: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 3

B. Rounding Rules for all Confidence Intervals Estimates of the Mean I. When using actual data:

a) find the mean and standard deviation to 2 extra places than the data. b) round the answer to one more decimal place than the original data.

Note: This is very important! Answers not rounded correctly are marked wrong on Mathzone.

II. When using a mean and standard deviation, work with one more decimal place than the data and round to the same number of decimal places given for the mean.

C. Meaning of ALL Confidence Interval Estimates Be sure to reread P 353 (6th edition) or P 361 (7th edition) in the textbook to better understand the meaning of the confidence interval.

For example: a 90% confidence interval estimate for the population mean is interpreted as 90% of the confidence interval estimates formed with this process include the value of the population mean.

D. z Interval Estimates for the population Mean

I. Requirements a) the population standard deviation ( ) is given b) the sample size n 30; c) But, if the sample size n < 30, the variable must be selected from a normal

distribution

II. Confidence Coefficient

Page 4: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 4

a) Meaning of the Confidence Coefficient

z is called the confidence coefficient, i.e., the number of multiples of the standard error for an interval estimate with a level of confidence. Complete the rest of the table using the confidence level (1-∝). The first 2 have been completed for you (answers at the end). .

.90 .10 .05 .95 .05 .025

b) Method to find the Confidence Coefficient: Find the z value with area to its left, i.e.,

1. Locate inside the Normal Probability Table (Table E)

2. Starting at , move your hand to the left along the row until you reach the Z column. This is the integer and tenths digits. Go back to , next move your hand to the top of its column. This is the hundreds digits.

3. Add the integer and tenths digits to the hundredths digits to find the value for z.

4. Affix a sign in front of the number.

Page 5: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 5

Using the method described, complete the table below. The first 2 have been completed for you (answer at the end).

Confidence Level 1 Ξ±βˆ’

Ξ±

Ξ±/2

(1 Ξ±βˆ’ ) + 2Ξ± Confidence Coefficient

𝑧(π‘Ž/2) .90 .10 .95 .95 1.645 .95 .05 .975 .975 1.96 .99 .80 .98 .96 .93

III. Development of the Confidence Interval Formula

οΏ½Μ…οΏ½ βˆ’ 𝑧 πœŽβˆšπ‘›

< πœ‡ < οΏ½Μ…οΏ½ + 𝑧 πœŽβˆšπ‘›

Whenever the population standard deviation 𝜎 is known and either the population is normally distributed or n β‰₯ 30, the Central Limit Theorem guarantees the sample mean is normally distributed or:

βˆ’π‘§ < οΏ½Μ…οΏ½ βˆ’ πœ‡πœŽοΏ½Μ…οΏ½

< 𝑧

βˆ’ 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½ < οΏ½Μ…οΏ½ βˆ’ πœ‡ < 𝑧 βˆ™ 𝜎 οΏ½

βˆ’οΏ½Μ…οΏ½ βˆ’ 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½ < βˆ’πœ‡ < βˆ’οΏ½Μ…οΏ½ + 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½

(βˆ’οΏ½Μ…οΏ½ βˆ’ 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½) < (βˆ’πœ‡) < (βˆ’οΏ½Μ…οΏ½ + π‘§πœŽοΏ½Μ…οΏ½)

βˆ’(βˆ’οΏ½Μ…οΏ½ βˆ’ 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½) > βˆ’(βˆ’πœ‡) > βˆ’(βˆ’οΏ½Μ…οΏ½ + π‘§πœŽοΏ½Μ…οΏ½)

οΏ½Μ…οΏ½ + 𝑧 βˆ™ πœŽοΏ½Μ…οΏ½ > πœ‡ > οΏ½Μ…οΏ½ βˆ’ π‘§πœŽοΏ½Μ…οΏ½

οΏ½Μ…οΏ½ βˆ’ 𝑧 βˆ™ 𝜎π‘₯ < πœ‡ < οΏ½Μ…οΏ½ + π‘§πœŽοΏ½Μ…οΏ½

οΏ½Μ…οΏ½ βˆ’ π‘§πœŽβˆšπ‘›

< πœ‡ < οΏ½Μ…οΏ½ + π‘§πœŽβˆšπ‘›

Note: If the population standard deviation is not known or stated, use xοΏ½ βˆ’ t s

√n< πœ‡ < xοΏ½ + t s

√n (see section E page 9).

Page 6: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 6

IV. Review of Concepts and Maximum Error of Estimate is the point estimate and the center of the confidence interval

z is the confidence coefficient, the number of multiples of the standard error needed to construct an interval estimate of the correct width to have a level of confidence

is called the maximum error of estimate.

V. Example35 fifth-graders have a mean reading score of 82. The standard deviation of the population is 15.

:

a) Find the 95% confidence interval estimate for the mean reading scores of all fifth-graders. Since we know the population standard deviation and nβ‰₯30, use

. Use Table E backwards with the area to the left of z equal to .025. The value of z or the confidence coefficient is z = 1.9 + .06 = 1.96. This means approximately 95% of the sample means will fall within 1.96 standard errors of the population mean. Use z = 1.96 in the formula.

4.97, rounded to 5, is the maximum error of estimate. Be sure to list it for full credit in your answers.

1βˆ’Ξ±

Page 7: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 7

b) Find the 99% confidence interval estimate of the mean reading scores of all fifth-graders. Since approximately 99% of the sample means will fall within 2.58 standard errors of the population mean, use z = 2.58

𝑋� = 82.1,𝑛 = 35,𝜎 = 15

𝑋� βˆ’ π‘§πœŽβˆšπ‘›

< πœ‡ < 𝑋� + π‘§πœŽβˆšπ‘›

82.1 βˆ’ 2.5815√35

< πœ‡ < 82.1 + 2.5815√35

82.1 βˆ’ 6.54 < πœ‡ < 82.1 + 6.54

82.1 Β± 6.5

75.6 < πœ‡ < 88.5

6.54, rounded to 6.5, is the maximum error of estimate. Be sure to list it in the next to last step.

c) Is the 95% confidence interval or the 99% confidence interval larger? Explain why.

95% confidence level: 77 < ΞΌ < 87

99% confidence level: 75 < ΞΌ < 89

The 99% confidence level is larger because it has a larger z value.

Question 1 A study of 40 English composition professors showed that they spent, on average, 12.6 minutes correcting a student’s term paper.

Find the 90% confidence interval of the mean time for all composition papers when 𝜎 = 2.5 minutes. n = 40 𝑋�= 12.6

Since the population standard deviation is given and n = 40 is greater than 30, use the formula:

𝑿� βˆ’ π’›πˆβˆšπ’

< πœ‡ < 𝑿� + π’›πˆβˆšπ’

If a professor stated that he spent, on average, 30 minutes correcting a term paper, what would be your reaction?

Page 8: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 8

VI. Maximum Error of Estimate for Confidence Interval Estimates of ΞΌ a) Definition The maximum error or estimate is always the largest difference between

the point estimate of a parameter and the actual value of the parameter.

The maximum error of estimate is Β½ the width of the confidence interval.

b) Maximum Error of Estimate for Confidence Interval Estimates of ΞΌ It is the term 𝐸 = 𝑧 𝜎

βˆšπ‘›

VII. Find the Sample Size Using E and the Confidence Level a) Concept: E is like tolerance or allowable error where:

𝐸 = π‘§πœŽβˆšπ‘›

βˆšπ‘›πΈ = π‘§πœŽ

βˆšπ‘› =π‘§πœŽπΈ

𝑛 = οΏ½π‘§πœŽπΈοΏ½2

b) Formula for the Minimum Sample Size for an Interval estimate of the population mean

𝒏 = οΏ½π’›πˆπ‘¬οΏ½πŸ

where E is the maximum error of estimate. If the answer is not a whole number, round up to the next larger whole number to find the sample size, n. If the population standard deviation is not available use the sample standard deviation.

c) Example: An insurance company is trying to estimate the average number of sick days that full-time food service workers use per year. A pilot study found the standard deviation to be 2.5 days. How large a sample must be selected if the company wants to be 95% confident of getting an interval that contains the true mean with a maximum error of 1 day?

s= 2.5 confidence level = 95% maximum error = 1 day

𝒏 = οΏ½π’›πˆπ‘¬οΏ½πŸ

= �𝟏.πŸ—πŸ” βˆ™ 𝟐.πŸ“

𝟏�𝟐

= πŸ’.πŸ—πŸπ¨π« 𝒏 = πŸπŸ’.𝟎𝟏 𝒏 = πŸπŸ“ π’˜π’π’“π’Œπ’†π’“π’”

Page 9: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 9

Question 2 Find the sample size necessary to estimate a population mean to within 0.5 with 95% confidence if the standard deviation is 6.2

𝒏 = �𝒛 βˆ™ πˆπ‘¬

�𝟐

Note: When solving for sample size n, always round up to the next larger integer.

E. t Confidence Interval Estimates for the Population Mean

I. Requirements

a) Οƒ is unknown b) n β‰₯30 c) But, if n < 30, the variable is normally distributed.

II. Characteristics of the t Distribution

a) Bell shaped. Similarities with the normal distribution:

b) Symmetrical about the mean. c) The mean, median, and mode are equal to 0 at the center of the distribution. d) The curve never touches the x axis.

a) The variance is greater than 1. Differences from the standard normal:

b) The t distribution is actually a family of curves based on the degrees of freedom, which is related to sample size.

c) As the sample size increases, the t distribution approaches the standard normal distribution.

Page 10: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 10

Read textbook page 362 (6th Edition) or page 370 (7th Edition) for the comparison between Normal and t distributions.

(Triola & Triola, 2006)

III. Tabled Values for the t Table F: a) Location

β€’ 6th Edition: Table F – located on the inside cover of the text on the opposite side from Table E (standard normal).

β€’ 7th Edition: Table F – located on the last page of the textbook or the pull-out card.

b) Method to find the confidence coefficient for t β€’ Use the column for the appropriate confidence level β€’ Use the row for the appropriate degrees of freedom. β€’ The intersection of the appropriate column and appropriate row is the

confidence coefficient.

Note: If the degrees of freedom needed are not listed in the table, always round down to the nearest table value. For example, if we need degrees of freedom 44, use df=40 since 44 is not listed in the table.

IV. Degrees of Freedom for Estimates of the Population Mean Degrees of freedom are the number of values that are free to vary after a sample statistic has been computed. For the confidence interval for the mean the degrees of freedom are:

sample size minus 1 or d.f. = n – 1

Page 11: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 11

V. Example28 employees of XYZ Company travel an average (mean) of 14.3 miles to work. The standard deviation of their travel time was 2 miles. Find the 95% confidence interval of the true mean or population mean.

:

Since the population standard deviation is not given, use the formula: 𝑋 βˆ’ 𝑑 𝑠

βˆšπ‘›< πœ‡ < 𝑋 + 𝑑 𝑠

βˆšπ‘› 𝑛 = 28 𝑋 = 14.3 𝑠 = 2 𝑑𝑓:27

14.3 βˆ’ 2.0522√28

< πœ‡ < 14.3 + 2.0522√28

14.3 βˆ’ 0.776 < πœ‡ < 14.3 + 0.776

14.3 Β± .8

13.5 < πœ‡ < 15.1

VI. ExampleThe average yearly income for 28 engineering graduates in 2008 is $56,718. The standard deviation was $650.

:

1. Find the 95% confidence interval estimate for the population mean.

Since the population standard deviation is not given, use the formula: 𝑋 βˆ’ 𝑑 𝑠

βˆšπ‘›< πœ‡ < 𝑋 + 𝑑 𝑠

βˆšπ‘› 𝑛 = 28 𝑋 = 56718 𝑠 = 650 𝑑𝑓:27

$πŸ“πŸ”,πŸ•πŸπŸ– βˆ’ 𝟐.πŸŽπŸ“πŸπŸ”πŸ“πŸŽβˆšπŸπŸ–

< πœ‡ < $56,718 + 2.052πŸ”πŸ“πŸŽβˆšπŸπŸ–

πŸ“πŸ”πŸ•πŸπŸ– βˆ’ πŸπŸ“πŸ < πœ‡ < 56718 + 252

$πŸ“πŸ”,πŸ•πŸπŸ– < πœ‡ < $56,970

Note: Now that you are familiar with this problem, it is simpler to record the steps:

πŸ“πŸ”πŸ•πŸπŸ– Β± 𝟐.πŸŽπŸ“πŸπŸ”πŸ“πŸŽβˆšπŸπŸ–

πŸ“πŸ”πŸ•πŸπŸ– Β± πŸπŸ“πŸ.𝟏

56718 Β± 252 (rounded to the same number of places as the mean)

πŸ“πŸ”πŸ’πŸ”πŸ” < πœ‡ < 56970

2. If an individual graduate wishes to see if he or she is being paid below average, what salary value should he or she use?

Use the lower bound of the confidence interval: $56,466.

Page 12: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 12

Question 3 The prices (in dollars) for a particular model of 6.0 megapixels digital camera with 3x optical zoom are listed as: $225, $240, $215, $202, $206, $211, $210, $193, $250, $225. Estimate the true mean using this data with 90% confidence.

Since the population standard deviation is not given, use: 𝑿 Β± 𝒕 π’”βˆšπ’

. Do not use the Οƒ from the calculator. This is a sample, so be sure to use s and work in 2 more places than the data and round the answers to one more place than the data.

𝑿 = πŸπŸπŸ•.πŸ•πŸŽ 𝒔 = πŸπŸ•.πŸ’πŸ— 𝒕 = 𝟏.πŸ–πŸ‘πŸ‘ 𝒅𝒇:πŸ— 𝒏 = 𝟏𝟎

Question 4 John wants to estimate the average value of the homes in his town with a 99% confidence interval. Use his random sample of 36 homes with an average value of $251,131.42 and standard deviation $1321.46 to find the confidence interval.

Since the population standard deviation is not given, use the formula - X ± t s√n

. .

The degrees of freedom equals 35, but df = 35 is not available in the table. Use the next lower df or df = 34.

V. Confidence Interval Estimates for Population Proportions A. Symbols Used to Estimate Proportion

β€’ p = symbol for the population proportion β€’ pοΏ½ = symbol for the sample proportion; read p β€œhat” β€’ π‘žοΏ½ = 1 βˆ’ οΏ½Μ‚οΏ½= symbol for the same proportion of failures.

Where x = number of sample units that possess the characteristics of interest and n = sample size. β€’ οΏ½Μ‚οΏ½ + π‘₯

𝑛

B. Development of the Formula For a Binomial Probability Distribution with x = number of successes For x: with n p β‰₯ 5 and n (1 – p) β‰₯ 5

X = number of successes is approximately normally distributed with: πœ‡ = 𝑛𝑝 𝜎 = �𝑛𝑝(1 βˆ’ 𝑝)

Thus for proportions 𝑝 = π‘₯𝑛

πœ‡πœŒ =πœ‡π‘›

=𝑛𝑝𝑛

= 𝑝

Page 13: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 13

1. The mean of οΏ½Μ‚οΏ½ is 𝑝 2. The standard deviation for οΏ½Μ‚οΏ½ becomes:

𝜎𝜌 =πœŽπ‘›

=�𝑛𝑝(1 βˆ’ 𝑝)

𝑛

.

𝜎𝜌 = �𝑛𝑝(1 βˆ’ 𝑝)𝑛2

𝜎𝜌 = �𝑝(1 βˆ’ 𝑝)𝑛

Next we use the pattern for the confidence interval estimate of the population mean, point estimate Β± 𝒛 βˆ™ 𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 π’…π’†π’—π’Šπ’‚π’•π’Šπ’π’

It becomes 𝒑� Β± 𝒛�𝒑�(πŸβˆ’π’‘οΏ½)𝒏

C. Formula for the confidence interval estimate of the population proportion

Note: a shorter version of the formula to estimate the population proportion is:

οΏ½Μ‚οΏ½ Β± π‘§οΏ½π‘οΏ½βˆ™π‘žοΏ½π‘›

when np and nq are each greater than or equal to 5.

D. Rounding Rules for proportions Always use 4 decimal places for the computation and round the answers to 3 decimal places.

E. 1. 55 students in a random sample of 450 enrolled in summer classes. Estimate the

population proportion of students taking classes this summer.

Example:

𝒑� =𝑿𝒏

=πŸ“πŸ“πŸ’πŸ“πŸŽ

= 𝟎.𝟏𝟐𝟐𝟐 𝒒� = πŸβˆ’.𝟏𝟐𝟐𝟐 =.πŸ–πŸ•πŸ•πŸ–

𝒑� βˆ’ 𝒛�𝒑�𝒒�𝒏

< 𝑝 < 𝒑� + 𝒛�𝒑�𝒒�𝒏

𝟎.𝟏𝟐𝟐𝟐 Β± 𝟏.πŸ—πŸ”οΏ½πŸŽ.𝟏𝟐𝟐𝟐 βˆ™ 𝟎.πŸ–πŸ•πŸ•πŸ–

πŸ’πŸ“πŸŽ

𝟎.𝟏𝟐𝟐±.πŸŽπŸ‘πŸŽ 𝟎.πŸŽπŸ—πŸ < 𝑝 < .152 πŸ—.𝟐% < 𝑝 < 15.2%

Page 14: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 14

2. Is an estimate of 11% about right?

Yes, 11% is about right since it is contained within the confidence interval estimate.

Question 5 A survey found that out of 200 students, 168 said they needed loans or scholarships to pay their tuition and expenses. Find the 90% confidence interval for the population proportion of students needing loans or scholarships. 𝒑� = 𝟎.πŸ–πŸ’ 𝒒� = 𝟎.πŸπŸ”

οΏ½Μ‚οΏ½ βˆ’ 𝑧 οΏ½οΏ½Μ‚οΏ½ π‘žοΏ½π‘›

< 𝑝 < οΏ½Μ‚οΏ½ + 𝑧 οΏ½οΏ½Μ‚οΏ½ π‘žοΏ½π‘›

Question 6 A study by the University of Michigan found that one in five 13 and 14 year olds is a sometime smoker. To see how the smoking rate of the students at a large school district compared to the national rate, the superintendent surveyed two hundred 13 and 14 year old students and found that 23% said they were sometime smokers. Find the 99% confidence interval of the true proportion and compare this with the University of Michigan’s study.

𝑛 = 200 𝒑� = 0.23 𝒒� = 1 βˆ’ 0.23 = 0.77 οΏ½Μ‚οΏ½ βˆ’ 𝑧 �𝑝� π‘žοΏ½π‘›

< 𝑝 < οΏ½Μ‚οΏ½ + 𝑧 �𝑝� π‘žοΏ½π‘›

F. Formula for the Minimum Sample Size to Estimate a Population Proportion

𝑬 = 𝒛�𝒑�𝒒�𝒏

𝑬𝒛

= �𝒑�𝒒�𝒏

οΏ½π‘¬π’›οΏ½πŸ

=𝒑�𝒒�𝒏

οΏ½π’›π‘¬οΏ½πŸ

=𝒏𝒑�𝒒�

π’›πŸ

π‘¬πŸ=

𝒏𝒑�𝒒�

𝒑�𝒒� βˆ™π’›πŸ

π‘¬πŸ= 𝒏

𝑛 = οΏ½Μ‚οΏ½π‘žοΏ½ �𝑧𝐸�2

Use οΏ½Μ‚οΏ½ from a pilot study or previous estimate if it is available. Otherwise, use οΏ½Μ‚οΏ½ = .5. 𝑛 must be a whole number. If it’s not a whole number, round up to the next larger whole number.

Page 15: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 15

G. A medical researcher wishes to determine the percentage of drivers using GPS systems in their car. He wishes to be 99% confident that the estimate is within 2 percentage points of the true proportion. A recent study of 180 drivers showed that 25% used GPS systems.

Example:

a) How large should the sample size be? Since a recent study showed 25% used GPS Systems, οΏ½Μ‚οΏ½ = 0.25 π‘Žπ‘›π‘‘ π‘žοΏ½ = 0.75. .

𝑛 = οΏ½Μ‚οΏ½π‘žοΏ½ �𝑧𝐸�2

𝑛 = 0.25 βˆ™ 0.75 οΏ½2.58.02

οΏ½2

𝑛 = 3120.187

Since the computed n is not a whole number, round up and use n = 3121.

b) If no estimate of the sample proportion is available, how large should the sample be? Since there is no prior estimate of p, use p = 0.5 and q = 0.5

𝒏 = 𝒑�𝒒� οΏ½π’›π‘¬οΏ½πŸ

𝒏 = 𝟎.πŸ“ βˆ™ 𝟎.πŸ“ �𝟐.πŸ“πŸ–πŸŽ.𝟎𝟐

�𝟐

𝒏 = πŸ’πŸπŸ”πŸŽ.πŸπŸ“

Since the computed n is a not a whole number, round up and use n = 4161.

Note: the sample size needs to be larger when there is no prior estimate for p.

VI.Confidence Interval Estimate for the Population Variance and Population Standard Deviation A. General Comments

To find confidence intervals for variances and standard deviations, β€’ Use the chi-square distribution β€’ Samples must be selected from normally distributed populations. β€’ Assume the population variance is π›”πŸ.

β€’ The chi-square distribution is obtained from the values of (π§βˆ’πŸ)𝐬𝟐

π›”πŸ or 𝐱𝟐 = (π§βˆ’πŸ)𝐬𝟐

π›”πŸ

Page 16: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 16

B. Chi-Square Distribution Reference textbook page 378 (6th Edition) or page 386 (7th Edition).

I. Characteristics

β€’ Chi-Square is always positive. β€’ It is a family of distributions dependent on degrees of freedom (n – 1). β€’ The mode is always slightly to the left of degrees of freedom. β€’ As n increases, Chi-Square walks off to the right. β€’ Chi-Square distribution is skewed to the right.

II. Finding Chi-Square values on the Chi-Square table Since is not symmetrical, two different values are used in the confidence interval formula for the population variance.

For right, use the column . For left, use the column in the table for .

Process: 1. Use the confidence level to find . 2. Use the column with the appropriate degrees of freedom to find right. 3. Find . 4. Use the column with the appropriate degrees of freedom to find left. Chi-Square Values of df: 18

Chi-Square left

Left

Confidence Level

Right

Chi-Square right

9.390 .95 .90 .05 28.869 8.231 .975 .95 .25 31.526 10.865 .90 .80 .10 25.989 7.015 .99 .98 .01 34.805 6.265 .995 .99 .005 37.156

Page 17: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 17

C. Formulas 1. Confidence Interval Estimate for the Population Variance:

df: n - l

Note: right is on the left side of the equation but the right side of the graph and left is on the right side of the equation but the left side of the graph.

2. Confidence Interval Estimate for the Population Standard Deviation Since the population standard deviation is the square root of the population variance, the confidence interval estimate of the population standard deviation is:

D. Rounding Rules for Standard Deviation or Variance I. When using actual data:

a) find the standard deviation to 2 extra places than the data. b) round the answer to one more decimal place than the original data.

II. When using sample standard deviation or variance, work with one more decimal place than the statistic and round to the same number of places as the standard deviation or variance given.

E. Find the confidence interval for the standard deviation in the time it takes to fill a car with gas. In a sample of 23 fill-ups, the standard deviation of the time it takes to fill the car is 3.8 minutes. Assume the variable is normally distributed.

Example:

Page 18: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 18

Note: the answer has the same number of decimal places as the given sample standard deviation since the work is done with statistics instead of data.

Question 7 Find the 99% confidence interval for the variance and standard deviation of the weights of one-gallon containers of motor oil when the sample of 14 containers has a variance of 3.2. Assume the variable is normally distributed.

F. The number of calories in a 1-ounce serving of various regular cheeses is shown. Estimate the population variance with 90% confidence.

Example:

110 45 100 95 110 110 100 110 95 120 130 100 80 105 105 90 110 70 125 108

Is the 90% confidence interval estimate for the population variance.

Note: β€’ Use the tabled values for and use s rounded to 2 more places than the data in the

computation. β€’ Since data is used to compute the standard deviation, the answer has one more place

than the original data.

Page 19: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 19

Question 8 A service station advertises a wait of no more than 30 minutes for an oil change. A sample of 28 oil changes has a standard deviation of 5.2 minutes. Find the 95% confidence interval of the population standard deviation of the times spent waiting for an oil change.

(𝒏 βˆ’ 𝟏)π’”πŸ

π’™πŸπ’“π’Šπ’ˆπ’‰π’•< 𝝈𝟐 <

(𝒏 βˆ’ 𝟏)π’”πŸ

π’™πŸπ’π’†π’‡π’•

VII. Summaries A. Estimates for Population Parameters

β€’ Estimation is an important aspect of inferential statistics. β€’ A point estimate is a single value with no accuracy specified. β€’ An interval estimate is a range of values with its accuracy specified by the confidence level. β€’ Every question about a confidence interval will have the words β€œFind a confidence interval

estimate for…”. β€’ Pay particular attention to the words determine whether the confidence interval is for the

mean, proportion, or variance (or its square root, the standard deviation).

B. Minimum Sample Sizes to Estimate Population Parameters β€’ You always need to know both the confidence level and the maximum error of estimate.

In addition, for the: 1. Mean – the population standard deviation (given or estimate) is also required.

𝑛 = �𝑧 βˆ™ 𝜎𝐸�2

2. Proportion – an estimate of the proportion from a pilot study is preferred (or use p = .5).

𝑛 = οΏ½Μ‚οΏ½π‘žοΏ½ �𝑧𝐸�2

Page 20: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 20

C. Rounding Rules I. For Estimates of the Mean

a) When using actual data: (a) find the mean and standard deviation to 2 extra places than the date. (b) round the answer to one more decimal place than the original data.

Note: This is very important! Answers not rounded correctly are marked wrong on Mathzone.

b) When using a mean and standard deviation, work with one more decimal place than the data and round to the same number of decimal places given for the mean.

II. For Estimates of the Standard Deviation or Variance a) When using actual data:

(a) find the standard deviation to 2 extra places than the data. (b) round the answer to one more decimal place than the original data.

b) When using sample standard deviation or variance, work with one more decimal place than the statistic and round to the same number of places as the standard deviation or variance given.

III. For Estimates of the Proportions a) Always use 4 decimal places for the computation and round the answers to 3

decimal places.

Page 21: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 21

Answer: Confidence Coefficient z is called the confidence coefficient, i.e., the numbers of multiples of the standard error for an interval estimate with a 1βˆ’ ∝ level of confidence.

1βˆ’ ∝ ∝ ∝2

.90 .10 .05

.95 .05 .025

.99 .01 .005

.80 .20 .10

.85 .15 .075

Answer: Method to find the Confidence Coefficient

1. Use the confidence level (1 βˆ’ 𝛼) to find 𝛼/2. 2. Use the 𝛼/2 column with the appropriate degrees of freedom to find x2 right. 3. Find 1 βˆ’ 𝛼/2. 4. Use the 1 βˆ’ 𝛼/2 column with the appropriate degrees of freedom to find x2 left.

Confidence Level 1 Ξ±βˆ’

Ξ±

Ξ±/2

(1 Ξ±βˆ’ ) + 2Ξ± Confidence Coefficient

𝑧(π‘Ž/2)

.90 .10 .95 .95 1.645

.95 .05 .975 .975 1.96

.99 .01 .995 .995 2.58

.80 .20 .90 .90 1.28

.98 .02 .99 .99 2.33

.96 .04 .98 .98 2.05

.93 .07 .965 .965 1.81

Page 22: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 22

Answer: Question 1 A study of 40 English composition professors showed that they spent, on average, 12.6 minutes correcting a student’s term paper.

a) Find the 90% confidence interval of the mean time for all composition papers when Οƒ= 2.5 minutes. n = 40 XοΏ½= 12.6

Since the population standard deviation is given and n=40 is greater than 30, use the formula:

𝑋� βˆ’ π‘§πœŽβˆšπ‘›

< πœ‡ < 𝑋� + π‘§πœŽβˆšπ‘›

12.6 βˆ’ 1.6452.5√40

< πœ‡ < 12.6 + 1.6452.5√40

12.6 βˆ’ 0.6502 < πœ‡ < 12.6 + 0.6502 12.6 βˆ’ 0.7 < πœ‡ < 12.6 + 0.7 𝟏𝟏.πŸ— < πœ‡ < 13.3

b) If a professor stated that he spent, on average, 30 minutes correcting a term paper, what would be your reaction?

11.9 < πœ‡ < 13.3

It would be highly unlikely since 30 minutes is far longer than the upper bound of 13.3 minutes.

Answer: Question 2 Find the sample size necessary to estimate a population mean to within 0.5 with 95% confidence if the standard deviation is 6.2.

𝑛 = �𝑧 βˆ™ 𝜎𝐸�2

.

𝑛 = οΏ½(1.96)(6.2)

0.5οΏ½2

= [24.304]2 = 590.684

𝑛 = 591

Note: When solving for sample size n, always round up to the next larger integer (Why?)

Page 23: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 23

Answer: Question 3 The prices (in dollars) for a particular model of 6.0 megapixels digital camera with 3x optical zoom are listed as: $225, $240, $215, $202, $206, $211, $210, $193, $250, $225. Estimate the true mean using this data with 90% confidence.

Since the population standard deviation is not given, use: X ± t s√n

. Do not use the Οƒ from the

calculator. This is a sample, so be sure to use s and work in 2 more places than the data and round the answers to one more place than the data. 𝑋 = 217.70 𝑠 = 17.49 𝑑 = 1.833 𝑑𝑓: 9 𝑛 = 10

217.70 ± 1.83317.49√10

217.7 Β± 10.1 πŸπŸŽπŸ•.πŸ” < πœ‡ < 227.8

Note: 𝑋 and 𝑠 are found to two decimal places more than the data, but the answer is rounded back to one more place than the data.

Answer: Question 4 John wants to estimate the average value of the homes in his town with a 99% confidence interval. Use his random sample of 36 homes with an average value of $251,131.42 and standard deviation $1321.46 to find the confidence interval.

Since the population standard deviation is not given, use the formula X ± t s√n

.

The degrees of freedom equals 35, but df = 35 is not available in the table. Use the next lower df or df = 34. .

251131.42 ± 2.7281321.46√36

251131.42 Β± 600.82 πŸπŸ“πŸŽπŸ“πŸ‘πŸŽ.πŸ”πŸŽ < πœ‡ < 251732.24

Note: Since statistics are given, work one more place than the statistic but round the answer back to the same number of places as 𝑋.

Page 24: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 24

Answer: Question 5 A survey found that out of 200 students, 168 said they needed loans or scholarships to pay their tuition and expenses. Find the 90% confidence interval for the population proportion of students needing loans or scholarships.

οΏ½Μ‚οΏ½ = 0.84 π‘žοΏ½ = 0.16

οΏ½Μ‚οΏ½ βˆ’ π‘§οΏ½οΏ½Μ‚οΏ½π‘žοΏ½π‘›

< 𝑝 < οΏ½Μ‚οΏ½ + π‘§οΏ½οΏ½Μ‚οΏ½π‘žοΏ½π‘›

0.84 βˆ’ 1.645οΏ½0.84 βˆ™ 0.16

200< 𝑝 < 0.84 + 1.645οΏ½

0.84 βˆ™ 0.16200

0.84 Β± 1.645οΏ½(0.84 βˆ™ 0.16 Γ· 200) 0.84 βˆ’ 0.043 < 𝑝 < 0.84 + 0.043 0.797 < 𝑝 < 0.883 πŸ•πŸ—.πŸ•% < 𝑝 < 88.3%

Answer: Question 6 A study by the University of Michigan found that one in five 13 and 14 year olds is a sometime smoker. To see how the smoking rate of the students at a large school district compared to the national rate, the superintendent surveyed two hundred 13 and 14 year old students and found that 23% said they were sometime smokers. Find the 99% confidence interval of the true proportion and compare this with the University of Michigan’s study. . 𝑛 = 200 οΏ½Μ‚οΏ½ = 0.23 π‘žοΏ½ = 1 βˆ’ 0.23 = 0.77

οΏ½Μ‚οΏ½ βˆ’ π‘§οΏ½οΏ½Μ‚οΏ½π‘žοΏ½π‘›

< 𝑝 < οΏ½Μ‚οΏ½ + π‘§οΏ½οΏ½Μ‚οΏ½π‘žοΏ½π‘›

0.23 Β± 2.58οΏ½0.23 βˆ™ 0.77

200

0.23 Β± 2.58 βˆ™ οΏ½(0.23 βˆ™ 0.77 Γ· 200) 0.23 Β± 0.077 0.153 < 𝑝 < 0.307 πŸπŸ“.πŸ‘% < 𝑝 < 30.7%

Since 1/5 = 0.20, the University of Michigan study falls within the confidence interval and it is OK.

Page 25: Chapter7_Printable.pdf

Dr. Janet Winter, [email protected] Stat 200 Page 25

Answer: Question 7 Find the 99% confidence interval for the variance and standard deviation of the weights of one-gallon containers of motor oil when the sample of 14 containers has a variance of 3.2. Assume the variable is normally distributed.

𝑛 = 14 𝑠2 = 3.2 (𝑛 βˆ’ 1)𝑠2

π‘₯2π‘Ÿπ‘–π‘”β„Žπ‘‘ < 𝜎2 <(𝑛 βˆ’ 1)𝑠2

π‘₯2𝑙𝑒𝑓𝑑

13 βˆ™ 3.229.819

< 𝜎2 <13 βˆ™ 3.23.565

. 𝟏.πŸ’ < π›”πŸ < 11.7 variance . 𝟏.𝟐 < 𝜎 < 3.4 standard deviation

Note: The answer has the same number of decimal places as the given sample standard deviation.

Answer: Question 8 A service station advertises a wait of no more than 30 minutes for an oil change. A sample of 28 oil changes has a standard deviation of 5.2 minutes. Find the 95% confidence interval of the population standard deviation of the times spent waiting for an oil change.

(𝑛 βˆ’ 1)𝑠2

π‘₯2π‘Ÿπ‘–π‘”β„Žπ‘‘ < 𝜎2 <(𝑛 βˆ’ 1)𝑠2

π‘₯2𝑙𝑒𝑓𝑑

27 βˆ™ 5.22

43.194 < 𝜎2 <27 βˆ™ 5.22

14.573

πŸπŸ”.πŸ— < π›”πŸ < 50.1 variance in waiting time. πŸ’.𝟏 < 𝜎 < 7.1 standard deviation in waiting time.

Works Cited Triola, M.D., Marc M. and Mario F. Triola. Biostatistics for the Biologoical and Health Sciences. New York: Pearson Education, Inc., 2006.