elementary statistics

27
Definition: The probability distribution of SQQS1013 Elementary Statistics SAMPLING SAMPLING DISTRIBUTIONS DISTRIBUTIONS 6.1 POPULATION AND SAMPLING DISTRIBUTION 6.1.1 Population Distribution Suppose there are only five students in an advanced statistics class and the midterm scores of these five students are: 70 78 80 80 95 Let x denote the score of a student. Chapter 6: Sampling Distributions 1 Example 1

Upload: ripkah-vyanee

Post on 30-Oct-2014

117 views

Category:

Documents


2 download

DESCRIPTION

Chapter 6

TRANSCRIPT

Page 1: Elementary Statistics

Definition:

The probability distribution of the population data.

SQQS1013 Elementary Statistics

SAMPLING SAMPLING DISTRIBUTIONSDISTRIBUTIONS

6.1 POPULATION AND SAMPLING DISTRIBUTION

6.1.1 Population Distribution

Suppose there are only five students in an advanced statistics class and the midterm scores

of these five students are:

70 78 80 80 95

Let x denote the score of a student.

Chapter 6: Sampling Distributions 1

Example 1

Page 2: Elementary Statistics

Definition:

The probability distribution of a sample statistic.

Definition:

The sampling distribution of is a distribution obtained by

using the means computed from random samples of a

specific size taken from a population.

SQQS1013 Elementary Statistics

Mean for Population

Based on Example 1, to calculate mean for population:

Standard Deviation for Population

Based on example 1, to calculate standard deviation for population:

6.1.2 Sampling Distribution

Sample statistic such as median, mode, mean and standard deviation

6.1.2.1 The Sampling Distribution of the Sample Mean

Chapter 6: Sampling Distributions 2Example 2

Page 3: Elementary Statistics

SQQS1013 Elementary Statistics

Reconsider the population of midterm scores of five students given in example 1. Let say we draw all possible samples of three numbers each and compute the mean.

Total number of samples = 5C3 =

Suppose we assign the letters A, B, C, D and E to scores of the five students, so that

A = 70, B = 78, C=80, D = 80, E = 95

Then the 10 possible samples of three scores each are

ABC, ABD, ABE, ACD, ACE, ADE, BCD, BCE, BDE, CDE

Chapter 6: Sampling Distributions 3

for sample ABC,

Use all the possible means samples to form the frequency distribution table

Page 4: Elementary Statistics

SQQS1013 Elementary Statistics

Sampling Error

Sampling error is the difference between the value of a sample statistic and

the value of the corresponding population parameter. In the case of mean,

Sampling error =

Mean ( ) for the Sampling Distribution of

Based on example 2,

Chapter 6: Sampling Distributions 4

Sampling distribution offor samples of three scores

“The mean of the sampling distribution of is always equal to the mean of the population”

x

FORMULA

Page 5: Elementary Statistics

SQQS1013 Elementary Statistics

Standard Deviation ( ) for the Sampling Distribution of

Where: is the standard deviation of the population

n is the sample size

This formula is used when

When N is the population size

Where is the finite population correction factor

This formula is used when

The spread of the sampling distribution of is smaller than the spread of the corresponding population distribution, .

The standard deviation of the sampling distribution of decreases as the

sample size increase.

The standard deviation of the sample means is called the standard error of the mean.

Chapter 6: Sampling Distributions 5

FORMULA

FORMULA

Page 6: Elementary Statistics

SQQS1013 Elementary Statistics

6.1.3 Sampling From a Normally Distributed Population

The criteria for sampling from a normally distributed population are:

The shape of the sampling distribution of is normal, whatever the value of n.

o Shape of the sampling distribution

Chapter 6: Sampling Distributions 6

Page 7: Elementary Statistics

SQQS1013 Elementary Statistics

6.1.4 Sampling From a Not Normally Distributed Population

Most of the time the population from which the samples are selected is not

normally distributed. In such cases, the shape of the sampling distribution of

is inferred from central limit theorem.

Central limit theorem

For a large sample size, the sampling distribution of is approximately normal, irrespective of the population distribution.

The sample size is usually considered to be large if .

Shape of the sampling distribution

6.1.5 Application of the Sampling Distribution of

Chapter 6: Sampling Distributions 7

Page 8: Elementary Statistics

SQQS1013 Elementary Statistics

The z value for a value of is calculated as:

In a study of the life expectancy of 500 people in a certain geographic region, the mean

age at death was 72 years and the standard deviation was 5.3 years. If a sample of 50

people from this region is selected, find the probability that the mean life expectancy will

be less than 70 years.

Solution:

Chapter 6: Sampling Distributions 8

FORMULA

Example 3

Example 4

Page 9: Elementary Statistics

SQQS1013 Elementary Statistics

Assume that the weights of all packages of a certain brand of cookies are normally

distributed with a mean of 32 ounces and a standard deviation of 0.3 ounce. Find the

probability that the mean weight, of a random sample of 20 packages of this brand of

cookies will be between 31.8 and 31.9 ounces.

Solution:

Although the sample size is small ( ), the shape of the sampling distribution of is

normal because the population is normally distributed.

Chapter 6: Sampling Distributions 9

Page 10: Elementary Statistics

SQQS1013 Elementary Statistics

A Bulletin reported that children between the ages of 2 and 5 watch an average of 25 hours

of television per week. Assume the variable is normally distributed and the standard

deviation is 3 hours. If 20 children between the ages of 2 and 5 are randomly selected, find

the probability that the mean of the number of hours they watch television will be:

a) greater than 26.3 hours.

b) less than 24 hours

c) between 24 and 26.3 hours.

Solution:

Chapter 6: Sampling Distributions 10

Example 5

Page 11: Elementary Statistics

SQQS1013 Elementary Statistics

The average number of pounds of meat that a person consumes a year is 218.4 pounds.

Assume that the standard deviation is 25 pounds and the distribution is approximately

normal.

a) Find the probability that a person selected at random consumes less than 224

pounds per year.

b) If a sample of 40 individuals selected, find the probability that the mean of the

sample will be less than 224 pounds per year.

Chapter 6: Sampling Distributions 11

Remember!Sometimes you have difficulty deciding whether to use:

; should be used to gain information about a

sample mean.

OR

; used to gain information about an

individual data value obtained from the population.

Example 6

Page 12: Elementary Statistics

SQQS1013 Elementary Statistics

Solution:

6.2 POPULATION AND SAMPLE PROPORTION

Population proportion Sample proportion

Where; N = total number of element in the population

n = total number of element in the sample

X = number of element in the population that possess a specific characteristic

x = number of element in the sample that possess a specific characteristic

Chapter 6: Sampling Distributions 12

FORMULA

Example

7

Page 13: Elementary Statistics

SQQS1013 Elementary Statistics

Suppose a total of 789 654 families live in a city and 563 282 of them own homes. A sample

of 240 families is selected from the city and 158 of them own homes. Find the proportion of

families who own homes in the population and in the sample.

Solution:

The proportion of all families in this city who own homes is

The sample proportion is

6.2.1 Sampling Distribution of

Boe Consultant Associates has five employees. Table below gives the names of these five employees and information concerning their knowledge of statistics.

Where: p = population proportion of employees who know statistics

Let say we draw all possible samples of three employees each and compute the proportion.Total number of samples

= 5C3 =

Chapter 6: Sampling Distributions 13

Example 8

Page 14: Elementary Statistics

SQQS1013 Elementary Statistics

Mean of the sample proportion:

Standard deviation of the sample proportion:

; if

; if

where p = population proportion q = 1 – p n = sample size

Chapter 6: Sampling Distributions 14

FORMULA

FORMULA

Example 9

Page 15: Elementary Statistics

SQQS1013 Elementary Statistics

Based on Boe Consultant Associates in Example 8,

Shape of the sampling distribution of____ According to the central limit theorem, the sampling distribution of is

approximately normal for a sufficiently large sample size.

In the case of proportion, the sample size is considered to be sufficiently large if np

> 5 and nq > 5

A binomial distribution has p = 0.3. How large must sample size be such that a normal distribution can be used to approximate sampling distribution of .

Solution:

Chapter 6: Sampling Distributions 15

Example 10

Page 16: Elementary Statistics

SQQS1013 Elementary Statistics

6.2.2 Application of the Sampling Distribution of

z value for a value of

The Dartmouth Distribution Warehouse makes deliveries of a large number of products to its customers. It is known that 85% of all the orders it receives from its customers are delivered on time.

a) Find the probability that the proportion of orders in a random sample of 100 are delivered on time:i. less than 0.87ii. between 0.81 and 0.88

b) Find the probability that the proportion of orders in a random sample of 100 are not delivered on time greater than 0.1.

Solution:

Chapter 6: Sampling Distributions 16

FORMULA

Example 11

Page 17: Elementary Statistics

SQQS1013 Elementary Statistics

The machine that is used to make these CDs is known to produce 6% defective CDs. The quality control inspector selects a sample of 100 CDs every week and inspects them for being good or defective. If 8% or more of the CDs in the sample are defective, the process is stopped and the machine is readjusted. What is the probability that based on a sample of 100 CDs the process will be stopped to readjust the mashine?

Solution:

Chapter 6: Sampling Distributions 17

Example 12

Page 18: Elementary Statistics

SQQS1013 Elementary Statistics

6.3 OTHER TYPES OF SAMPLING DISTRIBUTIONS

6.3.1 Sampling Distribution for the Difference Between Means With Two Independent Population

When independent random sample of n1 and n2 observations have been

selected from populations with means and and variance and

respectively, for the sampling distribution of the difference has the

following properties:

the mean of is

the standard error of is

If the sample populations are normally distributed then the sampling

distribution of is exactly normally distributed, regardless of the

sample size.

Chapter 6: Sampling Distributions 18

Page 19: Elementary Statistics

SQQS1013 Elementary Statistics

If the sample populations are not normally distributed then the sampling

distribution of is approximately normally distributed when n1 and

n2 are both 30 or more, due to the Central Limit Theorem

6.3.2 Sampling Distribution for the Difference Between Proportions With Two Independent Population

Assume that independence random samples of n1 and n2 observations have been

selected from binomial populations with parameters p1 and p2 respectively. The

sampling distribution of the difference between sample proportions

has these properties;

The mean of is p1 - p2

The standard error is

EXERCISE

1. Given a population with mean, µ = 400 and standard deviation, σ = 60.

a) If the population is normally distributed, what is the shape for the sampling distribution of sample mean with random sample size of 16

b) If we do not know the shape of the population in 1(a), Can we answer 1(a)? Explain.

c) Can we answer 1(a) if we do not know the population distribution but we have been given random sample with size 36? Explain.

Chapter 6: Sampling Distributions 19

Page 20: Elementary Statistics

SQQS1013 Elementary Statistics

2. A random sample with size, n = 30, is obtained from a normal distribution population with µ = 13 and σ = 7.

a) What are the mean and the standard deviation for the sampling distribution of sample mean.

b) What is the shape of the sampling distribution? Explain.c) Calculate

i) P ( < 10)ii) P ( < 16)

3. Given a population size of 5000 with standard deviation 25, Calculate the standard error of mean sample for:

a) n = 300b) n = 100

4. Given X ~ N (5.55, 1.32). If a sample size of 50 is randomly selected, find the sampling distribution for . (Hint: Give the name of distribution, mean and variance).Then, Calculate:

a) P ( 5.25 ≤ ≤ 5.90)b) P (5.45 ≤ ≤ 5.75)

5. 64 units from a population size of 125 is randomly selected with mean 105 and variance 289, Find:

a) the standard error of the sampling distribution aboveb) P( 107.5 < < 109.0)

6. The serving time for clerk at the bank counter is normally distributed with mean 8 minutes and standard deviation 2 minutes. If 36 customers is randomly selected:

a) Calculate

b) The probability that the mean of serving time of a clerk at the bank counter is between 7.7 minutes and 8.3 minutes

7. The workers at the walkie-talkie factory received salary at an average of RM3.70 per hour and the standard deviation is RM0.80. If a sample of 100 is randomly selected, find the probability the mean of sample is:

a) at least RM 3.50 per hourb) between RM 3.20 and R3.60 per hour

8. 1,000 packs of pistachio nut have been sent to one of hyper supermarket in Puchong. The weight of pistachio nut packs is normally distributed with mean 99.3g and standard deviation is 1.8g.

a) If a random sample with 300 packs of pistachio nut is selected, find the probability that the mean of the sample will be between 99.2g and 99.5g.

Chapter 6: Sampling Distributions 20

Page 21: Elementary Statistics

SQQS1013 Elementary Statistics

b) Find the probability that mean of sample 300 packs of pistachio nut is between 99.2g and 99.5g with delivery of

i) 2,000 packsii) 5,000 packs

c) What is the consequence of the incremental in population size toward the probability value on the 9(b)?

9. An average age of 1500 staffs Tebrau Co. Limited is 38 years old with standard deviation 6.2 years old. If the company selects 50 staffs at random,

a) Do we need correction factor in this situation? Justify your answer.b) Find the probability of average age for the group of this staff is between 35

and 40 years old.

10. A research has been conducted by an independent research committee about the efficiency of wire harness, A12-3 production at the P.Tex Industries Sdn. Bhd. An average number of wire harness that has been produced a day is 60 pieces with standard deviation 10. A random sample of 90 pieces of wire harness is selected.

a) Find mean and standard error for the wire harness that has been produced a day.

b) Find the probability of wire harness that can be produced in a day is between 58 pieces and 62 pieces.

11. A test of string breaking strength that has been produced by Z factory shows that the

strength of string is only 60%. A random sample of 200 pieces of string is selected for the test.

a) State the shape of sampling distributionb) Calculate the probability of string strength is at lest 42%

12. Mr. Jay is a teacher at the Henry Garden School. He has conducted a research about bully case at his school. 61.6% students said that they are ever being a bully victim. A random sample of 200 students is selected at random. Find the proportion of bully victim is

a) between 60% and 66%b) more than 64%

13. The information given below shows the response of 40 college students for the question, “Do you work during semester break time?” (The answer is Y=Yes or N=No).

N N Y N N Y N Y N Y N N Y N Y Y N N N Y N Y N N N N Y N N Y Y N N N Y N N Y N N

If the proportion fo population is 0.30,

a) Find the proportion of sampling for the college student who works during semester break.

b) Calculate the standard error for the proportion in (a).

Chapter 6: Sampling Distributions 21

Page 22: Elementary Statistics

SQQS1013 Elementary Statistics

14. A credit officer at the Tiger Bank believes that 25% from the total credit card users will not pay their minimum charge of credit card debt at the end of every month. If a sample 100 credit card user is randomly selected:

a) What is the standard error for the proportion of the customer who does not pay their minimum charge of credit card debt at the end of every month?

b) Find the probability that the proportion of customer in a random sample of 100 do not pay their minimum charge of credit card debt:

i) less than 0.20ii) more than 0.30

Chapter 6: Sampling Distributions 22