0.3 if 5 10 d x f ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -standard deviation = original...

21
1 Worksheet 1 Prep-Work (Distributions) 1)Let X be the random variable whose c.d.f. is given below. Compute the mean, X . (Hint: First identify all possible values of X, then compute values for the p.m.f., ) ( x X f ). 12 2)Let X be binomial random variable with 40 n and 15 . 0 p . Use Excel to compute (i) ) 8 ( X f =0.1086 and ) 8 ( X F =0.8645. 3)Let X be a continuous random variable that is uniform on the interval ] 10 , 0 [ . (i) What is the probability that X is at most 8.75? =.875 (ii) What is the probability that X is no less than 4.25? = .575 4)Let W be the working lifetime, measured in years, of the microchip in your new digital watch. Suppose that W has an exponential distribution with mean 4 years. Use Integrating.xls and the probability density function W f to compute the probabilities that the chip lasts for (i) at least 8 years=.1348 and (ii) at most 2 years=.3935 5) Let X be an exponential random variable with 2 . 9 X . Compute the following. (i) ) 6 ( X f =.0566 (ii) ) 6 ( X P =0 (iii) ) 6 ( X F =.4791 (iv) ) 6 ( X P =.5209 (v) ) ( X E =9.2 x x x x x x F X 20 if 0 . 1 20 15 if 8 . 0 15 10 if 5 . 0 10 5 if 3 . 0 5 if 0 ) (

Upload: others

Post on 21-May-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

1

Worksheet 1

Prep-Work (Distributions)

1)Let X be the random variable whose c.d.f. is given below.

Compute the mean, X . (Hint: First identify all possible values of X, then compute values for the p.m.f., )(xXf ).

12

2)Let X be binomial random variable with 40n and 15.0p . Use Excel to compute (i) )8(Xf =0.1086

and )8(XF =0.8645.

3)Let X be a continuous random variable that is uniform on the interval ]10,0[ . (i) What is the probability that X is at

most 8.75? =.875

(ii) What is the probability that X is no less than 4.25? = .575

4)Let W be the working lifetime, measured in years, of the microchip in your new digital watch. Suppose that W has an

exponential distribution with mean 4 years. Use Integrating.xls and the probability density function Wf to compute

the probabilities that the chip lasts for (i) at least 8 years=.1348 and (ii) at most 2 years=.3935

5) Let X be an exponential random variable with 2.9X . Compute the following. (i) )6(Xf =.0566

(ii) )6( XP =0 (iii) )6(XF =.4791 (iv) )6( XP =.5209 (v) )(XE =9.2

x

x

x

x

x

xFX

20 if0.1

2015 if8.0

1510 if5.0

105 if3.0

5 if0

)(

Page 2: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

2

6)Use Integrating.xls to determine whether or not the function given below could be a p.d.f. for some continuous

random variable.

elsewhere0

10 if2.12.1)(

2 xxxxf X

Page 3: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

3

Worksheet 2

Prep-Work (Variance)

Part 1-Variance(Dispersion) and Standard Deviation

1)Discrete Random Variable: Example 1(MBD Proj2.ppt) –from Text from Variance Section What is similar and what is different

about the two random variables, X and Y in the text Example 1?

a) What is the mean of each random variable, X and Y?

4

4

b) Looking at the values of X and Y, which random variable has the larger variance?

y

c) From the tables, what is the variance of X? .7 And of Y? 3.3

d) From the tables, what is the standard deviation of X? .84 And of Y? 1.82

e) Look at the calculation of the variance of X and Y . From this, write down the formula for the variance of a discrete random

variable.

2)The p.m.f. of a finite random variable Y is given below.

y 2 1 0 1 2

)(yfY 0.10 0.15 0.20 0.25 0.30

Compute )(YV =1.75 and Y =1.32

Page 4: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

4

3)Continuous Random Variable: Example 4(MBD Proj2.ppt) –from Text from Variance Section

a. Write down the formula for the variance of a continuous random variable.

b. The random variable giving the time between computer breakdowns is an exponential random variable(a continuous random variable) with α = 16.8.

c. What is the formula for the pdf of this random variable?

d. What is the formula for the mean of this random variable?

E(X) =

xife

xif

xf xX 08.16

1

00

)( 8.16/

Page 5: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

5

e. Find the mean using Integrating.xls.

=x*(1/16.8)*EXP(-x/16.8)

f. What is the formula for the variance of this random variable?

0

2

8.16

1*)8.16()( dx exXV x/16.8

(When using Excel)

2)( XV

g. Find the variance.

24.282)8.16()( 2 XV

.)()(

dxxfxXE XX

Page 6: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

6

h. What is the standard deviation of this random variable?

i. Sketch a graph of the pdf of this random variable.

=IF(x<0,0,(1/16.8*EXP(-x/16.8)))

Definition

Computation

Plot Interval

Constants

Formula for f(x)

x f(x)

a b

s

0.05952

4

0.05952

4

-10 100

t

u

v

w

j. Guess the standard deviation of a general exponential random variable.

8.16)( 2 XVX

2)(XVX

Page 7: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

7

4)Uniform Distribution

A uniformly distributed random variable has a pdf with the same value for all values of the variable. Suppose X is uniform random

variable taking all values between 0 and 8.

a) Sketch a graph of the pdf..

b)

What must be true of the area under the graph?

1

C)What is the formula for the pdf?

xif0

xif

xif

xf X

8

808

1

00

)(

Page 8: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

8

D)What is the mean of the random variable X? (Excel not needed.)

(0+8)/2=4

E)Find the variance of X. (Excel needed.)

(8-0)2 /12=5.33

F)Find the standard deviation of X.

2.309

Page 9: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

9

Part 2-Variance of Distributions; Sample Statistics

1)Variance of Binomial Distribution: Use Bionomial2.xls

a) The Excel file contains the calculation to find the expected value, variance, and standard deviation of the Binomial

distribution with n = 28 and p = 0.2. Note down the answers.

expected value(5.6) , variance(4.48), and standard deviation(2.1166)

b) Now adapt the file to find the expected value, variance, and standard deviation for n = 50 and p = 0.2. Note down the

answers.

the expected value(10), variance(8), and standard deviation(2.824)

c) Adapt the file again for n = 50 and p = 0.4. Write down the expected value, variance, and standard deviation. Similar to part

(b)

the expected value(20), variance(12), and standard deviation(3.46)

d) In some order, the formulas for the expected value, variance, and standard deviation of the Binomial distribution with n

trial and probability p are the following: ; ; . Match them up by checking the formulas against

the values you found in Questions #1-3.

Binomial Distribution

Expected value

Variance

Standard deviation

2)What if we have a sample instead of a whole distribution? (Think about the errors of the historical signals; these are a sample.)

How do you find the mean, variance and standard deviation of the sample? We need new formulas, which follow:

For a Sample: Mean Variance Standard deviation

.1

1

n

ii

xn

x .1

1

1

22

n

i

i xxn

s .1

1

1

2

n

i

i xxn

s

Page 10: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

10

=average(…..) =var(….) =stdev(…..)

3)Example8 from text: Let X be the number of days that a heart transplant recipient stays in the hospital after a transplant . An

insurance executive wanted to estimate the mean, X, and standard deviation, X. To do this, she took a random sample of 12

transplant recipients. The numbers of days for which these people were hospitalized are. 8, 7, 9, 10, 9, 10, 6, 7, 6, 8, 10, 8 .

a. Calculate sample standard deviation.

1.46

b. Use VAR and STDEV to compute 2s and s for the following random sample of values of a random variable X.

2s (2.15) and s (1.46)

4)Let X be the continuous random variable with p.d.f.

elsewhere.0

10 if2.12.1)(

2 xxxxf X

Use Integrating.xls to compute )(XV and Xσ .

)(XV =.05 and Xσ =.223

5)

Let X be the exponential random variable with parameter 4 . Recall that both the mean and standard deviation of X are equal

to 4. Let S be the standardization of X. Compute )1( SP . (Hint: First express )1( SP in terms of a probability for X, then use

the formula for the cumulative distribution function of X to finish the exercise.)

0.8647

Page 11: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

11

6)

In the future we want to learn about a whole population from a sample. For example, if you sample shoppers to see how much they

will pay for a new item, what can you conclude?

In order to draw conclusions from the sample (referred to as “making a statistical inference”), we have to know how the mean

of a sample varies as we take new samples. This is what the Central Limit Theorem tells us and this is what we will do today.

Central Limit Theorem says that as sample size, n, gets larger, the distribution of sample means is approximately

-Normal, and has

-Same mean as original distribution; that is, Mean =

-Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation =

a)

Let X be a random variable with a mean of 15.9 and a standard deviation of 0.24. Let x be the sample mean for random samples of

size 180n . Compute the expected value, variance, and standard deviation of x .

expected value(15.9), variance(0.00032), and standard deviation of x (0.0178)

b) CLT game (We will do this in class)

Page 12: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

12

Worksheet 3

Prep-Work (Normal Distributions)

The Normal Distribution

1. Using = NORMDIST(x, μ, σ, false), graph the pdf for σ = 1 and μ = 0, 1, 2, 3, -1, Use the interval [-5, 5].

Mean 0 Mean 2

2. What does the value of μ tell you? What does changing μ do?

The x-value of the peak(Typical value), The location peak changes

Mean 1 Mean -1

Page 13: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

13

3. Using = NORMDIST(x, μ, σ, false), graph the pdf for σ = 1 and μ = 0 and σ = 1, 2, 3, 0.5, Use the interval [--5, 5].

4. What does the value of σ tell you? The average distance from the average value

What does changing σ do? When it is larger the graph gets wider

5. Standard normal distribution has mean of zero and standard deviation of 1. Which is its graph

Page 14: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

14

6. Match the following graphs of normal pdfs with the one of the value of the parameters µ and σ. You will not use all the values of

the parameters.

µ, σ. (0.1) (1,0) (1, 1) (2,1) (-1,1) (0, 2) (0, 0.5) (10, 1) (10,3) (10,10)

answers d none e a b c f g h none

(a) (b)

(c) (d)

(e) (f)

g) (h)

Page 15: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

15

The normal distribution with mean µ and standard deviation σ has pdf

though w use = NORMDIST(x, for computation. The standard normal has

Probabilities and the standard normal distribution. Let X have the standard normal distribution.

7. Using the pdf, write an expression for the probability that X is within one standard deviation of the mean. (Use the formula at the

top of the page.)

8. Using the pdf, calculate the probability that X is within one standard deviation of the mean.

Using integrating.xls & answer in 2 we get .6827

9. Using the cdf, calculate the probability that X is within one standard deviation of the mean. To find cdf at 1 use

NORMDIST(1, 0

Probabilities for any normal distribution: “Rule of Thumb”

10. The results in #2-10 are true for all normal distributions. Summarize your results in the following table

Distance from Mean in Normal Distribution Probability

Within one standard deviation of mean

0.6827

Within two standard deviations of mean

.9545

Within three standard deviations of mean

.9973

Standardization of Normal Random Variables. If X is normally distributed, its standardization is

11

What is the distribution of Z?

Standard Normal

Suppose that X is normally distributed, with a mean X of 30 and standard deviation of 5.

12 What is the Z-value (that is, the standardized value) of X = 35?

1

13What is the standardized value of X =40?

2

14 if a value of X is three standard deviations above the mean, what is its Z value? 3 What is the X value? 45

Finding the Z value corresponding to particular probabilities

Page 16: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

16

1. Using Excel, find the value of z0 such that Give two decimal places. Use NORMDIST and trial and error.

1.96

Definition

Computation

Plot Interval Integration Interval

Formula for f(x)

x f(x)

A B

a b

0.39894

0.39894

-5 5

-5 1.95996

0.9750

Constants

s

t

u

v

w

2. Find the value of z0 such that

2.575

Definition

Computation

Plot Interval Integration Interval

Formula for f(x)

x f(x)

A B

a b

0.39894

0.39894

-5 5

-5 2.57589

0.9950

Constants

s

t

u

v

w

-0.050

0.050.1

0.150.2

0.250.3

0.350.4

0.45

-6 -4 -2 0 2 4 6

f(x)

x

FUNCTION

dxxfb

a )(

-0.050

0.050.1

0.150.2

0.250.3

0.350.4

0.45

-6 -4 -2 0 2 4 6

f(x)

x

FUNCTION

dxxfb

a )(

Page 17: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

17

A 50 kg sack of flour contains a weight of flour that is normally distributed with mean 51 kg and standard deviation 2 kg.

3. What is the Z-value of a weight of 50 kg?

-0.5

Standardization of Mean from Samples of Size n. By the Central Limit Theorem, the sample means is normally distributed

with mean µ and standard deviation σ/ Thus the standardization, has the standard normal distribution, where

This is true no matter what the distribution of X provided the samples are random and n is large enough (usually above 30).

(Quite remarkable!)

4. A sample of 4 sacks of flour has mean 50 kg. What is the Z-value of this mean?

-1

5. A sample of 25 sacks of flour has mean 50 kg. What is the Z-value of this mean?

-2.5

6. A sample of 100 sacks of flour has mean 50 kg. What is the Z-value of this mean?

-5

Confidence Intervals

Last time we showed that 95.0)96.196.1( ZP , where Z is the standard normal variable.

The variable Z represents the standard normal variable.

95.0)96.196.1( ZP

1. Represent this on a diagram.

Page 18: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

18

2. Explain what this result means in words.

We are 95 % confident the Z value will fall between -1.96 and 1.96

Suppose that X is normally distributed, with a mean X of 30 and standard deviation of 5. Let

5

30

xZ

3. What is the value of )96.15

3096.1(

xP ? Illustrate on a diagram.

95.0)96.196.1()96.15

3096.1(

ZP

xP

Page 19: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

19

4. What is the value of )596.130596.130( xP ? Illustrate on a diagram.

95.0)96.196.1()596.130596.130( ZPxP

Same illustration as in part 3.

5. What is the value of )8.392.20( XP ? Illustrate on a diagram.

This is the same probability; just evaluate the values of X at the end points as in the graph in part 3.

Standardization of Mean from Samples of Size n. By the Central Limit Theorem, for a sample of size n, the sample means

are normally distributed with mean µ and standard deviation . Thus the standardization, Z, has the standard normal

distribution, where

This is true no matter what the distribution of X provided the samples are random and n is large enough (usually above 30).

(Quite remarkable!)

Continuing the example where the random variable X has a mean 30 and standard deviation 5. Let’s take a sample of 100 and

find the mean

6. What is the mean of all the possible s?

The mean value is still the same, i.e. 30.

7. What is the standard deviation of all the possible s?

The standard deviation is 5.0100/5

Page 20: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

20

8. What is the value of )5.096.1305.096.130( xP ? Illustrate on a diagram.

This is the 95 % confidence interval. So the probability is 0.95

9. What is the value of )98.3002.29( xP ? Illustrate on a diagram.

Its value is 0.95, just evaluate the endpoints of the interval. We are 95 % sure the values of x will fall between 29.02 and 30.98.

The graph is the same as in part 8.

10. What is the interval in which there is a 95% chance of finding an x value?

For X, the confidence interval is from 30-1.96*5, to 30+1.96*5. That is 20.2 to 39.8 as illustrated below.

Page 21: 0.3 if 5 10 d x F ( ) x °° 0.5 if 10 15 d xkerimar/project2... · -Standard deviation = original standard deviation over square root sample size; that is, Standard Deviation = a)

21

11. Give an intuitive explanation of why the interval for is shorter than the interval for X.

The reason is that there is more concentration around the mean since we are dividing by the square root of the sample size, i.e. by 10.

12. What would happen to the length of the interval if the size of the sample (now 100) was increased? Would it get longer or

shorter? Why?

The size of the interval will be squeezed further. It is inversely proportional to the square root of the size of the sample.

Now suppose that the mean of a distribution is NOT known, but that the standard deviation is known. Suppose we take a

sample of size n and find that it has mean . Then it can be shown that there is a 95% chance that the mean of the population

lies in the interval given by the formula

nx

nx

96.1,96.1

This is called the 95% confidence interval for the mean.

Example 3, Normal Distributions, An administrator samples 50 other administrators’ salaries and find the mean of the

sample to be = $88,989 and the standard deviation of the sample to be s = $22,358. The standard deviation of the sample is a

good approximation to σ, the standard deviation of the population.

13. Find the 95% confidence interval for the mean of all such administrators’ salaries.

Using

($82,791, $ 95187)

14. What does the interval in #13 tell you?

95% chance that the mean national mean salary for his counter parts is between $82,791 & 95187

The reason the administrator took the sample was to show that he was paid less that the mean.

15. If the administrator’s own salary is $83,500, can he claim with 95% certainty that he is paid less than the mean?

No . Because his salary is within the 95% confidence interval

16. If the administrator’s own salary is $81,500, can he claim with 95% certainty that he is paid less than the mean?

Yes . Because his salary is outside the 95% confidence interval

Suppose the sample size had been 100 instead of 50.

The 95% confidence interval is NOW ($84,607, $93371)

17. With a salary of $83,500 would he have been able to claim he was paid less than the mean?

Yes . Because his salary is outside the 95% confidence interval

18. With a salary of $81,500 would he have been able to claim that he was paid less than the mean?

Yes . Because his salary is outside the 95% confidence interval