chapter 6 the normal probability...

Chapter 6

The Normal Probability Distribution

CONTINUOUS RANDOM VARIABLES Continuous random variables can assume the

infinitely many values corresponding to points on a

line interval. This means that the range of the variable

contains an interval of real numbers.

Examples:

Heights, weights

length of life of a particular product

experimental laboratory error

Recall that the probability distribution p(x) of a

discrete random variable is just the histogram of

its values. We had the following examples in

chapter 5:

1. Toss a fair coin three times and let x be the

number of heads. Then x is binomial and

x p(x)

0 1/8

1 3/8

2 3/8

3 1/8

We can create what we call the probability

polygon h(x) by starting from the midpoint of the

left side of the first bar and connecting the

midpoints of the upper sides of the subsequent

bars by straight lines and finishing at the

midpoint of the right side of the last bar.

2. A marksman hits a target 80% of the time. He

fires five shots at the target. What is the probability

that exactly 3 shots hit the target? If x is the number

of shots that hit the target, then x is a binomial and

Again we can draw the probability polygon h(x) in

the same fashion.

k p =

.80

0 .000

1 .007

2 .058

3 .263

4 .672

5 1.000

Note that the total area under the probability

histogram is 1 and that the area under the

probability polygon is very close to 1

Now let x be a continuous random variable and

consider a set of n measurements of x. we can

construct the relative frequency histogram of

these values and the corresponding probability

polygon h(x) . As mentioned before, the area

under h(x) is very close to 1. We can use this

polygon approximate as the area

under h(x) over the interval .

0

2

4

6

8

10

0.835 0.865 0.895 0.925 0.955 0.985 1.015 1.045 1.075 More

rel freq

rel freq

As (the number of measurements

increases without bound) the relative frequency

polygon h(x) approaches the graph of a smooth

function whose area under the graph is one.

This function is called the probability density

function, pdf, of the random variable x.

0

2

4

6

8

10

0.835 0.865 0.895 0.925 0.955 0.985 1.015 1.045 1.075 More

rel freq

rel freq

h(x)

The pdf of x can be used, among other

things, to find the exact value of

which is the area under the graph of over

the interval which is, from calculus,

a b

The probability density function for the random variable x describes the probability distribution of the continuous random variable x. This function is sometime referred to as the depth of probability.

•In many situations the density function is given

by a mathematical formula where as in many

many situations finding a nice formula may not

be possible.

PROPERTIES OF CONTINUOUSPROBABILITY

DISTRIBUTIONS

1.

2. The total area under the curve =

3. =area under the curve between a and b

Exercise: Show that there is no probability attached to any single

value of x. That is, P(x = a) = 0.

There are many different types of continuous random

variables. Here are some that occur often:

1. Uniform Random variable

2. Exponential Random Variable

Some other random variable include The Gamma, Chi

Square, and Bata random variables. The most important

random variable is the Normal Random variable.

To choose a model for real life situation, we try to pick a model that

Fits the data well

Allows us to make the best possible inferences using the data.

As mentioned, one important continuous random variable is the normal random variable and we will discuss that in more detail.

THE NORMAL DISTRIBUTION

deviation. standard andmean population theare and

1416.3 7183.2

for 2

1)(

2

2

1

e

xexf

x

• The shape and location of the normal

curve changes as the mean and standard

deviation change.

• The formula that generates the

normal probability distribution is:

THE STANDARD NORMAL

DISTRIBUTION

To find P(a < x < b), we need to find the

area under the appropriate normal curve.

To simplify the tabulation of these areas,

we standardize each value of x by

expressing it as a z-score, the number of

standard deviations it lies from the mean

.

xz

THE STANDARD

NORMAL (Z)

DISTRIBUTION

Mean = 0; Standard deviation = 1

When x = , z = 0

Symmetric about z = 0

Values of z to the left of center are negative

Values of z to the right of center are positive

Total area under the curve is 1.

USING TABLE 3

The four digit probability in a particular row

and column of Table 3 gives the area under

the z curve to the left that particular value of

z.

Area for z = 1.36

P(z 1.36) = .9131

P(z >1.36)

= 1 - .9131 = .0869

P(-1.20 z 1.36) =

.9131 - .1151 = .7980

EXAMPLE

Use Table 3 to calculate these probabilities:

To find an area to the left of a z-value, find the area

directly from the table.

To find an area to the right of a z-value, find the area

in Table 3 and subtract from 1.

To find the area between two values of z, find the two

areas in Table 3, and subtract one from the other.

P(-1.96 z 1.96)

= .9750 - .0250 =

.9500

P(-3 z 3)

= .9987 -

.0013=.9974

Remember the Empirical Rule:

Approximately 99.7% of the

measurements lie within 3

standard deviations of the mean.

USING TABLE 3

Remember the Empirical Rule:

Approximately 95% of the

measurements lie within 2

standard deviations of the mean.

1. Look for the four digit area

closest to .2500 in Table 3.

2. What row and column does

this value correspond to?

WORKING BACKWARDS

Find the value of z that has area .25 to its left.

4. What percentile

does this value

represent? 25th percentile,

or 1st quartile (Q1)

3. z = -.67

1. The area to its left will be 1 - .05

= .95

2. Look for the four digit area

closest to .9500 in Table 3.

Find the value of z that has area .05 to its right.

3. Since the value .9500 is

halfway between .9495 and

.9505, we choose z halfway

between 1.64 and 1.65.

4. z = 1.645

FINDING PROBABILITIES FOR THE

GENERAL NORMAL RANDOM

VARIABLE

To find an area for a normal random variable x

with mean and standard deviation , standardize

or rescale the interval in terms of z.

Find the appropriate area using Table 3.

Example: x has a normal

distribution with = 5 and =

2. Find P(x > 7).

1587.8413.1)1(

)2

57()7(

zP

zPxP

1 z

EXAMPLE The weights of packages of ground beef are

normally distributed with mean 1 pound and

standard deviation .10. What is the probability

that a randomly selected package weighs

between 0.80 and 0.85 pounds?

)85.80(. xP

)5.12( zP

0440.0228.0668.

What is the weight of a package

such that only 1% of all packages

exceed this weight?

233.11)1(.33.2?

33.21.

1? 3, Table From

01.)1.

1?(

01.?)(

zP

xP

chapter 6 the normal probability...

Documents