1 7. two random variables in many experiments, the observations are expressible not as a single...

15
1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record the height and weight of each person in a community or the number of people and the total income in a family, we need two numbers. Let X and Y denote two random variables (r.v) based on a probability model (, F, P). Then 2 1 , ) ( ) ( ) ( ) ( 1 2 2 1 x x X X X dx x f x F x F x X x P . ) ( ) ( ) ( ) ( 2 1 1 2 2 1 y y Y Y Y dy y f y F y F y Y y P PILLAI

Upload: cameron-willis

Post on 12-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

1

7. Two Random Variables

In many experiments, the observations are expressible not

as a single quantity, but as a family of quantities. For

example to record the height and weight of each person in

a community or the number of people and the total income

in a family, we need two numbers.

Let X and Y denote two random variables (r.v) based on a

probability model (, F, P). Then

and

2

1

,)()()()( 1221

x

x XXX dxxfxFxFxXxP

.)()()()(2

11221

y

y YYY dyyfyFyFyYyP PILLAI

Page 2: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

2

What about the probability that the pair of r.vs (X,Y) belongs to an arbitrary region D? In other words, how does one estimate, for example, Towards this, we define the joint probability distribution function of X and Y to be

where x and y are arbitrary real numbers.

Properties

(i)

since we get

?))(())(( 2121 yYyxXxP

,0),(

))(())((),(

yYxXP

yYxXPyxFXY (7-1)

.1),( ,0),(),( XYXYXY FxFyF

, )()(,)( XyYX

(7-2)

PILLAI

Page 3: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

3

Similarly

we get

(ii)

To prove (7-3), we note that for

and the mutually exclusive property of the events on the right side gives

which proves (7-3). Similarly (7-4) follows.

.0)(),( XPyFXY ,)(,)( YX

.1)(),( PFXY

).,(),()(,)( 1221 yxFyxFyYxXxP XYXY

).,(),()(,)( 1221 yxFyxFyYyxXP XYXY

(7-3)

(7-4)

,12 xx

yYxXxyYxXyYxX )(,)()(,)()(,)( 2112

yYxXxPyYxXPyYxXP )(,)()(,)()(,)( 2112

PILLAI

Page 4: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

4

(iii)

This is the probability that (X,Y) belongs to the rectangle in Fig. 7.1. To prove (7-5), we can make use of the following identity involving mutually exclusive events on the right side.

).,(),(

),(),()(,)(

1121

12222121

yxFyxF

yxFyxFyYyxXxP

XYXY

XYXY

(7-5)

.)(,)()(,)()(,)( 2121121221 yYyxXxyYxXxyYxXx

0R

1y

2y

1x 2x

X

Y

Fig. 7.1

0R

PILLAI

Page 5: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

5

2121121221 )(,)()(,)()(,)( yYyxXxPyYxXxPyYxXxP

2yy 1y

.

),(),(

2

yx

yxFyxf XY

XY

(7-6)

. ),(),(

dudvvufyxF

x y

XYXY (7-7)

.1 ),(

dxdyyxf XY

(7-8)

This gives

and the desired result in (7-5) follows by making use of (7-3) with and respectively.

Joint probability density function (Joint p.d.f)

By definition, the joint p.d.f of X and Y is given by

and hence we obtain the useful formula

Using (7-2), we also get

PILLAI

Page 6: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

6

To find the probability that (X,Y) belongs to an arbitrary region D, we can make use of (7-5) and (7-7). From (7-5) and (7-7)

Thus the probability that (X,Y) belongs to a differential rectangle x y equals and repeating this procedure over the union of no overlapping differential rectangles in D, we get the useful result

.),(),(

),(),( ),(

),()(,)(

yxyxfdudvvuf

yxFyxxFyyxF

yyxxFyyYyxxXxP

XY

xx

x

yy

y XY

XYXYXY

XY

(7-9)

x

X

Y

Fig. 7.2

yD

,),( yxyxf XY

PILLAI

Page 7: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

7

Dyx XY dxdyyxfDYXP),(

.),(),( (7-10)

(iv) Marginal Statistics In the context of several r.vs, the statistics of each individual ones are called marginal statistics. Thus is the marginal probability distribution function of X, and is the marginal p.d.f of X. It is interesting to note that all marginals can be obtained from the joint p.d.f. In fact

Also

To prove (7-11), we can make use of the identity

.),()( ),,()( yFyFxFxF XYYXYX (7-11)

.),()( ,),()(

dxyxfyfdyyxfxf XYYXYX (7-12)

)()()( YxXxX

)(xFX

)(xf X

PILLAI

Page 8: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

8

so that To prove (7-12), we can make use of (7-7) and (7-11), which gives

and taking derivative with respect to x in (7-13), we get

At this point, it is useful to know the formula for differentiation under integrals. Let

Then its derivative with respect to x is given by

Obvious use of (7-16) in (7-13) gives (7-14).

).,(,)( xFYxXPxXPxF XYX

dudyyufxFxFx

XYXYX ),(),()(

(7-13)

.),()(

dyyxfxf XYX (7-14)

.),()()(

)( xb

xadyyxhxH (7-15)

( )

( )

( ) ( ) ( ) ( , )( , ) ( , ) .

b x

a x

dH x db x da x h x yh x b h x a dy

dx dx dx x

(7-16)

PILLAI

Page 9: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

9

If X and Y are discrete r.vs, then represents their joint p.d.f, and their respective marginal p.d.fs are given by

and

Assuming that is written out in the form of a rectangular array, to obtain from (7-17), one need to add up all entries in the i-th row.

),( jiij yYxXPp

j j

ijjii pyYxXPxXP ),()(

i i

ijjij pyYxXPyYP ),()(

(7-17)

(7-18)

),( ji yYxXP

),( ixXP

mnmjmm

inijii

nj

nj

pppp

pppp

pppp

pppp

21

21

222221

111211

j

ijp

i

ijp

Fig. 7.3

It used to be a practice for insurance companies routinely to scribble out these sum values in the left and top margins, thus suggesting the name marginal densities! (Fig 7.3).

PILLAI

Page 10: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

10

From (7-11) and (7-12), the joint P.D.F and/or the joint p.d.f represent complete information about the r.vs, and their marginal p.d.fs can be evaluated from the joint p.d.f. However, given marginals, (most often) it will not be possible to compute the joint p.d.f. Consider the following example:

Example 7.1: Given

Obtain the marginal p.d.fs and Solution: It is given that the joint p.d.f is a constant in the shaded region in Fig. 7.4. We can use (7-8) to determine that constant c. From (7-8)

. otherwise0,

,10constant,),(

yxyxf XY (7-19)

),( yxf XY

)(xf X ).( yfY

.122

),(1

0

21

0

1

0

0

ccycydydydxcdxdyyxf

yy

y

xXY (7-20)

0 1

1

X

Y

Fig. 7.4

y

PILLAI

Page 11: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

11

Thus c = 2. Moreover from (7-14)

and similarly

Clearly, in this case given and as in (7-21)-(7-22), it will not be possible to obtain the original joint p.d.f in (7-19).

Example 7.2: X and Y are said to be jointly normal (Gaussian) distributed, if their joint p.d.f has the following form:

,10 ),1(22),()(1

xyXYX xxdydyyxfxf (7-21)

.10 ,22),()(

0

y

xXYY yydxdxyxfyf (7-22)

)(xf X)( yfY

.1|| , ,

,12

1),(

2

2

2

2

2

)(

))((2

)(

)1(2

1

2

yx

eyxf Y

Y

YX

YX

X

X yyxx

YX

XY

(7-23)

PILLAI

Page 12: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

12

By direct integration, using (7-14) and completing the square in (7-23), it can be shown that

~

and similarly

~

Following the above notation, we will denote (7-23) as Once again, knowing the marginals in (7-24) and (7-25) alone doesn’t tell us everything about the joint p.d.f in (7-23).

As we show below, the only situation where the marginal p.d.fs can be used to recover the joint p.d.f is when the random variables are statistically independent.

),,( 2

1),()( 22/)(

2

22

XXx

X

XYX Nedyyxfxf XX

(7-24)

(7-25)),,( 2

1),()( 22/)(

2

22

YYy

Y

XYY Nedxyxfyf YY

).,,,,( 22 YXYXN

PILLAI

Page 13: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

13

Independence of r.vs

Definition: The random variables X and Y are said to be statistically independent if the events and are independent events for any two Borel sets A and B in x and y axes respectively. Applying the above definition to the events and we conclude that, if the r.vs X and Y are independent, then

i.e.,

or equivalently, if X and Y are independent, then we must have

AX )( })({ BY

xX )( , )( yY

))(())(())(())(( yYPxXPyYxXP (7-26)

)()(),( yFxFyxF YXXY

).()(),( yfxfyxf YXXY (7-28)

(7-27)

PILLAI

Page 14: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

14

If X and Y are discrete-type r.vs then their independence implies

Equations (7-26)-(7-29) give us the procedure to test for independence. Given obtain the marginal p.d.fs and and examine whether (7-28) or (7-29) is valid. If so, the r.vs are independent, otherwise they are dependent. Returning back to Example 7.1, from (7-19)-(7-22), we observe by direct verification that Hence X and Y are dependent r.vs in that case. It is easy to see that such is the case in the case of Example 7.2 also, unless In other words, two jointly Gaussian r.vs as in (7-23) are independent if and only if the fifth parameter

., allfor )()(),( jiyYPxXPyYxXP jiji (7-29)

)(xf X

)( yfY

),,( yxf XY

).()(),( yfxfyxf YXXY

.0

.0

PILLAI

Page 15: 1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to

15

Example 7.3: Given

Determine whether X and Y are independent. Solution:

Similarly

In this case

and hence X and Y are independent random variables.

otherwise.,0

,10 ,0,),(

2

xyexy

yxfy

XY (7-30)

.10 ,2 22

),()(

0 0

0

2

0

xxdyyeyex

dyeyxdyyxfxf

yy

yXYX

(7-31)

.0 ,2

),()(21

0 ye

ydxyxfyf y

XYY (7-32)

),()(),( yfxfyxf YXXY

PILLAI