samplingprocessandsampling design...

51
1 Outline 1 Sampling process and sampling design Sampling Formulas

Upload: others

Post on 05-Nov-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

1  

Outline

1  

•  Sampling  process  and  sampling  design  

   •  Sampling  Formulas  

Page 2: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

2  

Part I

2  

Sampling  process  and  sampling  design  

 

Page 3: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

3  

Introduc4on  

For  small  popula.on,  a  census  study  is  used  because  data  can  be  gathered  on  all.  

 For  large  popula.on,  sample  surveys  will  be  used  to  obtain  informa.on  from  a  small  (but  carefully  chosen)  sample  of  the  pop’n.  The  sample  should  reflect  the  characteris.cs  of  the  popula.on  from  which  it  is  drawn.      

 

Page 4: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

4  

Sampling…….  

4

•  3 factors that influence sample representative-ness

•  Sampling procedure •  Sample size •  Participation (response)

•  When might you collect the entire population? •  When your population is very small •  When you have extensive resources •  When you don’t expect a very high response

Page 5: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

5  

Sampling  Process

5  

•  The sampling process comprises several stages:

–  Defining the population of concern –  Specifying a sampling frame, a set of items or events

possible to measure –  Specifying a sampling method for selecting items or

events from the frame –  Determining the sample size (not presented) –  Implementing the sampling plan “ –  Sampling and data collecting “ –  Reviewing the sampling process “

Page 6: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

6  6  

Page 7: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

7  

Sampling  Frame  -­‐1-­‐  

7  

•  In the most straightforward case, such as the sentencing of a batch of material from production (acceptance sampling by lots), it is possible to identify and measure every single item in the population and to include any one of them in our sample. However, in the more general case this is not possible. There is no way to identify all rats in the set of all rats. Where voting is not compulsory, there is no way to identify which people will actually vote at a forthcoming election (in advance of the election)

•  As a remedy, we seek a sampling frame which has the property that we can identify every single element and include any in our sample .

•  The sampling frame must be representative of the population

Page 8: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

8  8

Sampling  Frames  -­‐2-­‐  

•  Sources  – Administra.ve  records-­‐eg  

•  Hospital  records  •  Birth  and  Death  Registers  •  LC  lists  •  Voters’  register  •  School  registers  •  etc  

– Construct  your  own  

Page 9: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

9  

Types  of  Samples  and  Sample  design  

9  

•  Probability (Random) Samples •  Simple random sample (SRS)

– Systematic random sample – Stratified random sample – Multistage sample (not presented) – Multiphase sample “ – Cluster sample “

•  Non-Probability Samples – Convenience sample – Purposive sample – Quota

Page 10: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

10  

Probability  Sampling  

10  

 •  A probability sampling scheme is one in which every

unit in the population has a chance (greater than zero) of being selected in the sample, and this probability can be accurately determined.

•  When every element in the population does have the

same probability of selection, this is known as an 'equal probability of selection' (EPS) design. Such designs are also referred to as 'self-weighting' because all sampled units are given the same weight.

Page 11: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

11  

Non-­‐Probability  Sampling  

11  

•  Any sampling method where some elements of population have no chance of selection (these are sometimes referred to as 'out of coverage'/'undercovered'), or where the probability of selection can't be accurately determined. It involves the selection of elements based on assumptions regarding the population of interest, which forms the criteria for selection. Hence, because the selection of elements is nonrandom, nonprobability sampling not allows the estimation of sampling errors..

•  Example: We visit every household in a given street, and interview the first person to answer the door. In any household with more than one occupant, this is a nonprobability sample, because some people are more likely to answer the door (e.g. an unemployed person who spends most of their time at home is more likely to answer than an employed housemate who might be at work when the interviewer calls) and it's not practical to calculate these probabilities.

Page 12: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

12  

Probability  Sampling  Design  

12  

Ø Probability sampling includes: •  Simple Random Sampling (srs), •  Systematic Sampling, •  Stratified Random Sampling (SRS), •  Cluster Sampling (not presented)

•  Multistage Sampling (not presented) •  Multiphase sampling (not presented)

Page 13: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

13  

Simple  Random  Sampling  (srs)  

13  

•  Applicable when population is small, homogeneous & readily available

•  All subsets of the frame are given an equal probability.

Each element of the frame thus has an equal probability of selection.

•  It provides for greatest number of possible samples.

This is done by assigning a number to each unit in the sampling frame.

•  A table of random number or lottery system is used to

determine which units are to be selected.

Page 14: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

14  

Simple  Random  Sampling……..

14  

•  Advantages •  Estimates are easy to calculate. •  Simple random sampling is always an EPS design, but

not all EPS designs are simple random sampling.

•  Disadvantages •  If sampling frame large, this method impracticable. •  Minority subgroups of interest in population may not

be present in sample in sufficient numbers for study.

Page 15: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

15  

srs:  Replacement  of    Selected  Units  

15  

•  Sampling  schemes  may  be  without  replacement  ('WOR'  -­‐  no  element  can  be  selected  more  than  once  in  the  same  sample)  or  with  replacement  ('WR'  -­‐  an  element  may  appear  mul.ple  .mes  in  the  one  sample).    

 •  For  example,  if  we  catch  fish,  measure  them,  and  immediately  

return  them  to  the  water  before  con.nuing  with  the  sample,  this  is  a  WR  design,  because  we  might  end  up  catching  and  measuring  the  same  fish  more  than  once.  However,  if  we  do  not  return  the  fish  to  the  water  (e.g.  if  we  eat  the  fish),  this  becomes  a  WOR  design.  

Page 16: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

16  

Systema4c  Sampling  

16  

•  Systema4c  sampling  relies  on  arranging  the  target  popula.on  according  to  some  ordering  scheme  and  then  selec.ng  elements  at  regular  intervals  through  that  ordered  list.    

 •  Systema.c  sampling  involves  a  random  start  and  then  

proceeds  with  the  selec.on  of  every  kth  element  from  then  onwards.  In  this  case,  k=(popula.on  size/sample  size).    

 •  It  is  important  that  the  star.ng  point  is  not  automa.cally  the  

first  in  the  list,  but  is  instead  randomly  chosen  from  within  the  first  to  the  kth  element  in  the  list.    

 •  A  simple  example  would  be  to  select  every  10th  name  from  the  

telephone  directory  (an  'every  10th'  sample,  also  referred  to  as  'sampling  with  a  skip  of  10').  

Page 17: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

17  

Systema4c  Sampling……  

17  

As  described  above,  systema.c  sampling  is  an  EPS  method,  because  all  elements  have  the  same  probability  of  selec.on  (in  the  example  given,  one  in  ten).  It  is  not  'simple  random  sampling'  because  different  subsets  of  the  same  size  have  different  selec.on  probabili.es  -­‐  e.g.  the  set  {4,14,24,...,994}  has  a  one-­‐in-­‐ten  probability  of  selec.on,  but  the  set  {4,13,24,34,...}  has  zero  probability  of  selec.on.  

Page 18: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

18  

Systema4c  Sampling……  

18  

•  ADVANTAGES:  •  Sample  easy  to  select  •  Suitable  sampling  frame  can  be  iden.fied  easily  •  Sample  evenly  spread  over  en.re  reference  popula.on    •  DISADVANTAGES:  •  Sample  may  be  biased  if  hidden  periodicity  in  popula.on  

coincides  with  that  of  selec.on.  •  Difficult  to  assess  precision  of  es.mate  from  one  survey.  

Page 19: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

19  

Stra4fied  Sampling  

19  

Where  popula.on  embraces  a  number  of  dis.nct  categories,  the  frame  can  be  organized  into  separate  "strata."  Each  stratum  is  then  sampled  as  an  independent  sub-­‐popula.on,  out  of  which  individual  elements  can  be  randomly  selected.    

 •  Every  unit  in  a  stratum  has  same  chance  of  being  selected.    •  Using  same  sampling  frac.on  for  all  strata  ensures  propor.onate  

representa.on  in  the  sample.    •  Adequate  representa.on  of  minority  subgroups  of  interest  can  be  

ensured  by  stra.fica.on  &  varying  sampling  frac.on  between  strata  as  required.  

Page 20: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

20  

Stra4fied  Sampling……  

20  

•  Finally,  since  each  stratum  is  treated  as  an  independent  popula.on,  different  sampling  approaches  can  be  applied  to  different  strata.  

•  Drawbacks  to  using  stra.fied  sampling    •   First,  sampling  frame  of  en.re  popula.on  has  to  be  prepared  

separately  for  each  stratum  •  Second,  when  examining  mul.ple  criteria,  stra.fying  

variables  may  be  related  to  some,  but  not  to  others,  further  complica.ng  the  design,  and  poten.ally  reducing  the  u.lity  of  the  strata.  

•   Finally,  in  some  cases  (such  as  designs  with  a  large  number  of  strata,  or  those  with  a  specified  minimum  sample  size  per  group),  stra.fied  sampling  can  poten.ally  require  a  larger  sample  than  would  other  methods  

Page 21: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

21  

Stra4fied  Sampling…….  

21  

Draw  a  sample  from  each  stratum    

Page 22: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

22  

Post  Stra4fica4on  

22  

•  Stra.fica.on  is  some.mes  introduced  aher  the  sampling  phase  in  a  process  called  "poststra.fica.on“.  

•  This  approach  is  typically  implemented  due  to  a  lack  of  prior  knowledge  of  an  appropriate  stra.fying  variable  or  when  the  experimenter  lacks  the  necessary  informa.on  to  create  a  stra.fying  variable  during  the  sampling  phase.  Although  the  method  is  suscep.ble  to  the  pijalls  of  post  hoc  approaches,  it  can  provide  several  benefits  in  the  right  situa.on.    

•  Implementa.on  usually  follows  a  simple  random  sample.  In  addi.on  to  allowing  for  stra.fica.on  on  an  ancillary  variable,  poststra.fica.on  can  be  used  to  implement  weigh.ng,  which  can  improve  the  precision  of  a  sample's  es.mates.  

Page 23: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

23  

Non-­‐Probability  Sampling…….  

23  

•  Nonprobability  Sampling  includes:  Accidental  Sampling,  Convenience  Sampling,  Quota  Sampling  and  Purposive  (or  Judgement)  Sampling.    

•  In  addi.on,  nonresponse  effects  may  turn  any  probability  design  into  a  nonprobability  design  if  the  characteris.cs  of  nonresponse  are  not  well  understood,  since  nonresponse  effec.vely  modifies  each  element's  probability  of  being  sampled.  

Page 24: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

24  

Part II

24  

Sampling  Formulas  and  es4ma4on:  Sampling  Random  Sampling  and    Stra4fied  Random  Sampling  

Page 25: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

25  

Many  Sampling  methods….  

Probability  Sampling:  

•  Random  Sampling    •  Systema.c  Sampling    •  Stra.fied  Sampling  

NonProbability  Sampling  •  Convenience  Sampling    •  Judgment  Sampling    •  Quota  Sampling    •  Snowball  Sampling    

………………….and many others………

Page 26: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

26  

The  winner  is:    Probability  Sampling.    

In  non-­‐probability  sampling,  members  are  selected  from  the  pop’n  in  some  non-­‐random  manner  and  the  sampling  error  (=degree  to  which  a  sample  might  differ  from  the  pop’n)  is  unknown.  

 In  probability  sampling,  each  member  of  the  pop’n  has  a  specified  probability  of  being  included  in  the  sample.    Its  advantage  is  that  sampling  error  can  be  calculated.  

Page 27: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

27  

Popula4on  parameters  

Defini.on:  Parameters  are  those  numerical  characteris.cs  of  the  pop’n  that  we  will  es.mate  from  a  sample.      

Nota.ons:      

{ }

⎩⎨⎧

==absencepresence

orweightoragexExample

xxxXInterestofVariable

NofsubsetaissSample

NelementnPop

i

N

01

:

.....:

,....,2,1:

...21:'

21

Page 28: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

28  

Popula4on  mean,  total,  variance  

 Pop’n  mean:      Pop’n  total:    Pop’n  variance:  its  square  root  is  the  StdDev    

∑=

=N

iixN 1

µτ NxN

ii ==∑

=1

( ) 2

1

22

1

2 11 µµσ −=−= ∑∑==

N

ii

N

ii x

Nx

N

Page 29: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

29  

Simple  Random  Sampling  (s.r.s)  

The  most  elementary  form  of  sampling  is  s.r.s.  where  each  member  of  the  pop’n  has  an  equal  and  known  chance  of  being  selected  at  most  once.    

   There  are              possible  samples  of  size  n  taken  without  replacement.      

 In  this  sec.on,  some  sta.s.cal  proper.es  of  the  sample  mean  will  be  derived.  

⎟⎟⎠

⎞⎜⎜⎝

⎛nN

Page 30: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

30  

The  Sample  Mean  

   The  sample  mean          es.mates  the  pop’n  mean        where        so  that                                  will  es.mate  the  pop’n  total        

∑=

=n

iiXn

X1

1

)'()(

:

.....:

)(...21:

...21:'

21

npopValueFixedxsampleValueRandomX

DifferenceHuge

XXXmemberssampletheofValues

NnnelementSample

NelementnPop

i

i

n

==

∑=

=N

iixN 1

XNT = µτ N=

Page 31: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

31  

Distribu4on  of  Sample  Means  

31  

Page 32: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

32  

Expecta4on  &  Variance  of  the  Sample  Mean    

 Theorem  A  (UNBIASEDNESS)  :    under  s.r.s.  ,                    Theorem  B:  under  s.r.s.  ,    Recall:  The  variance  of                    in  sampling  without  replacement  differs  from  that  in  sampling  with  replacement  by  the  factor                                  which  is  called  the  finite  popula.on  correc.on.  

The  ra.o                    is  called  the  sampling  frac.on.            

( ) µ=XE( ) ⎟

⎠⎞⎜

⎝⎛

−−−=111

2

Nn

nXVar σ

⎟⎠⎞⎜

⎝⎛

−−−111

Nn

X

Nn

Page 33: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

33  

Es4ma4on  of  the  Popula4on  Variance  

Theorem  A:  under  s.r.s.,    where        Corollary  A:  An  unbiased  es.mator  of                              is      given  by                                                                                                          where        

( )1

1ˆ 22

−⎟⎠⎞⎜

⎝⎛ −=

NN

nnE σσ

( )2

1

2 1ˆ ∑=

−=n

ii XX

( )XVar

⎟⎠⎞⎜

⎝⎛ −=⎟

⎠⎞⎜

⎝⎛

−−

⎟⎠⎞⎜

⎝⎛ −⎟⎠⎞⎜

⎝⎛

−=

Nn

ns

NnN

NN

nn

nsX 1

11

1ˆ 22

2 σ

( )2

1

2

11 ∑

=

−−

=n

ii XX

ns

Page 34: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

34  

Normal  approxima4on  to  the  sampling  distribu4on  of  the  sample  mean  

We  will  be  using  the  CLT  (Central  Limit  Theorem)  in  order  to  find  the  probabilis.c  bounds  for  the  es.ma.on  error.        

Applica.on  1:      •  the  probability  that  the  error  made  in  es.ma.ng          by          is                                                                                              using  the  CLT.  

Applica.on  2:    •  a                                                        CI  (Confidence  Interval)  for  the  pop’n  mean              is  given  by                                                          using  the  CLT.  

µX ( ) 12|| −⎟⎟⎠

⎞⎜⎜⎝

⎛Φ≅≤−

X

XPσδδµ

µ 2/* ασ zX X±( )%1100 α−

Page 35: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

35  

Es4ma4ng  a  Ra4o  

Ra.o  arises  frequently  in  Survey  Sampling.            If  a  bivariate  sample                                is  drawn,  then  the  ra.o                                                                                        

=

== N

ii

N

ii

xN

yNr

1

1

1

1

XYR /=

),( ii YX

Page 36: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

36  

Es4ma4ng  a  Ra4o  (cont’d)  

Theorem  A:    With  s.r.s.,  the  approximate  variance  of  R  is            Since  the  popula.on  correla.on  pop’n  is                                    then        

( )yxyxx

rrNn

nRVar σρσσσ

µ21

1111)( 222

2 −+⎟⎠⎞⎜

⎝⎛

−−−≅

( )

( )xyyxx

YXYXx

rrNn

n

rrRVar

σσσµ

σσσµ

211111

21)(

2222

2222

−+⎟⎠⎞⎜

⎝⎛

−−−=

−+≅

yx

xy

σσσ

ρ =

Page 37: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

37  

Es4ma4ng  a  Ra4o  (cont’d)  

Theorem  B:      With  s.r.s.,  the  approximate  expecta.on  of  R  is         ( )yxx

x

rNn

nrRE σρσσ

µ−⎟

⎠⎞⎜

⎝⎛

−−−+≅ 22

2

11111)(

Page 38: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

38  

Standard  Error  es4mate  of  R  

The  es.mate  variance  of  R  is                    where  and  the  pop’n  covariance                    is  es.mated  by          

( )xyyxR RsssRXN

nn

s 211111 222

22 −+⎟

⎠⎞⎜

⎝⎛

−−−≅

( ) ( )yi

N

ixixy yx

Nµµσ −−= ∑

=1

1

( )( )

yx

xy

yx

xy

i

n

iixy

sss

and

YYXXn

s

=⇒=

−−−

= ∑=

ρσσ

σρ ˆ

11

1

Page 39: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

39  

Confidence  Interval  for  R  

   An  approximate                                                      CI  (Confidence  Interval)  for  the  ra.o  of  interest  r  is  given  by  

( )%1100 α−

2/* αzsR R±

Page 40: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

40  

Stra4fied  Random  Sampling  

Introduc4on  (S.R.S.)    The  popula.on  is  par..oned  into  sub-­‐pop’s  or  strata  (stratum,  singular)  that  are  then  independently  sampled  and  are  combined  to  es.mate  popula.on  parameters.          

 A  stratum  is  a  subset  of  the  popula.on  that  shares  at  least  one  common  characteris.c  

 Example:  males  &  females;  age  groups;…  

Page 41: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

41  

Why  is  Stra4fied  Sampling  superior  to  Simple  Random  Sampling?  

•  S.R.S.  reduces  the  sampling  error  •  S.R.S.  guarantees  a  prescribed  number  of  observa.ons  from  each  stratum  while  s.r.s.  can’t  

•  The  mean  of  a  S.R.S.  can  be  considerably  more  precise  than  the  mean  of  a  s.r.s.,          if  the  popula.on  members  within  each  stratum  are  rela.vely  homogeneous  and  if  there  is  enough  varia.on  between  strata.  

Page 42: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

42  

Proper4es  of  Stra4fied  Es4mates  

 Nota.on:  Let                                                                    be  the  total  pop’n  size  if                                                                    

                         denote  the  pop’n  sizes  in  the  L  strata.  The  overall  popula.on  mean                      is  a  weighted  average  of  the  pop’n  means                of  the  strata,  where                                  denotes  the  frac.on  of  the  popula.on  in  the    stratum.  

LNNNN +++= ...21

LlforNl ,...,2,1=

∑∑∑

∑==

=

= ===L

ll

L

lllL

ll

L

lll

WbecauseWW

W

11

1

1 1µµ

µ

lµNNW ll /= thl

Page 43: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

43  

Proper4es  of  Stra4fied  Es4mates  (cont’d)  

Stra.fied  sampling  requires  two  steps:  •  Iden.fy  the  relevant  strata  in  the  popula.on  •  Use  s.r.s.  to  get  subject  from  each  stratum  Within  each  stratum,  a  s.r.s.  of  size                      is  taken  to  obtain  the  sample  mean  in  the              stratum  will  be  denoted  by                                            

     where  denotes  the                          observa.on  in  the          stratum.  

 

thlln

∑=

=ln

iil

ll Xn

X1

1

thithl

Page 44: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

44  

Proper4es  of  Stra4fied  Es4mates  (cont’d)  

Theorem  A:  The  stra.fied  es.mate,                                      ,                                      of  the  overall  popula.on  mean                                  is  UNBIASED.  

•  Since  we  assume  that  the  samples  from  different  strata  are  independent  of  one  another  and  that  within  each  stratum  a  s.r.s.  is  taken,  then  the  variance  of                    can  be  easily  calculated    

Theorem  B:  The  variance  of  the  stra.fied  sample  is      

l

L

lls XWX ∑

=

=1µ

sX

( ) 2

1

2

11

11l

l

l

l

L

lls N

nn

WXVar σ⎟⎟⎠

⎞⎜⎜⎝

⎛−−

−⎟⎟⎠

⎞⎜⎜⎝

⎛=∑

=

Page 45: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

45  

Neglec4ng  /  Ignoring  the  finite  popula4on  correc4on  

 Approxima.on:    

 If  the  sampling  frac.ons                                          within  all  strata      were  small,  Theorem  B  will  then  reduce  to:          

NNW l

l =

( ) 2

1

2

l

L

l l

ls n

WXVar σ∑=

Page 46: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

46  

Expecta4on  and  Variance  of  the  stra4fied  es4mate  of  the  popula4on  total  

This  is  a  corollary  of  Theorems  A  &  B.                      

( )

( ) ( )

ss

ll

l

l

L

llss

s

XNTwhere

Nn

nNXVarNTVar

TE

=

⎟⎟⎠

⎞⎜⎜⎝

⎛−−−⎟⎟⎠

⎞⎜⎜⎝

⎛==

=

∑=

2

1

22

1111 σ

τ

Page 47: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

47  

Methods  of  units’  alloca4on  in  strata  

For  small  sampling  frac.ons  within  strata  i.e.  when  neglec.ng/ignoring  the  finite  pop’n  correc.on,    

   Ques.on:  How  to  choose                                                  to  minimize            subject  to  the  constraint                                                                      when  resources  of  a  survey  allowed  only  a  total  of  n  units  to  be  sampled?  

Note:  We  could  include  finite  popula.on  correc.ons  but  the  results  will  be  more  complicated.    Try  it!  

( ) 2

1

2

l

L

l l

ls n

WXVar σ∑=

Lnnn ,...,, 21 ( )sXVarLnnnn +++= ...21

Page 48: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

48  

Neyman  alloca4on  

Theorem  A:    The  samples  sizes                                                                          that  minimize                                                    subject  to  the  constraint                                                                                                

       are  given  by        Corollary  A:  stra.fied  es.mate  &  op.mal  alloca.ons          

Lnnn ,...,, 21( )sXVarLnnnn +++= ...21

LlwhereW

Wnn L

kkk

lll ,...,2,1

1

==∑=

σ

σ

( )n

WXVar

L

lll

so

2

1⎟⎠⎞⎜

⎝⎛

=∑=

σ

Page 49: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

49  

Propor4onal  alloca4on  

If  a  survey  measures  several  auributes  for  each  popula.on  member,  it  will  be  difficult  to  find  an  alloca.on  that  is  simultaneously  op.mal  for  each  of  those  variables.    Using  the  same  sampling  frac.on                

           in  each  stratum  will  provide  a  simple  and  popular  alterna.ve  method  of  alloca.on.    

LlfornWNNnn

Nn

Nn

Nn

ll

L

L

,...,2,1

....

1

2

2

1

1

===⇒

===

Page 50: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

50  

Propor4onal  alloca4on  (cont’)  

Theorem  B:  With  stra.fied  sampling  based  on  propor.onal  alloca.on,  ignoring  the  finite  popula.on  correc.on,    

   Theorem  C:  With  stra.fied  sampling  based  on  both  alloca.on  methods,  ignoring  the  finite  pop’n  correc.on,    

( ) ( ) ( )

=

=

=

−=−

L

lll

l

L

llsosp

Wwhere

Wn

XVarXVar

1

2

1

1

σσ

σσ

( ) 2

1

1l

L

llsp W

nXVar σ∑

=

=

Page 51: Samplingprocessandsampling design Sampling)Formulas)sampieuchair.ec.unipi.it/wp-content/uploads/2019/10/Survey-Method… · 10 ProbabilitySampling) • A probability sampling scheme

51  

Conclusions  

A  mathema.cal  model  for  Survey  Sampling  was  built  using  s.r.s.  and  probabilis.c  error  bounds  for  the  es.mates  derived.  

 The  theory  and  techniques  of  survey  probability  sampling  include  also  Systema.c  Sampling,  Cluster  Sampling,  etc…as  well  as  non-­‐probability  sampling  methods  such  as  Quota  Sampling,  Snowball  Sampling,…