sampling design – from a statistical point of view

15
3. March 2016 Dag Ø. Hjermann 1 Sampling design – from a statistical point of view Dag Ø. Hjermann, NIVA NORMAN seminar 3. March 2016

Upload: hathuan

Post on 13-Feb-2017

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 1

Sampling design – from a statistical point of view

Dag Ø. Hjermann, NIVA NORMAN seminar 3. March 2016

Page 2: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 2

How we sample organisms (or sediment) has a large effect on - estimates of levels and spatial differences in levels - estimates of time trends

Page 3: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 3

1. What do we consider one station? - how many days do we allow sampling time to stretch over? - how large area can we consider to be a single station? 1a. Small-scale (some km) 1b. Large-scale (hundreds of km)

2. How does pooling of samples affect the analysis and conclusions?

Page 4: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 4

This study: - Cod - Legacy contaminants (metals, PCBs, etc.)

Page 5: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 5

1a. Small-scale variation Data from the VIC project

Slemm. Svestad Håøya 1997 15.-18. jan x x x

22. jan x 3. feb x

1998 15.-17. jan x x x 21. jan x 2. feb x

1999 14. jan x 18.-21. jan x x x 28. jan x

One location, one sampling

Extension in space (3 locations)

Extension in time (ca. 2 weeks)

Inner Oslo fjord (Asker-Håøya)

Page 6: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 6

Sampling extended in time

A short sampling period → much lower statistical error (SE) - but also spurious time trends?

1 locality, 1 day ( ) 1 locality, 3 days during 2 weeks ( , ) ( and : date as fixed or random factor)

Page 7: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 7

Sampling extended in space

1 locality, 1 occasion ( ) 3 localities, 1 occasion ( , ) ( and : location fixed /random factor)

A small area → much lower statistical error (SE) - but quite clearly spurious time trends

Page 8: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 8

All data ( ) compared with One location, one occasion ( )

Same conclusion: lower statistical error (SE) but spurious time trends

Sampling extended in both space and time

Page 9: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 9

Conclusions for small-scale studies - sampling over a small area and a short time → Unrealistically low statistical uncertainty → Spurious trends - Fish in a single sample are clearly not

independent of each other

Page 10: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 10

1b. Large-scale variation Picked stations not close to point sources

Can we combine these stations to get better trend estimates?

Page 11: Sampling design – from a statistical point of view

1b. Large-scale variation Example: mercury

Year (1991-2014) 3. March 2016 Dag Ø. Hjermann 11

Page 12: Sampling design – from a statistical point of view

1b. Large-scale variation

AIC lower values = more parsimonous model

Compared three models: (1) Specific time trend for each station (2) Specific time trend for southern vs. northern areas (3) Common time trend

Here: results for 20-year time trend

1 2 3 1 2 3 1 2 3

3. March 2016 Dag Ø. Hjermann 12

Page 13: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 13

1b. Large-scale: conclusions - Even "uncontaminated" locations have very often very

different trends - In addition, concentration levels differ a lot among stations

- Example: BDE-47

log e

(BD

E47

) ca. 5x forskjell

highest

lowest

Page 14: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 14

2. Pooled samples Example: 25 fish per station (1) 25 analyses (2) 5 analyses (each pooled from 5 fish) → Loses a bit statistical power

"Loses" 15-30 % of significant time trends → A bit higher rate of Type 1 errors

Method (2) shows a trend i 1-2% of the cases where method (1) does not show a trend

→ Mean level becomes biased upwards Estimated concentration ca. 10-20 % too high But: can be adjusted if the number of fish per

sample is (approximately) equal

Page 15: Sampling design – from a statistical point of view

3. March 2016 Dag Ø. Hjermann 15

Conclusions - On a small scale (for instance, within a fjord), it is

advantageous if sampling is done over some time (~2-3 weeks) and space (some km) - Fish in a sample tends to have similar age and history - At least when using active gear (i.e. trawling)

- On a large scale, stations should not be combined

- Pooled samples are OK (significantly more "bang for the

buck") but pooled samples should contain equally many fish

- Same conclusions for emerging contaminants?