d1t3 enm workflows updated

69
ECOLOGICAL NICHE MODELING METHODS UPDATE Town Peterson, University of Kansas

Upload: town-peterson

Post on 12-Apr-2017

150 views

Category:

Science


3 download

TRANSCRIPT

Page 1: D1T3 enm workflows updated

ECOLOGICAL NICHE MODELING METHODS UPDATETown Peterson, University of Kansas

Page 2: D1T3 enm workflows updated

It Is A Bit Too Easy …

• Very easy access to lots of occurrence data

• Very easy access to rich geospatial data

• Easy-to-use modeling tools• Lots of literature setting out

the examples

Page 3: D1T3 enm workflows updated

Ecological Niche Modeling

1. Accumulate Input Data2. Integrate Occurrence and Environmental Data3. Model Calibration4. Model Evaluation5. Summary and Interpretation

Page 4: D1T3 enm workflows updated

Accumulate Input Data

Collate primary biodiversity data documenting occurrences

Process environmental layers to be maximally relevant to distributional ecology of species in question

Collate GIS database of relevant data layers

Assess spatial precision of occurrence data; adjust inclusion of data accordingly

Data subsetting for model evaluation

Occurrence andenvironmental data

Assess spatial autocorrelation

Page 5: D1T3 enm workflows updated

Occurrence Data in Niche Modeling

• Goal is to represent the full diversity of situations under which a particular species maintains populations

• Spatial biases (i.e., non-random or non-uniform distribution within G) is not damning

• Biases within E are catastrophic, and will translate directly into biases in any niche estimate

• More is usually better, but not always…

Page 6: D1T3 enm workflows updated

speciesLink Network

Page 7: D1T3 enm workflows updated

Uncertainty in Direction and Distance

Page 8: D1T3 enm workflows updated

Georeferencing should …

• Represent the place at which the species was found

• Represent the certainty and uncertainty with which that place is characterized

• Summarize the methods used to establish that place

• Preserve all of the original information for possible reinterpretation

Page 9: D1T3 enm workflows updated
Page 10: D1T3 enm workflows updated
Page 11: D1T3 enm workflows updated

Internal Consistency Testing

Page 12: D1T3 enm workflows updated

Data Cleaning• Attempt to detect meaningfully erroneous records, so

that they can be treated with caution in analysis• Use internal consistency to detect initial problems– Species names consistent?– Terrestrial species on land, marine species in the ocean?– Latlong matches country, state, district, etc.

• Use external consistency to go deeper– Occurrence data match known distribution spatially?– Occurrence data match known distribution environmentally?

• If precision data are available, filter to retain only records that are precise enough for the study

• Iterative process with important consequences

Page 13: D1T3 enm workflows updated
Page 14: D1T3 enm workflows updated

Data Subsetting

• Must respond to the question at hand … why are you doing the study?

• Ideally completely independent data streams• Failing that, can be – Macrospatial– Microspatial (but see spatial autocorrelation)– Random

• Will return to this point later…

Page 15: D1T3 enm workflows updated

Generalities: Environmental Data

• Raster format: i.e., information exists across entire region of interest

• Relevant information as regards the distributional potential of the species of interest

• More dimensions = better (generally), BUT – collinearity is bad– too many dimensions is bad

Page 16: D1T3 enm workflows updated

Major Sources

• Climate data – long time span, but low temporal resolution

• Remote-sensing data – high temporal resolution, diverse products, short time span

• Topographic data – high temporal resolution, uncertain connection to species’ distributional ecology

• Soils data – uneven global coverage, categorical data

• Others

Page 17: D1T3 enm workflows updated

Spatial Autocorrelation

Page 18: D1T3 enm workflows updated

Two Major Implications• Non-independence in model evaluation– Available data are often split into data sets for calibration and

evaluation– Data points that are not independent of one another may end

up in different data sets, thereby compromising the robustness of the test

• Inflation of sample sizes– Because individual data points may be non-independent of one

another, sample sizes may appear larger than they actually are– This inflation may create opportunity for Type 1 errors in model

evaluation and model comparisons

Page 19: D1T3 enm workflows updated
Page 20: D1T3 enm workflows updated

Process for Maximum Relevancy

Page 21: D1T3 enm workflows updated

Integrate Occurrence and Environmental Data

Assess BAM scenario for species in question; avoid M-limited situations

Saupe et al. 2012. Variation in niche and distribution model performance: The need for a priori assessment of key causal factors. Ecological Modelling, 237–238, 11-22.

Estimate M and S as area of analysis in study

Barve et al. 2011. The crucial role of the accessible area in ecological niche modeling and species distribution modeling. Ecological Modelling, 222, 1810-1819.

Reduce dimensionality (PCA or correlation analysis)

Occurrence andenvironmental data

Occurrence andenvironmental data ready for analysis

Page 22: D1T3 enm workflows updated

Assess BAM Scenario

Page 23: D1T3 enm workflows updated

BAM I: Eltonian Noise Hypothesis

A

M

B A

M

Page 24: D1T3 enm workflows updated

BAM II

ClassicBAM

Hutchinson’sDream

Wallace’sDream

All OK

Page 25: D1T3 enm workflows updated

Project onto Geography

Page 26: D1T3 enm workflows updated

Effect of BAM Scenarios

Page 27: D1T3 enm workflows updated

BAM Conclusions

• Some situations are not amenable to fitting ecological niche models that will have predictive power

• Models tend much more to good fitting of the potential distribution, rather than the actual distribution

• Must ponder carefully the BAM configuration in a particular study situation to avoid configurations that will not yield usable models

Page 28: D1T3 enm workflows updated

M and S as Study Area

Page 29: D1T3 enm workflows updated

Test Arena: The Lawrence Species

Page 30: D1T3 enm workflows updated

M and Model Training

Page 31: D1T3 enm workflows updated

Model Evaluation

Page 32: D1T3 enm workflows updated

M and Model Comparison

Page 33: D1T3 enm workflows updated

Model Comparison

Page 34: D1T3 enm workflows updated

M• When the species has no history in an area:– Use a radius related to dispersal distances

• When history is short (i.e., environment constant):– Use a radius representing compounding of

dispersal distances• When history is long (i.e., environmental

change is a factor) – Seek ways of assessing areas that the species’

distribution through time has covered…

Page 35: D1T3 enm workflows updated

Icterus cucullatus

Sampling

Page 36: D1T3 enm workflows updated

Reduce Dimensionality

Page 37: D1T3 enm workflows updated

Model Calibration

Estimate ecological niche (various algorithms)

Model calibration, adjusting parameters to maximize quality

Model thresholding

Peterson et al. 2007. Transferability and model evaluation in ecological niche modeling: A comparison of GARP and Maxent. Ecography, 30, 550-560.

Occurrence andenvironmental data ready for analysis

“No Silver Bullet” paper to appear

Warren, D. L. and S. N. Seifert. 2011. Ecological niche modeling in Maxent: The importance of model complexity and the performance of model selection criteria. Ecological Applications 21:335-342.

Preliminary models

Page 38: D1T3 enm workflows updated

Estimate Ecological Niche

Page 39: D1T3 enm workflows updated
Page 40: D1T3 enm workflows updated
Page 41: D1T3 enm workflows updated
Page 42: D1T3 enm workflows updated
Page 43: D1T3 enm workflows updated
Page 44: D1T3 enm workflows updated
Page 45: D1T3 enm workflows updated
Page 46: D1T3 enm workflows updated
Page 47: D1T3 enm workflows updated

No Silver Bullets in ENM

• Single algorithms may perform ‘best’ on average• The best algorithm in any given situation, however,

may be other than the ‘best’• NSB thinking suggests that we should not use a

single approach• Use a suite of approaches (e.g., as implemented in

OM, BIOMOD, BIOENSEMBLES, etc.), challenge to predict, choose best for that situation

• Maxent is good, but it is not the only algorithm …

Page 48: D1T3 enm workflows updated

Model Thresholding

Page 49: D1T3 enm workflows updated

“Presence”

Page 50: D1T3 enm workflows updated

Thresholding

• Use an approach that prioritizes omission error over commission error, in view of the greater reliability of presence data

• Minimum training presence thresholding seeks the highest suitability value that includes 100% of the calibration data

• Suggest (strongly) using a parallel approach that seeks that highest suitability value that includes (100-E)% of the calibration data

Page 51: D1T3 enm workflows updated

Model Optimization and Parameter Choice

Page 52: D1T3 enm workflows updated

Model Evaluation

Project niche model to geographic space

Model evaluation

Peterson et al. 2008. Rethinking receiver operating characteristic analysis applications in ecological niche modelling. Ecological Modelling, 213, 63-72.

Preliminarymodels

Reset data subsets based on evaluation results

Corroborated models ready for projection to

geographic times/regions of

interest

Page 53: D1T3 enm workflows updated
Page 54: D1T3 enm workflows updated

If predicted suitable area covers 15% of the testing area, then 15% of evaluation points are expected to fall in the predicted suitable area by chance.

• p = proportion of area predicted suitable

• s = number of successes• n = number of evaluation

points

Cumulative binomial distribution calculates the probability of obtaining s successes out of n trials in a situation in which p proportion of the testing area is predicted present. If this probability is below 0.05, we interpret the situation as indicating that the model’s predictions are significantly better than random.

Threshold-dependent Approach

Page 55: D1T3 enm workflows updated

Threshold-independent Approaches

Page 56: D1T3 enm workflows updated
Page 57: D1T3 enm workflows updated
Page 58: D1T3 enm workflows updated

http://shiny.conabio.gob.mx:3838/nichetoolb2/

Page 59: D1T3 enm workflows updated

Significance vs Performance

• Predictions that are significantly better than random is important, and is a sine qua non for model interpretation

• BUT, it is also important to assure that the model performs sufficiently well for the intended uses of the output

• Performance measures include omission rate, correct classification rate, etc.

Page 60: D1T3 enm workflows updated

Summary and Interpretation

Evaluation of model transfer results

Transfer to other situations (time and space)

Assess extrapolation (MESS and MOP)Owens, H. L., L. P. Campbell, L. Dornak, E. E. Saupe, N. Barve, J. Soberón, K. Ingenloff, A. Lira-Noriega, C. M. Hensz, C. E. Myers, and A. T. Peterson. 2013. Constraints on interpretation of ecological niche models by limited environmental ranges on calibration areas. Ecological Modelling 263:10-18.

Refine estimate of current distribution via land use, etc.

Compare present and “other” to assess effects of change

Models calibrated and evaluated, and transferred to present and “other” situations

Page 61: D1T3 enm workflows updated
Page 62: D1T3 enm workflows updated
Page 63: D1T3 enm workflows updated
Page 64: D1T3 enm workflows updated
Page 65: D1T3 enm workflows updated
Page 66: D1T3 enm workflows updated

MESS and MOP• Both have the intention of detecting extrapolative

situations• MESS is implemented within Maxent• MESS compares the area in question to the

centroid of the calibration cloud• MOP compares the area in question to the nearest

part of the calibration cloud• Agree on ‘out of range’ conditions• MOP better characterizes similarities between

calibration and transfer regions, and thus is more optimistic as regards in-range extrapolation

Page 67: D1T3 enm workflows updated
Page 68: D1T3 enm workflows updated

Ecological Niche Modeling

1. Accumulate Input Data2. Integrate Occurrence and Environmental Data3. Model Calibration4. Model Evaluation5. Summary and Interpretation