conservation by consensus: reducing uncertainty from ... · conservation by consensus: reducing...

Conservation by Consensus: Reducing Uncertainty from Methodological Choices in Conservation-Based Models

by

Mark S. Poos

A thesis submitted in conformity with the requirements for the degree of Doctorate of Philosophy

Department of Ecology and Evolutionary Biology University of Toronto

© Copyright by Mark S. Poos 2010

ii

Conservation by Consensus: Reducing Uncertainty from

Methodological Choices in Conservation-Based Models

Mark S. Poos

Doctorate of Philosophy

Department of Ecology and Evolutionary Biology University of Toronto

2010

Abstract

Modeling species of conservation concern, such as those that are rare, declining, or have a

conservation designation (e.g. endangered or threatened), remains an activity filled with

uncertainty. Species that are of conservation concern often are found infrequently, in small

sample sizes and spatially fragmented distributions, thereby making accurate enumeration

difficult and traditional statistical approaches often invalid. For example, there are numerous

debates in the ecological literature regarding methodological choices in conservation-based

models, such as how to measure functional traits to account for ecosystem function, the impact of

including rare species in biological assessments and whether species-specific dispersal can be

measured using distance based functions. This thesis attempts to address issues in

methodological choices in conservation-based models in two ways. In the first section of the

thesis, the impacts of methodological choices on conservation-based models are examined across

a broad selection of available approaches, from: measuring functional diversity; to conducting

bio-assessments in community ecology; to assessing dispersal in metapopulation analyses. It is

the goal of this section to establish the potential for methodological choices to impact

conservation-based models, regardless of the scale, study-system or species involved. In the

second section of this thesis, the use of consensus methods is developed as a potential tool for

reducing uncertainty with methodological choices in conservation-based models. Two separate

applications of consensus methods are highlighted, including how consensus methods can reduce

uncertainty from choosing a modeling type or to identify when methodological choices may be a

problem.

iii

Acknowledgments

No accomplishment is ever singular, and my doctoral work is no different. For me, I have

had the great fortune of a loving family, a group of wonderful friends, supportive colleagues, a

thorough and considerate academic committee, and a brilliant advisor. I think Marston Bates was

right when she said “Research is the process of going up alleys to see if they are blind.” In my

personal journey through this doctoral work, I have gone through many blind alleys. Without the

help and support of all the people in my life, this thesis would not have been possible.

First, I am grateful for the opportunity to have conducted this research under the supervision

of Dr. Don Jackson. I have learned a great deal about a great many things from Don, including:

multivariate statistics, sampling aquatic systems and natural ecology. Don was always willing to

share his thoughts and ideas, leave his door open door for questions, and let me steal a cup of

coffee; you couldn`t ask for a better combination. I am also very thankful for the mentorship of

Dr. Harold Harvey. Harold is my academic grandfather, a mentor and I hope to say friend. I am

thankful for Harold’s advice, his stories, and providing laughter and support. I will greatly miss

our daily conversations (and the recession cookies). I hope one day I am half as wise as Harold

is. I am deeply indebted to Dr. Nicholas Mandrak for all his help and advice. Not only was Nick

my M.Sc. supervisor, he was a member of my Ph.D. academic committee, and he was integral

part from the start of my thesis. Nick’s insights into biology of fishes are second to none, and I

owe a great deal of gratitude for all his mentorship throughout the years. Nick was also

instrumental in helping to obtain funds (IRF #1410) to keep aspects of this project going; which

saved the project. I am also greatly indebted to my academic committee members: Dr. Marie-

Josee Fortin, Dr. Brian Shuter and Dr. Keith Somers, whose expertise and knowledge greatly

improved this thesis and my own research. All of my academic committee members were

wonderful in guiding me through their areas of expertise, from spatial ecology (Thanks Marie-

Josee), to fisheries techniques (Thanks Brian), to multivariate statistics and bioassessments

(Thanks Keith). Finally, I am thankful to Dr. Bill Matthews for acting as my external examiner.

Bill has always been one of my academic idols, and it was an honor and pleasure to have him as

an external. I will never forget it. From my advisor, to my academic committee, to those

involved on the appraisal/defense; this thesis was clearly built on the shoulders of giants.

They say it takes a village to raise a child, for this thesis, it took a small army. None of the

field work would have been possible without the dedication and support of many people. In

iv

particular I wish to thank Andrew Drake (aka D.A.R. Rubington III) and Cavan Harpur. So

frequently I’d found myself in dire straits, without field crew or transportation, a rag tag

assemblage of gear, missing GPS coordinates, lack of fish (or fill in any number of other

circumstances); yet Andrew and Cavan were always there to help out and make it work. Their

perseverance and friendship turned what should have been an unquestionable failure into what I

hope is a great success. I can’t thank them enough.

An enormous thank you goes to the Toronto Region Conservation Authority, and in

particular the Aquatics group: Christine Tu, Dave Lawrie, Trevor Parker, and Tim Rance.

Through a simple handshake, we became partners, and their help and support ensured that much

of this research would get done. When no-one was there to help, TRCA was always ready to

come to the rescue. Regardless of the obstacles, their support never faltered and I am so happy

that we worked together. Dave Lawrie deserves special recognition for being the absolute best

colleague you could ask for. Dave was always willing to give you the shirt off his back, I know

of few people with his dedication and passion for preserving aquatic species.

Too all the volunteers who helped sample and collect data; thank you. Of course, sampling

endangered species comes with its own set of surprises. The fire ants were my favorite. My guess

is that Dave was right when he observed a negative correlation between returning volunteers and

fire ants; yet so many of you returned for multiple feedings. So thank you all the dedicated

volunteers from- the University of Toronto - Andrew Drake, Cavan Harpur, Brie Edwards, Meg

St. John, Maggie Neff, Paul Venturelli, Alex Manning, Monica Granados, Jonathan Ruppert,

Sapna Sharma, Nicole Puckett, Cristal Hart, Chris Howard, Moe Luksenberg, John Brett, Don

Jackson, Steve Walker – and the Toronto Region Conservation Authority - Dave Lawrie,

Christine Tu, Trevor Parker, Tim Rance, Brennan Paul, Elyssa Elton, Brad Stephens, Cristal

Hart, Laura DelGiudice, Brian Moyle, and Maria Parish – And: Peter Ng, Michell Wong Ken,

Connie Zehr, Bev Edwards, Yuko Nozoe, Derek Trim, Kenny Lee, Kari Jean (ABCA), Davin

Heinbuck (ABCA), Doug Forder (Ontario Streams), Daniel Morodvanschi, and Kenny Lee.

I also wish to thank the Jackson lab past and present (Sapna Sharma, Steve Walker, Maggie

Neff, Meg St.John, Brie Edwards, Monica Granados, Riku Pavola, Lifei Wang, Jean Bernard

Caron, Angela Strecker, Karen Wilson, and Theo Willis), and Harvey labs (Andrew Drake,

Cavan Harpur) for their enormous support of my project. Most (if not all) members of these labs

participated in some form or another in my project, and if not, their influence from discussions

can be found throughout the following pages. In particular thanks to Steve Walker for his help in

v

developing tests for measuring sensitivity in functional diversity. I never thought it would take as

long or work out nearly as well as it did; my sincere thanks to Steve.

I thank the following agencies and individuals for provided data for portions of this research:

Royal Ontario Museum (ROM), Fisheries and Oceans (DFO), Ontario Ministry of Natural

Resources (OMNR), Toronto Region Conservation Authority (TRCA), Conservation Halton

(CH), Lower Lake Simcoe Conservation Authority (LLSCA), Credit Valley Conservation, Dr.

Nicholas E. Mandrak (DFO), Doug Forder (Ontario Streams), Jeff J. Anderson (LLSCA), Les

Stanfield (OMNR), Erling Holm (ROM), David Lawrie (TRCA), Scott Jarvie (TRCA), Sherwin

Watson-Leung (CH), Dr. Scott Reid (OMNR), John Pisapio (OMNR), and Mark Heaton

(OMNR).

To my extended academic family, the department of Ecology and Evolutionary Biology,

thank you. Dr. Lisa Manne was very kind in providing desk space at UTSc in the later stages of

my thesis for writing. I am ever so grateful as it saved me many hours of commuting and kept me

close to my family (who also thank you). I also really enjoyed coming in and talking ecology

with her and her lab (Caroline Tucker, Christopher Grouios, and all the undergraduate students).

Dr. Helen Rodd has also been a great sounding board for ideas and has been wonderfully

supportive. Dr. Locke Rowe and Dr. Spencer Barrett provided laboratory space for imaging and

genetics work, which didn’t make its way into the thesis, but will hopefully be out in publication

shortly and I thank them nonetheless. Bill Cole and Jen Perry were also wonderful in helping me

navigate through their worlds of genetics and imaging.

To my friends and family ... thanks. Mom, dad, Nicole, Dave, Mau, Evan, Hayley, Shannon,

my extended family, and friends; thanks for keeping me sane and always bringing a smile to my

face. Last but not least, none of this would be possible without the constant love and

encouragement from my wife Jessica and son Jacob. My absolute favourite part of everyday is

coming home to see you. Thank you just isn’t enough. Thanks for accepting the fish smell in the

car (you just can’t get that stink off), for missing your own events, or getting dropped off

early/late to accommodate mine, for the missed evenings and weekends, the added stress, the

crappy pay, the long hours, and everything else. To Jacob, I hope one day you can take some

inspiration from these pages. Not for its content (I would never do that to you), but from reading

in-between these lines and noting a worthy lesson: that even when something is difficult,

perseverance and hard work can help you find your way through any blind alley.

vi

Table of Contents Acknowledgments .......................................................................................................................... iii

Table of Contents ........................................................................................................................... vi

List of Tables .................................................................................................................................. x

List of Figures ............................................................................................................................... xii

List of Appendices ....................................................................................................................... xvi

Chapter 1 General Introduction ...................................................................................................... 1

I) Impacts of Methodological Choices in Conservation-Based Models .................................... 3

Functional Diversity ............................................................................................................ 4

Bioassessments ................................................................................................................... 5

Metapopulation Viability Models ....................................................................................... 6

II) Using Consensus Methods to Reduce Uncertainty from Methodological Choices .............. 7

Scope .......................................................................................................................................... 8

Statement of Contribution .......................................................................................................... 9

Publication of Thesis Material ................................................................................................... 9

References ................................................................................................................................ 10

Section I: ...................................................................................................................................... 18

The importance of methodological choices in ecological models ................................................ 18

Chapter 2: The importance of methodological choices in influencing the measure of functional diversity across ecological communities ..................................................................... 19

Abstract .................................................................................................................................... 19

Introduction .............................................................................................................................. 20

Methods .................................................................................................................................... 22

Results ...................................................................................................................................... 26

Discussion ................................................................................................................................ 29

vii

Acknowledgements .................................................................................................................. 32

References ................................................................................................................................ 33

Appendices ............................................................................................................................... 36

Chapter 3: Addressing the removal of rare species in bioassessments with other choices in multivariate analyses ..................................................................................................................... 43

Abstract .................................................................................................................................... 43

Introduction .............................................................................................................................. 44

Methods .................................................................................................................................... 47

Evaluating Decisions in Multivariate Bioassessments ..................................................... 48

Assessing the Statistical Argument ................................................................................... 51

Assessing the Biological Argument .................................................................................. 52

Results ...................................................................................................................................... 54

Discussion ................................................................................................................................ 58


References ................................................................................................................................ 61

Appendices ............................................................................................................................... 71

Chapter 4: Contrasting direct versus indirect dispersal in metapopulation viability analyses ..... 73

Abstract .................................................................................................................................... 73

Introduction .............................................................................................................................. 74

Methods .................................................................................................................................... 77

Determining Metapopulation Viability ............................................................................. 78

Colonization Rate: ............................................................................................................ 78

Extinction Rate: ................................................................................................................ 79

Incorporating Dispersal Directly into the Metapopulation Model: ................................... 80

Regional Stochasticity ...................................................................................................... 81

Comparing Viability of Metapopulations Using Direct versus Indirect Parameterization ................................................................................................... 81

viii

Results ...................................................................................................................................... 82

Metapopulation Dynamics ................................................................................................ 82

Metapopulation and Patch Viability ................................................................................. 83

Discussion ................................................................................................................................ 87


References ................................................................................................................................ 92

Appendices ............................................................................................................................. 100

Section II: ................................................................................................................................... 102

Reducing uncertainty from methodological choices using consensus methods ......................... 102

Chapter 5: Reducing uncertainties in modeling the distribution of endangered species using habitat-based ensemble models ................................................................................................... 103

Abstract .................................................................................................................................. 103

Introduction ............................................................................................................................ 104

Methods .................................................................................................................................. 106

Study Area and Species .................................................................................................. 106

Building Individual Models ............................................................................................ 109

Evaluating Individual Models ......................................................................................... 113

Building Ensemble Models ............................................................................................. 113

Evaluating Ensemble Models ......................................................................................... 114

Results .................................................................................................................................... 115

I) Individual Models ....................................................................................................... 115

II) Ensemble Models ....................................................................................................... 117

Discussion .............................................................................................................................. 120

Conclusion .............................................................................................................................. 123

Acknowledgements ................................................................................................................ 123

ix

References .............................................................................................................................. 124

Appendices ............................................................................................................................. 131

Chapter 6: Using consensus methods to identify (and reduce) sensitivity from methodological choices when measuring functional diversity ............................................................................. 136

Abstract .................................................................................................................................. 136

Introduction ............................................................................................................................ 137

Methods .................................................................................................................................. 139

Using Consensus Methods to Identify Uncertainty when Measuring FD ...................... 140

Results .................................................................................................................................... 141

The Relationship between FD, Distance Measure & Clustering Algorithm .................. 141

Identifying Sensitivity in FD Using Consensus Methods ............................................... 144

Discussion .............................................................................................................................. 144

Acknowledgements ................................................................................................................ 147

References .............................................................................................................................. 147

Appendices ............................................................................................................................. 152

Chapter 7: General Conclusions ................................................................................................. 153

A) Conservation-based Models in General ............................................................................ 153

B) Functional Diversity .......................................................................................................... 154

Recommendations .................................................................................................................. 157

References .............................................................................................................................. 158

x

List of Tables Table 2.1 – The maximum probability of FD sensitivity for five communities previously used to

examine FD (Petchey and Gaston 2007; Podani and Schmera 2006). The number of species and

the number of assemblage pairs tested are also shown. Data sources are as follows: A) Holmes

et al. (1979), B) Munoz and Ojeda (1997), C) Jaksic and Medel (1990), D) Golluscio and Sala

(1993), and E) Chapin et al. (1996). ............................................................................................. 29

Table 3.1 – Summary of ordination techniques, similarity coefficients and exclusion of rarely

sampled species being compared. Abbreviations are indicated in parentheses and used in

subsequent figures and tables. All four approaches described in the “Exclusion of Rarely

Sampled Species” were used in each of the four “Similarity Coefficient” combinations with both

PCoA and NMDS. As Correspondence Analysis has the implicit chi-squared distance measure,

only the four approaches used in the “Exclusion of Rarely Sampled Species” were included in

that set of analyses. ....................................................................................................................... 49

Table 3.2 –Partitioning of variation in sum of squared deviations of Procrustes analyses (m2

statistic) across various choices in multivariate analyses, including: i) removal of rare species; ii)

ordination technique; and, iii) choice of similarity measure. Abbreviations are those noted in

Table 1. ......................................................................................................................................... 56

Table 4.1 – Summary of mark-recapture information of the endangered fish the reside dace

(Clinostomus elongatus) used to directly parameterize stochastic patch occupancy

metapopulation models. Shown are two locations in the Greater Toronto Area, Ontario, Canada:

A) Leslie Tributary, and B) Berczy Creek. Note: items denoted with a single asterisk (*)

represent a significant difference between populations (Mann-Whitney U for average dispersal,

G-test for stationary tags, p<0.05). ............................................................................................... 82

Table 4.2 – Intrinsic mean time (in years) to extinction (Grimm and Wissel 2004) of two

metapopulations of the endangered fish the redside dace (Clinostomus elongatus) for different

levels of regional stochasticity. ..................................................................................................... 84

Table 5.1 – Summary of the hierarchical habitat-based model used for predicting the presence of

endangered minnow, redside dace (Clinostomus elongatus). Seven predictive models were used

xi

including: LR (logistic regression), CT (classification trees), MARS (multivariate adaptative

regression splines), ANN (artificial neural networks), DA (discriminant analysis), RF (random

forest), and BR (boosted regression trees). Variables were derived using forward-selection

procedures on five independent datasets and are shown as a percentage of datasets where each

variable was selected (parentheses indicate negative associations). Variables selected from only

one dataset were omitted. A prioi predictions (ap) based on habitat predictors thought to

influence the decline of the species are shown for reference, where + indicates a positive

correlation, - negative correlation, 0 none. ................................................................................. 108

Table 5.2 – A comparison of model sensitivity (correct classification of species presence),

specificity (correct classification of species absence), and overall classification (correct

classification of both species presence and absence) for redside dace (Clinostomus elongatus).

Single models are: LR (logistic regression); CT (classification trees); BR (boosted regression

trees); MARS (multivariate adaptive regression splines); ANN (artificial neural networks); DA

(discriminant analysis); and, RF (random forest). Ensemble forecasts are: consensus model

(CM); principal component analysis (PCA); weighted average using overall classification (WA);

mean (Mn); and, median (Md). ................................................................................................... 116

Table 6.1 – Consensus measures of dendrogram group fidelity across distance measures

(Euclidean and cosine) for each clustering algorithm: single linkage, unweighted pair group

method with arithmetic means (UPGMA), and complete linkage. Group fidelity was determined

by majority rules consensus tress using CI(C) consensus index. ................................................ 144

xii

List of Figures Figure 2.1 –Measuring the sensitivity of FD in a hypothetical eleven species community. The

procedure consists of randomly dividing the community into two assemblages, noting how FD

orders the two assemblages and assessing how sensitivity effects this order (i.e. did assemblage 1

always have higher FD given differences in methodology?). Each species is represented as a

letter and the assemblages are represented as overlapping rectangles that contain the letters

associated with their component species. The first set of rectangles represents one random

division of the community into two assemblages. A new random division can be obtained by

leaving the overlapping rectangles fixed and simply permuting the order of the species. The

second set of overlapping rectangles gives an example of such a permutation. ........................... 24

Figure 2.2 - The effect of alpha and beta diversity on the probability of FD sensitivity for three

communities crossed with four sets of construction methods. Darker shading represents a higher

probability of sensitivity. The communities are A) Insectivorous birds (Holmes et al. 1979), B)

Intertidal fish (Munoz and Ojeda 1997) and C) Predatory vertebrates (Jaksic and Medel 1990).

Each column is for a different set of construction methods. For the first and second columns,

overall PS and PG, all nine methods of dendrogram construction (three distance measures times

three clustering algorithms) were used with the PS and PG methods respectively. For the third

column, all three clustering algorithms were used with Gower’s distance and the PS method. For

the fourth column, all three distance measures were used with UPGMA and the PS method. .... 27

Figure 2.3 –The effect of alpha and beta diversity on the probability of FD sensitivity for two

communities crossed with two methods of FD calculation. Darker shading represents a higher

probability of sensitivity. The communities are D) Patagonian forbs (Golluscio and Sala 1993)

and E) Artic vegetation (Chapin et al. 1996). For these communities, only Gower’s distance

could be calculated and so only three construction methods could be compared, corresponding to

the three clustering algorithms. Each column is for a different method of FD calculation. The

first is for the PS method and the second is for the PG method. .................................................. 28

Figure 3.1 – Example of rank-ordered, site-level vector residuals of Procrustean multivariate

comparison. The length of a vector residual indicates an overall lack of fit for a site between two

multivariate analyses. Shown is a comparison of full dataset of Principal Coordinates with

xiii

Jacaard’s distance and the same dataset where species occurring at 5% of sites were removed.

Vectors shown in grey indicate those sites where at least one species was removed, whereas

vectors in black indicate sites where no species were removed. The ratio of mean vector

residuals between sites where species were removed versus those sites that did not have species

removed indicates the distribution of impacts of the removal of rare species across multivariate

analyses. Where most vector residuals for sites having species removed are largest, they indicate

that these observations (sites) have been changed the most in their position between two

ordinations. .................................................................................................................................... 53

Figure 3.2 –Principal Coordinates Analysis (PCoA) of the sum of squares deviations (m2

statistic) comparing the concordance between solutions based on different ordination techniques,

similarity coefficients and treatments of excluding rarely sampled species. A minimum-spanning

tree was overlaid on Axes 1 and 2 to highlight connections between groups of points. Dashed

lines indicate deviations from group membership in cases where clear groupings do not exist

(e.g. M10 for Axes 2 and 3). Short forms are continued from Table 3.2. .................................... 55

Figure 3.3 – Site-level impact of the removal of rare species. Shown are box and whisker plots of

the ratios of Procrustes vector residuals between sites for which rare species were removed and

those sites that did not have any species removed. All comparisons were done by comparing

site-level Procrustes vector residuals from the full datasets and with the removal of rare species

across all similarity coefficients and ordination methods. ............................................................ 57

Figure 4.1 – Study sites on Rouge River, Ontario where redside dace (Clinostomus elongatus)

dispersal and patch dynamics were monitored. Study locations: A) Leslie Tributary, and B)

Berczy Creek, were sub-divided into extensive sites (black), where redside dace were tagged

with a color-coded visual implant elastomer tag, and extended sites (grey), which were

monitored for tag movement. ........................................................................................................ 76

Figure 4.2 – Metapopulation viability of the endangered species the redside dace (Clinostomus

elongatus) in two stream metapopulations: A) Leslie Tributary, and B) Berczy Creek. Shown are

the probabilities of extinction (y-axis) in years (x-axis) of a stochastic patch-based

metapopulation model. Models were parameterized using: I) indirect parameterization of

colonization and dispersal via patch distance, and; II) direct parameterization of colonization and

dispersal using empirical estimates from a mark-recapture study. Legend: Vertical hashes

xiv

represent a time interval of 100 years, solid lines indicate population trajectories where regional

stochasticity was set to 0, dashed lines set to 0.1 and dotted lines set at 0.2. ............................... 85

Figure 4.3 – Differences in patch viability parameterized using indirect (y-axis) and direct (x-

axis) patch dynamics of the endangered species the redside dace (Clinostomus elongatus) in two

stream metapopulations: A) Leslie Tributary (L6-L15), and B) Berczy Creek (B6-B18). Shown

are the mean probabilities of persistence of a given patch across 10,000 simulations. To

demonstrate the variability in patch viability, 25% quantiles are overlaid as the negative of both

the vertical and horizontal axes, while 75% quantiles are overlaid as the positive vertical and

horizontal axes. The dashed line is a 1:1 line. .............................................................................. 86

Figure 5.1 – Distribution of sampling locations between 1997-2007. Closed circles indicate

redside dace occurrences, whereas, open circles indicate where redside dace were absent. ...... 107

Figure 5.2 .................................................................................................................................... 110

Figure 5.3- Cluster analysis showing the relationship with the observed distribution (Obs.) of

redside dace (Clinostomus elongatus) and: A) the seven individual modeling approaches alone,

and B) with ensemble forecasts included. Model short forms are carried over from Table 5.2. 117

Figure 5.4 – Box and whisker plots showing the variability in consensus ensemble forecasts for

predicting the presence of an endangered redside dace. Consensus ensemble models were

compared across all combinations of one (n =7), three (n =35), five (n =21) and seven (n=1)

input models (x-axis). Boxes are 25th and 75th percentiles, horizontal lines indicate the median,

vertical lines indicate the upper and lower values, diamonds indicate the mean and are connected

by dashed lines. Modeling metrics were: A) model sensitivity (i.e. correct classification of

species presence); B) model specificity (i.e. correct classification of species absence); and, C)

overall classification. Dashed lines indicate 95% confidence intervals. .................................... 119

Figure 6.1 - The relationship between distance measure (Euclidean or cosine) and clustering

algorithm (SL = single linkage / nearest neighbor, UPGMA = unweighted pair group method

with arithmetic mean, CL = complete linkage / farthest neighbor) with FD using five community

data sets: A) Arctic vegetation (Chapin et al. 1996); B) Insectivorous birds (Holmes et al. 1979);

C) Patagonian forbs (Golluscio and Sala 1993); D) Intertidal fish (Munoz and Ojeda 1997); and,

E) Predatory vertebrates (Jaksic and Medel 1990). FD values were re-scaled relative to the

xv

Arctic vegetation data, which has the highest FD values. This standardization leads to the

appearance of a constant outcome for the Arctic dataset, but this consistency is solely an artifact

of using it as the reference point rather than the outcomes not differing depending on the

resemblance measure or clustering algorithm. ............................................................................ 142

Figure 6.2 - The relationship between distance measure (solid lines = Euclidean distance, dashed

lines = cosine distance) and building a dendrogram using a clustering algorithm (1 = complete

linkage / farthest neighbor, 2 = unweighted pair group method with arithmetic mean / UPGMA,

3 = single linkage / nearest neighbor) where species are individually removed when calculating

FD. Five community data sets are shown: A) Arctic vegetation (Chapin et al. 1996), B)

Insectivorous birds (Holmes et al. 1979), C) Patagonian forbs (Golluscio and Sala 1993), D)

Intertidal fishes (Munoz and Ojeda 1997), and E) Predatory vertebrates (Jaksic and Medel 1990).

Shown inset are 50% majority rule consensus trees demonstrating lack of between group fidelity

of species where calculating functional diversity using different distance measures, but the same

clustering approach. .................................................................................................................... 143

xvi

List of Appendices Appendix 2.1 – Derivation of Equation 1. .................................................................................... 36

Appendix 2.2 – MatLAB Code for testing sensitivity of FD ........................................................ 38

Appendix 2.3 – MatLAB Code for calculating FD via Podani and Schmera ............................... 41

Appendix 2.4 – MatLAB Code for calculating probabilities of sensitivity .................................. 42

Appendix 3.1 – Site-level effects of methodological choices in bioassessments. Shown are the

ratios between mean site-level vector residuals from Procrustes analyses of sites having species

removed and those sites having no species removed. Mean site-level vector residual values were

separated by sites which had rare species removed (M1: n=2; M5: n=19; and M10: n=63); and

compared with those sites that not. ............................................................................................... 71

Appendix 3.2 – Summary of three-dimensional ordination results. Shown are eigenvalues for

Principal Coordinates Analyses (PCA) and Correspondence Analyses (CA), with percent

variance explained shown in parentheses. Stress values are shown for Non-Metric

Multidimensional Scaling (NMDS). ............................................................................................. 72

Appendix 4.1 –The endangered redside dace (Clinostomus elongatus). Photo credit: Mark Poos.

..................................................................................................................................................... 100

Appendix 4.2 –Visual implant elastomer (VIE) tag inserted subcutaneously on the ventral surface

of the endangered redside dace (Clinostomus elongatus). Photo credit: Mark Poos. ................. 100

Appendix 4.3 – Model parameters used in the stochastic patch-occupancy models. Shown are

the number of emigrants (mi), the mean probability of detection (PD), the number of emigrants

adjusted for potentially missed tags (Mi), the number of immigrants needed to start a new sub-

population, and the rate of extinction (Ei). Other parameters include: the incidence function for

Leslie Tributary (dI = 210, x = 0.4926, e = 3.685, y = 6.12), and Berczy Creek (dI = 150, x =

0.5652, e = 4.187, y = 7.01). Not shown: dij given it is a pairwise estimate rather than unique for

each pool. .................................................................................................................................... 101

xvii

Appendix 5.1 – Model metrics for all combinations of consensus ensemble models. Models are:

LR (logistic regression), CT (classification trees), MARS (multivariate adaptive regression

splines), RF (random forest), ANN (artificial neural networks), BR (boosted regression trees, and

DA (discriminant analysis). ........................................................................................................ 131

Appendix 5.2 – R Code for testing configurations of 1,3, 5 and 7 prediction consensus models

..................................................................................................................................................... 133

Appendix 6.1 – MatLAB Code for calculating total branch lengths of dendrograms from various

species combinations .................................................................................................................. 152

1

Chapter 1 General Introduction

“All models are wrong, some models are useful” George Box, Statistics Professor

The overall aim of ecology is to broaden the understanding of the relationship between species

and their environment (Krebs1998). This objective can be achieved through a myriad of ways

from broadening the understanding of the structural components of an organism (e.g. molecular

or cellular biology, physiology, evolutionary ecology), to understanding specific aspects of the

organism itself (e.g. behavioral or evolutionary ecology, genetics), to understanding the interface

between the organism and its biotic and abiotic environment (e.g. community or population

ecology). Due to the complexity involved in each of the disciplines, and regardless of how

ecologists view the world, simplification of natural processes are needed and a statistical model

is often required.

Statistical models provide the framework for an interpretation of nature. A statistical model is,

by definition, merely a mathematical representation of some aspect of nature (Quinn and Keough

2002) and can take many forms. Models can be predictive in that they can be used to forecast,

among other things, species distributions (e.g. Araújo and Guisan 2006; Elith et al. 2006), habitat

importance (e.g. Olden and Jackson 2001; Guissan and Thuiller 2005), or potential changes in

the environment and species response (e.g. climate change; Thuiller 2004; Araújo et al. 2005).

Models can also be used to test hypotheses, e.g. is environment A better than B for a given

species (e.g. Matthews et al. 1992; Grossman et al. 2002; Skyfield and Grossman 2008)? Lastly,

models can be strictly informative, such as what is the current condition of environment A or

species B (e.g. Grossman 1984; Mathews et al. 1982; 1986)? In all cases, models are assumed as

useful if they can provide a realistic representation of the underlying natural world (Hilborn and

Mangel 1997). However, many factors may influence models and understanding methodological

choices may provide improved understanding of how ecologists interpret nature.

The history of statistical methods has demonstrated that understanding the impact of

methodological choices in today’s modeling approaches is sorely needed. Prior to the

development of modern computers, ecologists opted for models where the structure was

arbitrarily simplified (Quinn and Keough 2002). As such, there were relatively few

2

methodological choices, as data were often ‘massaged’ to fit the statistical approach or,

alternatively, practitioners used their favorite method ad-hoc (Jackson 1993). Now, with

complex statistical software and powerful computers readily available, ecologists have a plethora

of approaches to choose from and many methodological choices to make.

Understanding how methodological choices may impact results is an area of active and ongoing

research (Jackson et al. 1989; Hilborn and Mangel 1997; Grossman et al. 2006). Over the last

decade alone, methods for the analysis of species distribution have diversified and, at this point,

dozens of different approaches are available (Guisan and Zimmermann 2000; Guisan and

Thuiller 2005; Elith et al. 2006). Previous studies have demonstrated that methodological

choices made at one level of analysis may have cascading impacts on subsequent choices

(Jackson et al. 1989; Grossman et al. 1991; Dormann et al. 2008). For example, Dormann et al.

(2008) demonstrated that changes in both data quality and choice of model type had large

impacts for modeling the distribution of the critically endangered bird the Great Grey Shrike

(Lanius excubitor). Further, in a review of modeling species distributions with presence only

data, Elith et al. (2006) showed large differences in predictive performance among modeling

methods at both regional and species levels. These examples highlight that, despite substantial

effort in improving statistical methods for conservation-based species, there remains great

uncertainty regarding the impacts of methodological choices.

There are numerous debates in ecological literature regarding methodological choices in diverse

areas such as how to measure functional traits to account for ecosystem function (Petchey and

Gaston 2002; Podani and Schmera 2006), the impact of including rare species in biological

assessments (Cao et al. 1998; Marchant 1999), and whether species-specific dispersal can be

measured using distance-based functions (Heinz et al 2005; 2006). Given the choices that

ecologists must make, knowing the pitfalls and assumptions of conservation-based models

become increasingly important. For example, ecologists should know the impact of

methodological choices given the sampling design and kind of data that they wish to collect

(Quinn and Keough 2002).

My thesis attempts to address issues in methodological choices in conservation-based models in

two ways. In the first section of the thesis, the impacts of methodological choices inherent in

conservation-based models will be examined across a broad selection of available approaches

3

from measuring functional diversity (Chapter 2), to developing bio-assessments in community

ecology (Chapter 3), to assessing dispersal in metapopulation analyses (Chapter 4). It is the goal

of this section to illustrate the potential for methodological choices to impact conservation-based

models, regardless of the scale, study system or species involved.

For clarity, a conservation-based model refers to a model used to interpret/advance knowledge of

species with conservation concern (e.g. conservation designation, rare, or declining). These

models can include population viability analyses, models of habitat suitability or models of

ecological function. A methodological impact refers to a difference in conclusions (e.g.

population viability, habitat suitability, and ecological function) derived from methodological

choices that alter ecological interpretation and/or lead to differences in recovery/management.

Species with conservation concern refers to those which occur infrequently, have a conservation

status (e.g. endangered or threatened) or are declining (i.e. undergoing reductions in population

size but do not have a conservation designation). As there are many forms of rarity (Rabinowitz

et al. 1986), the definition of conservation-based models and species with conservation concern

used in this thesis is purposefully inclusive to allow for the greatest possible impact on

conservation applications. In the second section of this thesis, the use of consensus methods will

be demonstrated as a potential tool for reducing uncertainty with methodological choices in

conservation-based models. Two separate applications of consensus methods will be

highlighted, including how consensus methods can reduce uncertainty from choosing a modeling

type (Chapter 5) and can be used to identify when methodological choices may be a problem

(Chapter 6).

I) Impacts of Methodological Choices in Conservation-Based Models

To some researchers, there is perhaps no greater goal of ecology than to preserve the biological

diversity of species (Ehlrich and Wilson 1991). The loss of biological diversity through human

impact is of concern for the structure of ecological communities and its affects may, in turn,

affect the structure of ecosystems (e.g. keystone species). Current rates of decline of species

worldwide are thought to be hundreds to thousands of times higher than pre-human levels (Jelks

et al. 2008). In particular, freshwater systems may have rates of decline many times higher than

terrestrial systems (Ricciardi and Rasmussen 1999). With the rate of species decline increasing,

the need for conservation-based models that reflect biological phenomena is becoming timely.

4

There are many methodological difficulties with developing conservation-based models and

understanding these challenges may help improve species management. First, species that are of

conservation concern suffer from low sample sizes and spatially fragmented distributions,

thereby making accurate enumeration difficult (Green and Young 1993) and traditional statistical

approaches often invalid (Ellison and Agrawal 2005). As a result, models fitted to a response of

species with conservation concern usually have low power or high uncertainty (Cunningham and

Lindenmayer 2005). Further, in many multivariate methodologies, researchers down weight or

exclude rare species altogether (Gauch 1982), thereby reducing or eliminating their usefulness

for conservation applications.

The role of methodological choices has been debated in several types of conservation-based

models. In the following section a few ecological examples will be highlighted for further

analysis in this thesis.

Functional Diversity

Given the current loss of biodiversity, it is important to be able to model how functional diversity

will respond to species loss. Quantification of the relationship between species loss and

functional diversity is necessary to highlight unique species traits that may be lost and their

implication to ecosystem function (Srivastava and Vellend 2005). Extinction should not affect

overall ecosystem function if all species have similar functions, but it will have a major effect if

each species has a different function (Fonseca and Ganade 2001). Simulations using natural

communities have shown that the loss of species reduces functional diversity disproportionately

relative to the number of species (Petchey and Gaston 2002; Larsen et al. 2005). These findings

are in agreement with empirical studies noting the loss of rare species disproportionately impacts

ecosystem function (Hooper et al. 2005).

One of the current metrics of functional diversity (sensu Petchey and Gaston 2002) uses the total

branch length of a dendrogram (tree) constructed from a matrix of ecological traits. This method

requires three steps before the total branch lengths can be calculated, including: 1) obtaining a

trait matrix; 2) converting the trait matrix into a distance matrix; and, 3) clustering of the

distance matrix to produce a dendrogram. Petchey and Gaston (2002) have suggested that the

particular analytical methods used to produce the functional dendrogram are of limited relevance

5

to the resultant metric as the relationship between functional diversity and species richness is

robust to changes in distance metric and clustering algorithm. However, previous research has

indicated that changes in distance metric have dramatic effects on clustering dendrograms

(Jackson et al. 1989; Jackson 1993) and, similarly, choice of clustering algorithm (Legendre and

Legendre 1998) can alter biological conclusions.

The role of methodological choices for altering functional diversity (FD; Petchey and Gaston

2002) has only been recently debated (Petchey and Gaston 2006, 2007; Podani and Schmera

2006, 2007; Poos et al. 2009). Functional diversity is an important component of conservation-

based models as researchers are often interested in the amount of functional variation with an

ecosystem (Loreau et al. 2001). Comparisons of the amount of functional variation that is lost

when species have been removed from an ecosystem has demonstrated that rare species play an

important role for maintaining ecosystem function (Fonseca and Ganade 2001; Petchey and

Gaston 2002b; Schmera et al. 2009). In Chapter 2, the importance of methodological choices on

estimating functional diversity is examined. In particular, Chapter 2 highlights whether how

functional diversity is measured can alter ecological insight.

Bioassessments

Similar to functional diversity, there has been a dramatic increase in the amount of

bioassessment literature in the past few decades (Bailey et al. 2004). Bioassessments have been

used by managers for decades to evaluate communities undergoing anthropogenic impact and

species decline (Green 1979; Barbour et al. 1999). Debates regarding the impact of

methodological choices on the assessment of biological communities (i.e. bioassessment) are

varied and include criticisms of multi-metric approaches (Hannaford and Resh 1995; Wallace et

al. 1996; Bowman and Somers 2006); the development of multivariate and predictive models

(Bailey et al. 2004), taxonomic resolution (Somers et al. 1998; Hewlett 2000), improvements to

rapid methods (Hannaford and Resh 1995; Somers et al. 1998), redundancy in metrics (Barbour

et al. 1992), and improvements to the study design of reference conditions (Norris and Thoms

1999; Bowman and Somers 2005; Bailey et al. 2009).

One ongoing debate in bioassessments is the importance of rare species (Cao et al. 1998; 2001;

Marchant 1999; Marchant 2002). Methods for identifying trends in multivariate analyses which

6

include rare species are often limited because they typically represent points of unusually high

leverage in the analysis. When rare species (i.e. species that occur infrequently) are included in

multivariate analyses, they often alter the analysis because, relative to more common species,

they over-fit an association of relatively few occurrences to a specific habitat type that, in turn,

unduly influences species-habitat differences (Legendre and Gallagher 2001). To compensate

researchers either down weight or exclude rare species from multivariate analyses without

consideration of the degree of influence they may have on the underlying biological data (Gauch

1982). Obviously, the benefits of this decision are negated when the species or relationship of

interest relates to a species that is rare, as is the case with many conservation applications.

Imperiled species (i.e. species with a conservation status), like rare species, are scarce and often

found in small numbers in specific habitat types (Mace 1994). In Chapter 3, the impact of

removing rare species is assessed relative to other multivariate methodological choices, such as

the choice of distance measure and multivariate method, which are all common to multivariate

bioassessment approaches.

Metapopulation Viability Models

Trajectories of populations through time are often needed to extrapolate population viability.

Approaches such age or stage based matrix population models require parameterization of

several species life history characteristics (e.g. fecundity, age/size of maturation), which may be

limited or unknown for species with conservation concern. As such, simplifications in how many

life history characteristics are needed to model population trajectories are needed.

Metapopulation viability analyses offer simplification in over traditional population viability

analyses as they only require parameterization of patch occupancy and estimates of rates of

colonization and extinction (Hanski 1999). One popular method of metapopulation viability

analysis is stochastic patch occupancy models (SPOM, Hanksi 1999; Moilanen 1999; 2004).

Stochastic patch occupancy models provide a spatially realistic model for patch dynamics as they

incorporate estimates of patch location and species dispersal into the modeling procedure. For

example, estimates of patch colonization (i.e. “reachability”) are quantified using the distance

between a starting patch and a target patch, and the ability of species to disperse (Hanksi 1994;

Hanski et al. 1996; Heniz et al. 2005). Most often, this measure is estimated by assuming that

colonization potential declines exponentially with distance (i.e. exponential decay; Hanski 1994;

7

Vos et al. 2001; Frank and Wissel 2002). However the ability of how well even simple formulae

are able to model patch dynamics remains an open question (Heinz et al. 2006; Marsh 2008).

Chapter 4 aims to better understand the impact of estimating population viability using direct

versus indirect parameterization of dispersal in metapopulation models.

II) Using Consensus Methods to Reduce Uncertainty from

Methodological Choices

Conservation-based models provide a useful tool for understanding consequences of species loss

(e.g. functional diversity), anthropogenic impacts (e.g. bioassessments) and the predictions for

population viability and habitat suitability. However, the difficulty with using the standard sets

of conservation-based models is that they are often inappropriate when analyzing data limited to

a few sites and/or species (Ellison and Agrawal 2005). This condition of sparse data, in turn,

contributes to many statistical limitations, including zero-inflated bias, increased collinearity

between variables and inflated coefficient of variation (Dixon et al. 2005; Edwards et al, 2006;

Dormann et al. 2008). However, uncertainties may arise during all stages of modeling such as

obtaining species level data obtaining accurate species counts, and linking species to

landscapes/habitats (Burgman et al. 2005; Cunningham and Lindenmayer 2005; Edwards et al.

2005; Elith et al. 2006). Therefore, developing methods that may reduce impacts from

methodological choices may enhance the utility of conservation-based models.

The use of consensus methods has been highlighted as one way to address problems related to

using conservation-based models (Araújo and New 2007; Marrimon et al. 2009). Laplace

(1820) demonstrated that the probability of error will decline as more models are included (i.e.

an ensemble or consensus approach). This old idea has only recently been applied to problems

of uncertainty with the influential work of Bates and Granger (1969) that in part, contributed to

the award of the Nobel Memorial Prize in Economics to Clive Granger in 2003. Recently,

ecologists have applied consensus models to issues with uncertainty in climate-based predictions

(Thuiller 2004; Araújo et al. 2005; Thuiller et al 2005; Buisson and Grenouillet 2009) and, to

some extent, to functional diversity (Mouchet et al. 2008). Yet, the application of consensus

methods seems due to other problems in ecology.

8

The application of consensus methods to resolve issues of methodological choices is in its

infancy. Issues such as how to build the best ensemble model have yet to be quantitatively

evaluated. The second section of the thesis applies consensus models to impacts from

methodological choices including whether consensus methods can improve prediction over

several singular methodological approaches (Chapter 5) and whether consensus methods can

identify instances where methodological choices may be an issue as in the measure of functional

diversity (Chapter 6). In Chapter 5, a quantitative evaluation of consensus models is applied to

the habitat modeling of the endangered fish, Redside Dace (Clinostomus elongatus). Chapter 5

aims to evaluate the ability of consensus models to improve model performance (e.g. model

specificity, sensitivity and overall classification). In Chapter 6, consensus methods are used to

evaluate whether or not methodological choices are an issue for measuring functional diversity.

Scope

The difficulty with developing any model for conservation-based purposes are the linkages

needed across many facets of ecological knowledge, from how species traits relate to the

environment, to the types of environments undergoing impact. These relationships require

knowledge not only of the species, but each system, the suite of species found there, the

relationships between those species and their related habitats, and the relationship between each

habitat and large scale (e.g. landscape) and small scale (e.g. in-stream habitat) ecological

processes (Smith and Powell 1971; Matthews 1998; Jackson and Harvey 1989; Jackson et al.

2001). Therefore, there is no easy starting point (e.g. scale, method or species) for determining

modeling approaches appropriate for conservation-based issues.

The scope for identifying the influence of methodological choices in conservation-based models

is broad by its very nature. As such, this thesis is not restricted to a given study organism, study

system, data series, or methodology. Instead, data is used that best addresses the specific

questions and objectives of the various chapters. For example, in Chapters 2, 3 and 6 data is

from published studies to demonstrate the impact of methodological issues. In Chapter 4,

empirical information was gathered over the course of a one-year field study. In Chapter 5, over

420,000 fish records were gathered from various not-for-profit and government agencies,

universities and other academic institutions and studies.

9

In addition, this thesis examines different spatial and temporal scales for identifying

methodological choices. In Chapters 4 and 5, data was collected on dispersal and habitat

suitability of Redside Dace (Clinostomus elongatus). In Chapter 4, the spatial scale is the

province of Ontario and temporal scale are data points within the last decade. In Chapter 5, the

spatial scale is one watershed and the temporal scale is one year. These spatial and temporal

scales were chosen to highlight the approaches being compared (e.g. predictive models in

Chapter 4, and metapopulation models in Chapter 5). The goal of this thesis is that through the

consistency of the results within this thesis, across study organisms, study systems, data series,

methodologies, and spatial and temporal scales, it will provide ecological insight into making

methodological decisions in ecology. Each chapter is linked based on the influence of

methodological choices and an overall interest on how methodological choices can alter

conclusions in ecological studies.

Statement of Contribution

Parts of this thesis would not have been possible without the contribution of many collaborators.

Chapters 2 and 6 were completed in collaboration with Steve Walker, who helped in the

derivation of Equation 1 (Appendix 2.1), the development of ideas, and in writing code for

assessing the robustness of the functional diversity measure. Chapter 4 was completed with the

help of staff from the Toronto Region Conservation Authority, which provided valuable field and

logistical assistance. Chapter 5 was completed with the help of various governmental, non-

governmental and not-for profit agencies who contributed data.

Publication of Thesis Material

Chapter 2 has been published and can be found as:

Poos, M.S., S.C. Walker and D.A Jackson. 2009. Functional diversity indices can be

driven by methodological choices and species richness. Ecology 90(2): 341-347. (doi:

10.1890/08-1638.1)

Chapters 3 and 5 are currently submitted and undergoing peer review. All published material is

provided under approved copyright from the related journal.

10

References

Araújo, M. B., and A. Guisán. 2006. Five (or so) challenges for species distribution modelling.

Journal of Biogeography 33:1677-1688.

Araújo, M. B., and M. New. 2007. Ensemble forecasting of species distributions. Trends in

Ecology and Evolution 22:42-47.

Araújo, M. B., R. J. Whittaker, R. J. Ladle, and M. Erhard. 2005. Reducing uncertainty in

projections of extinction risk from climate change. Global Ecology and Biogeography

14:529-538.

Bailey, R. C., R. H. Norris, and T. B. Reynoldson. 2004. Bioassessment of freshwater

ecosystems: Using the reference condition approach. Springer, New York.

Bailey, R. C., M. G. Kennedy, M. Z. Dervish, and R. M. Taylor. 2008. Biological assessment of

freshwater ecosystems using a reference condition approach: comparing predicted and

actual benthic invertebrate communities in Yukon streams. Freshwater Biology 39:765-

774.

Barbour, M.T., J. Gerritsen, B.D. Snyder, and J.B. Stribling. 1999. Rapid bioassessment

protocols for use in streams and wadeable rivers: Periphyton, benthic macroinvertebrates

and fish. 2nd edition. EPA-841-B-99-002. Environmental Protection Agency.

Bates, J. M., and C. W. Granger. 1969. The combination of forecasts. Operations Research

Quarterly 20:451-468.

Bowman, M. F., and K. M. Somers. 2005. Considerations when using the reference condition

approach for bioassessment of freshwater ecosystems. Water Quality Research Journal of

Canada 40:347-360.

Bowman, M. F., and K. M. Somers. 2006. Evaluating a novel Test Site Analysis (TSA)

bioassessment approach. Journal of the North American Benthological Society 25:712-

727.

11

Burgman, M., D. B. Lindenmayer, and J. Eltih. 2005. Managing landscapes for conservation

under uncertainty. Ecology 86: 2007-2017.

Buisson, L., and G. Grenouillet. 2009. Contrasted impacts of climate change on stream fish

assemblages along an environmental gradient. Diversity and Distributions 15:613-626.

Cao, Y., D.D. Williams, and N.E. Williams. 1998. How important are rare species in community

ecology and bioassessment? Limnology and Oceanography 43: 1403–1409.

Cao, Y., D.P. Larsen, and R.S. Thorne. 2001. Rare species in multivariate analysis for

bioassessment: some consideration. Journal of the North American Benthological Society

20: 144–153.

Cunningham, R.B., and D.B. Lindenmayer. 2005. Modeling count data of rare species: Some

statistical issues. Ecology 86: 1135-1142.

Dixon, P.M., A.M. Ellison, and N.J. Gotelli. 2005. Improving the precision of estimates of the

frequency of rare events. Ecology 86: 1114-1123.

Dormann, C. F., O. Purschke, J. R. G. Márquez, S. Lautenbach, and B. Schröder. 2008.

Components of uncertainty in species distribution analysis: A case study of the great grey

shrike. Ecology 89:3371-3386.

Edwards, T.C., D.R. Cutler, N.E. Zimmerman, L. Geiser, and J. Alegriae. 2005. Model-based

stratifications for enhancing the detection of rare ecological events. Ecology 86: 1081-

1090.

Ehlrich, P.R. and E.O. Wilson. 1991. Biodiversity studies: Science and policy. Science 253: 758-

763.

Elith, J., C. H. Graham, R. P. Anderson, M. Dudík, S. Ferrier, A. Guisan, R. J. Hijmans, F.

Huettmann, J. R. Leathwick, A. Lehmann, J. Li, L. G. Lohmann, B. A. Loiselle, G.

Manion, C. Moritz, M. Nakamura, Y. Nakazawa, J. M. Overton, A. T. Peterson, S. J.

Phillips, K. Richardson, R. Scachetti-Pereira, R. E. Schapire, J. Soberón, S. Williams, M.

S. Wisz, and N. E. Zimmermann. 2006. Novel methods improve prediction of species’

distributions from occurrence data. Ecography 29:129-151.

12

Ellison, A.M., and A.A. Agrawal. 2005. The statistics of rarity. Ecology 86: 1079-1080.

Frank, K., and C. Wissel. 2002. A formula for the mean lifetime of metapopulations in

heterogeneous landscapes. The American Naturalist 159:530-552.

Fonseca, C. R., and G. Ganade. 2001. Species functional redundancy, random extinctions and the

stability of ecosystems. Ecology 89:118-125.

Gauch, H. G. 1982. Multivariate analysis in community ecology. Cambridge University Press,

London, England.

Green, R.H. and R.C. Young. 1993. Sampling to detect rare species. Ecological Application 3:

351-366.

Grossman, G. D., and V. Devlaming. 1984. Reproductive ecology of female Oligocottus snyderi

greeley - A North American intertidal sculpin. Journal of Fish Biology 25:231-240.

Grossman, G. D., D. M. Nickerson, and M. C. Freeman. 1991. Principal component analyses of

assemblage structure data - Utility of tests based on eigenvalues. Ecology 72:341-347.

Grossman, G. D., P. A. Rincon, M. D. Farr, and R. E. Ratajczak. 2002. A new optimal foraging

model predicts habitat use by drift-feeding stream minnows. Ecology of Freshwater Fish

11:2-10.

Grossman, G. D., R. E. Ratajczak, J. T. Petty, M. D. Hunter, J. T. Peterson, and G. Grenouillet.

2006. Population dynamics of mottled sculpin (Pisces) in a variable environment:

information theoretic approaches. Ecological Monographs 76:217-234.

Guisan, A., and W. Thuiller. 2005. Predicting species distribution: offering more than simple

habitat models. Ecology Letters 8: 993-1009.

Guisan, A., and N. E. Zimmermann. 2000. Predictive habitat distribution models in ecology.

Ecological Modelling 135:147-186.

13

Hannaford, M. J., and V. H. Resh. 1995. Variability in macroinvertebrate rapid-bioassessment

surveys and habitat assessments in a Northern California stream. Journal of the North

American Benthological Society 14:430-439.

Hanski, I. 1999. Metapopulation ecology. Oxford University Press, New York, New York.

Hanski, I. 1994. A practical model of metapopulation dynamics. The Journal of Animal Ecology

63:151-162.

Hanski, I., A. Moilanen, T. Pakkala, and M. Kuussaari. 1996. The quantitative incidence

function model and persistence of an endangered butterfly metapopulation. Conservation

Biology 10:578-590.

Heinz, S. K., L. Conradt, C. Wissel, and K. Frank. 2005. Dispersal in fragmented landscapes:

Deriving a practical formula for patch accessibility. Landscape Ecology 20:83-99.

Heinz, S. K., C. Wissel, and K. Frank. 2006. The viability of metapopulations: individual

dispersal behaviour matters. Landscape Ecology 21:77-89.

Hewlett, R. 2000. Implications of taxonomic resolution and sample habitat for stream

classification at a broad geographic scale. Journal of the North American Benthological

Society 19:352-361.

Hilborn, R. and M. Mangel. 1997. The ecological detective: confronting models with data.

Princeton University Press, Princeton, New Jersey, 330 pp.

Hooper, D. U., F. S. Chapin, J. J. Ewel, A. Hector, P. Inchausti, S. Lavorel, J. H. Lawton, D. M.

Lodge, M. Loreau, S. Naeem, B. Schmid, H. Setala, A. J. Symstad, J. Vandermeer, and

D. A. Wardle. 2005. Effects of biodiversity on ecosystem functioning: A consensus of

current knowledge. Ecological Monographs 75:3-35.

Jackson, D.A. 1993. Multivariate analysis of benthic invertebrate communities: The implication

of choosing particular data standardizations, measures of association, and ordination

methods. Canadian Journal of Fisheries and Aquatic Sciences 50:2641-2651.

14

Jackson, D.A. and H.H. Harvey. 1993. Fish and benthic invertebrates: Community concordance

and community-environment relationships.Canadian Journal of Fisheries and Aquatic

Sciences 50: 2641-2651.

Jackson D. A., K.M. Somers, and H.H. Harvey. 1989. Similarity coefficients: measures of co-

occurrence and association or simply measures of occurrence? American Naturalist 133: 436-

453.

Jackson, D.A., P.R. Peres-Neto, and J.D. Olden. 2001. What controls who is where in freshwater

fish communities – the roles of biotic, abiotic, and spatial factors. Canadian Journal of

Fisheries and Aquatic Sciences 58: 157-170.

Jelks, H. L., S. J. Walsh, N. M. Burkhead, S. Contreras-Balderas, E. Diaz-Pardo, D. A.

Hendrickson, J. Lyons, N. E. Mandrak, F. McCormick, J. S. Nelson, S. P. Platania, B. A.

Porter, C. B. Renaud, J. J. Schmitter-Soto, E. B. Taylor, and M. L. Warren. 2008.

Conservation status of imperiled North American freshwater and diadromous fishes

Fisheries 33:372-386.

Krebs, C.J.1998. Ecological Methodology (2nd ed.). Addison-Welsey Educational Publishers

Inc., New York, New York.

Laplace, P. S. 1820. Théorie analytique des probabilités. Courcier, Paris.

Larsen, T. H., N. M. Williams, and C. Kremen. 2005. Extinction order and altered community

structure rapidly disrupt ecosystem functioning. Ecology Letters 8:538-547.

Legendre, P. and E.D. Gallagher. 2001. Ecologically meaningful transformations for ordination

of species data. Oecologia 129: 271-280

Legendre, P. and L. Legendre. 1998. Numerical Ecology (2nd edition). London, Elsevier.

Loiselle, B. A., C. A. Howell, C. H. Graham, J. M. Goerck, T. Brooks, K. G. Smith, and P. H.

Williams. 2003. Avoiding pitfalls of using species distribution models in conservation

planning. Conservation Biology 17:1591-1600.

15

Loreau, M., S. Naeem, P. Inchausti, J. Bengtsson, J. P. Grime, A. Hector, D. U. Hooper, M. A.

Huston, D. Raffaelli, B. Schmid, D. Tilman, and D. A. Wardle. 2001. Biodiversity and

ecosystem functioning: current knowledge and future challenges Science 294: 804-808.

Mace, G.M. 1994. Classifying threatened species: means and ends. Philosophical Proceedings of

the Royal Society of London. Series B. 344: 91-97.

Matthews, W. J., E. Surat, and L. G. Hill. 1982. Heat death of the orangethroat darter

Etheostoma spectabile (Percidae) in a natural environment. Southwestern Naturalist

27:216-217.

Matthews, W. J., M. E. Power, and A. J. Stewart. 1986. Depth distribution of Campostoma

grazing scars in an Ozark stream. Environmental Biology of Fishes 17:291-297.

Matthews, W. J., F. P. Gelwick, and J. J. Hoover. 1992. Food and habitat use by juveniles of

species of Micropterus and Morone in a southwestern reservoir. Transactions of the

American Fisheries Society 121:54-66.

Matthews, W. J. 1998. Patterns in freshwater fish ecology. Chapman and Hall, New York, New

York.

Marchant, R. 1999. How important are rare species in aquatic ecology and bioassessment? A

comment on the conclusions of Cao et al. Limnology and Oceanography 44: 1840–1841.

Marchant, R. 2002. Do rare species have any place in multivariate analysis for bioassessment?

Journal of the North American Benthological Society 21: 311-313

Marmion, M., M. Parviainen, M. Luoto, R. K. Heikkinen, and W. Thuiller. 2009. Evaluation of

consensus methods in predictive species distribution modelling. Diversity and

Distributions 15: 59-69.

Marsh, D. 2008. Metapopulation viability analysis for amphibians. Animal Conservation 11:463-

465.

Moilanen, A. 1999. Patch occupancy models of metapopulation dynamics: Efficient parameter

estimation using implicit statistical inference. Ecology 80:1031-1043.

16

Moilanen, A. 2004. SPOMSIM: software for stochastic patch occupancy models of

metapopulation dynamics. Ecological Modelling 179:533-550.

Mouchet, M., F. Guilhaumon, S. Villéger, N. W. H. Mason, J.-A. Tomasini, and D. Mouillot.

2008. Towards a consensus for calculating dendrogram-based functional diversity indices

Oikos 117:794-800.

Norris, R. H., and M. C. Thoms. 1999. What is river health? Freshwater Biology 41:197-209.

Olden, J. D., and D.A. Jackson. 2001. Fish habitat relationships in lakes: gaining predictive and

explanatory insight by using artificial neural networks. Transactions of the American

Fisheries Society 130:878-897.

Petchey, O. L. and K. J. Gaston. 2002. Functional diversity (FD), species richness and

community composition. Ecology Letters 5: 402-411.

Petchey, O. L., and K. J. Gaston. 2002b. Functional diversity (FD), species richness and

community composition. Ecology Letters 5:402-411.

Petchey, O. L., and K. J. Gaston. 2006. Functional diversity: back to basics and looking forward.

Ecology Letters 9:741-758.

Petchey, O. L., and K. J. Gaston. 2007. Dendrograms and measuring functional diversity. Oikos

116:1422-1426.

Podani, J. and D. Schmera. 2006. On dendrogram-based measures of functional diversity. Oikos

115: 179-185.

Podani, J., and D. Schmera. 2007. How should a dendrogram-based measure of functional

diversity function? A rejoinder to Petchey and Gaston. Oikos 116:1427-1430.

Quinn, G.P.and M.J. Keough 2002. Experimental design and data analysis for biologists.

Cambridge University Press: London, 537 pp.

17

Rabinowitz, D., S. Cairns and T. Dillon. 1986. Seven forms of rarity, and their frequency in the

flora of the British Isles. In Conservation biology: the science of scarcity, and diversity (ed.

M.E. Soule), pp. 184-204. Sunderland, Massachusetts, Sinauer Associates.

Ricciardi, A. and J.B. Rasmussen. 1999. Extinction rates of North American freshwater fauna.

Conservation Biology 13: 1220-1222.

Schmera, D., J. Podani, and T. Eros. 2009. Measuring the contribution of community members to

functional diversity. Oikos 118: 961-971.

Sharma, S. and D.A. Jackson. 2008. Predicting smallmouth bass incidence across North

America: Comparison of statistical approaches. Canadian Journal of Fisheries and

Aquatic Sciences 65: 471-481.

Skyfield, J. P., and G. D. Grossman. 2008. Microhabitat use, movements and abundance of gilt

darters (Percina evides) in southern Appalachian (USA) streams. Ecology of Freshwater

Fish 17:219-230.

Somers, K. M., R. A. Reid, and S. M. David. 1998. Rapid biological assessments: how many

animals are enough? Journal of the North American Benthological Society 17:348-358.

Srivastava, D. S., and M. Vellend. 2005. Biodiversity-ecosystem function research: Is it relevant

to conservation? Annual Review of Ecology, Evolution, and Systematics 36: 267-294.

Thuiller, W. 2004. Patterns and uncertainties of species’ range shifts under climate change.

Global Change Biology 10: 2020-2027.

Thuiller, W., S. Lavorel, M. B. Araújo, M. T. Sykes, and I. C. Prentice. 2005. Climate change

threats to plant diversity in Europe. Proceedings of the National Academy of Sciences

102:8245-8250.

Wallace, J. B., J. W. Grubaugh, and M. R. Whiles. 1996. Biotic indices and stream ecosystem

processes: results from an experimental study. Ecological Applications 6:140-151.

Vos, C. C., J. Verboom, P. F. M. Opdam, and C. J. F. ter Braak. 2001. Toward ecologically

scaled landscape indices. American Naturalist 157: 24-41.

18

Section I:

The importance of methodological choices in ecological models

19

Chapter 2: The importance of methodological choices in influencing the measure of functional diversity across ecological communities

Abstract Functional diversity is an important concept in community ecology because it captures

information on functional traits absent in measures of species diversity. One popular method of

measuring functional diversity is the dendrogram-based method, FD. To calculate FD, a variety

of methodological choices are required and it has been debated about whether biological

conclusions are sensitive to such choices. We studied the probability that conclusions regarding

FD were sensitive, and that patterns in sensitivity were related to alpha and beta components of

species richness. We developed a randomization procedure which iteratively calculated FD by

assigning species into two assemblages and calculating the probability that the community with

higher FD varied across methods. We found evidence of sensitivity in all five communities we

examined, ranging from a probability of sensitivity of 0 (no sensitivity) to 0.976 (almost

completely sensitive). Variations in these probabilities were driven by differences in alpha

diversity between assemblages and not by beta diversity. Importantly, FD was most sensitive

when it was most useful (i.e. when differences in alpha diversity were low). We demonstrate

that trends in functional diversity analyses can be largely driven by methodological choices or

species richness, rather than functional trait information alone.

Keywords: multivariate statistics, functional diversity, community ecology, species richness,

biodiversity, ecological organization, dendrogram.

20

Introduction

Functional diversity is the amount of inter-specific variation in functional traits in an ecological

community. The concept of functional diversity has received considerable attention recently,

largely because of the following intuitive argument. Species diversity indices treat all species

identically, whereas functional diversity indices do not. Therefore, it is reasonable to expect that

functional diversity is likely to be more ecologically relevant because species differ from one

another in functionally important ways (Petchey and Gaston 2002). For example, several studies

have concluded that measures of ecosystem function tend to correlate more strongly with

functional diversity indices than with species diversity indices (Loreau et al. 2001). These

studies have spurred continued interest in developing new and improved functional diversity

indices (Mouchet et al. 2008; Villeger et al. 2008).

Despite the conceptual simplicity of functional diversity, ecologists wishing to measure it must

choose from a number of approaches. Mason (2005) developed a typology of functional

diversity indices with three types: functional richness; functional evenness; and, functional

divergence. This typology is similar in spirit to the distinction between species richness and

evenness in species diversity studies. For example, functional richness indices measure the

amount of trait space filled by the species in a community, whereas functional evenness indices

measure the evenness in the distribution of abundance in trait space (Mason 2005; Villeger et al.

2008). Using rarefaction techniques, functional richness and evenness can also be thought of as

extremes along a gradient of functional diversity indices (Walker et al. 2008). Rarefaction also

makes clear the close relationship between species and functional richness. It is therefore

important to ensure that accepted indices of functional richness provide information beyond that

of species richness, as data on functional traits can be costly to obtain.

One approach to measuring functional richness, which has shown promise as a predictor of

ecosystem function, is the dendrogram-based approach known as FD (Petchey and Gaston 2002).

This approach consists of measuring functional richness as the total branch-length of a

dendrogram that clusters species based on the similarity of their functional-trait characteristics.

There are numerous methods for constructing a dendrogram; in particular, both a resemblance

measure, which measures the difference between species in their functional-trait characteristics,

21

and a clustering algorithm, which specifies the manner in which similar species are grouped

together, must be chosen. There is the possibility that ecological conclusions drawn from an

analysis of dendrogram-based functional diversity may be sensitive to the methodological

choices that are required for producing a dendrogram. This may be a serious issue given that

dendrogram topology may change considerably with changes in the methods used (e.g. Sneath

and Sokal 1973, Jackson et al. 1989).

There has been considerable recent debate about the importance of the method of dendrogram

construction for the measurement of dendrogram-based functional diversity (Petchey and Gaston

2006, 2007, Podani and Schmera 2006, 2007; Mouchet et al. 2008). To facilitate resolution, we

conducted a detailed analysis of the sensitivity of dendrogram-based functional diversity

measures to differences in species richness and methodological choices. For this resolution, we

need a quantitative definition of sensitivity. Given a pair of species assemblages and set of

dendrogram construction methods, we make the following definition: conclusions are insensitive

if all construction methods result in the same assemblage being identified as having higher

functional diversity. Conclusions are sensitive if at least one construction method identifies a

different assemblage as having higher functional diversity.

With this definition, we seek answers to the following questions. First, through a systematic

study of previously analyzed data from ecological communities (Petchey and Gaston 2002,

Podani and Schmera 2006, Petchey and Gaston 2007), what is the probability that conclusions

regarding FD are sensitive to methodological choices? Second, if sensitivity is found to be likely

in many communities analyzed, is the probability of sensitivity related to the difference in local

species richness (i.e. alpha diversity) between the two assemblages? We hypothesize that the

probability of sensitivity should be low when differences in alpha diversity are very high.

Intuitively, we expect species richness to drive functional richness patterns in these cases, no

matter how it is measured. This is a null hypothesis; it assumes that functional richness (as

measured by FD) does not provide information beyond that provided by species richness.

Failure to reject this hypothesis would suggest that FD and alpha diversity are largely redundant,

provided that the most species rich assemblage also tends to have the highest FD. Third, is the

probability of sensitivity related to the amount of species turnover (i.e. beta diversity) between

assemblages? As species turnover can be measured in numerous ways, hereafter we use the

term beta diversity to refer to Lande’s species turnover (Lande 1996). We hypothesize that the

22

probability of sensitivity should be high when beta diversity is high. High beta diversity will

tend to lead to lower redundancy across traits between assemblages, in comparison to low beta

diversity. Therefore, high beta diversity produces conditions under which we intuitively expect

small differences in functional diversity. Small differences will presumably be more sensitive to

methodological choices. Fourth, is the probability of sensitivity related to certain types of

methodological choices? We hypothesize that conclusions will be more sensitive to the choice

of distance measure than to the choice of clustering algorithm, because the distance measure can

completely change the order of functional similarity amongst the species whereas the clustering

algorithm is more limited in that it can only alter how groups of species relate to one another in

multivariate space. We note that there are reasons to believe that FD will also be quite sensitive

to the choice of a clustering algorithm. Indeed, different clustering algorithms can generate quite

different tree topologies, which may translate into FD sensitivity. We address these questions by

assessing the probability of sensitivity of pairs of randomly drawn sub-assemblages from five

ecological communities.

Methods All of these analyses were based on data from ecological communities obtained from the

literature. We used the same five datasets used in previous studies of FD (Petchey and Gaston

2002, 2007, Podani and Schmera 2006). These datasets represent variation in the number (from

13 to 37) and type of species, and the number and type of functional traits (from 6 to 27). For

example, the three vertebrate datasets use characteristics ranging from foraging behavior to the

consumption of prey species as their functional traits (Holmes et al. 1979, Jaksic and Medel

1990, Munoz and Ojeda 1997), whereas the remaining two datasets rely on vegetative

characteristics, such as rooting depth and herbivore palatability, of the plants being studied

(Golluscio and Sala 1993, Chapin et al. 1996).

The general approach to assessing the sensitivity of FD to methodological choices was as follows

(see Fig. 2.1 for an example). For each community (i.e. dataset), we organized all of the species,

γ, into two groups, hereafter referred to as assemblages. Let the average species richness over

the two assemblages be α . Each species in the community was included in either one of the

assemblages or in both. For a given level of beta diversity,β = γ −α , and difference in alpha

diversity between the assemblages, Δα, the total number of unique pairs of assemblages is

23

. (1)

The numerator is the total number of ways that one can order γ species. The three factorials in

the denominator are, respectively, the total number of ways that (i) the number of shared species

can be ordered, (ii) the species that are unique to assemblage 1 can be ordered, (iii) the species

that are unique to assemblage 2 can be ordered (see Appendix 2.1). In Fig. 2.1, we give two

examples of such orderings when γ = 11, Δα = 1 and β = 4.5. Note however, that assemblage

pairs for which (2β – Δα) is an odd number are not possible given the inter-dependencies of these

parameters. For each possible combination of β and Δα, we randomly selected 1000 pairs of

assemblages using code programmed in MATLAB version 7.1 (see Appendices 2.1 – 2.4). For

each of these randomly selected assemblages, we calculated FD based on several different

dendrogram construction methods. FD was considered insensitive to methodological choices for

a particular pair of assemblages if the assemblage with the higher FD was the same for all

construction methods; FD was otherwise considered sensitive. We then calculated the

proportion of the 1000 random iterations that were sensitive. We refer to this proportion as the

probability of sensitivity. When the probability of sensitivity is high for a particular combination

of Δα and β, it is very likely that the conclusions drawn from an FD analysis in this context will

be dependent on methodological choices, rather than on the data alone.

In order to calculate FD, two methodological choices must be made. First, a distance (or

resemblance) measure must be chosen. Distance measures quantify the difference between two

entities based on their characteristics (e.g. species based on their functional traits). There are a

large number of resemblance measures from which to choose (Jackson et al. 1989; Legendre and

Legendre 1998). We used three distance measures: Euclidean distance as suggested by Holmes

et al. (1979); cosine distance; and, Gower’s distance as it allows mixed and missing data types

24

Figure 2.1 –Measuring the sensitivity of FD in a hypothetical eleven species community. The procedure consists of randomly dividing the community into two assemblages, noting how FD orders the two assemblages and assessing how sensitivity effects this order (i.e. did assemblage 1 always have higher FD given differences in methodology?). Each species is represented as a letter and the assemblages are represented as overlapping rectangles that contain the letters associated with their component species. The first set of rectangles represents one random division of the community into two assemblages. A new random division can be obtained by leaving the overlapping rectangles fixed and simply permuting the order of the species. The second set of overlapping rectangles gives an example of such a permutation.

(Gower 1971; Podani 1999; Podani and Schmera 2006, 2007). For Euclidean distance, we

standardized all trait matrices so that all traits have a mean = 0 and variance =1 (i.e. z-scores;

Holmes et al. 1979, Gaston and Petchey 2002). We used cosine distance because it more

accurately reflects proportional changes in traits whereas the Euclidean distance emphasizes

absolute differences. For the Patagonian forb and Arctic vegetation datasets we used only

Gower’s distance because these datasets contained missing values and mixed data types; the

Euclidean and cosine distances were not appropriate for such datasets (e.g. Podani and Schmera

2006). Second, a clustering algorithm must be chosen. We used three clustering algorithms in

this analysis: 1) unweighted pair group method with arithmetic mean (UPGMA); 2) single

linkage (i.e. nearest neighbor); and, 3) complete linkage (i.e. maximum or farthest neighbor).

These algorithms represent natural endpoints across a methodological continuum of dendrogram

construction methods, where single linkage lies on one end, complete linkage on the other and

UPGMA lies somewhere in the middle (Podani and Schmera 2006).

We considered several different collections of construction methods because the sensitivity of

FD is defined in terms of a particular set of construction methods. For cases where multiple

comparisons could be made (e.g. several distance measures), we calculated four separate

25

probabilities of sensitivity: 1) sensitivity with respect to all nine construction methods; 2)

sensitivity with respect to the three distance measures with UPGMA clustering (i.e. clustering

algorithm is held constant); and, 3) sensitivity with respect to the three clustering algorithms with

Gower’s distance measure (i.e. distance measure is held constant). In cases where data were

deficient and only Gower’s distance could be used, only overall probabilities of sensitivity were

calculated. The sensitivity when the clustering algorithm is held constant could be calculated,

but these results would be identical to the overall values. Finally, we also calculated

probabilities of sensitivity holding other distance measures and clustering algorithms constant

and consider pairs of assemblages that do not contain all of the species in the complete datasets.

However, we do not present these additional results because they do not alter any of the

conclusions.

There exists an ongoing debate regarding a standard procedure for calculating FD. Petchey and

Gaston (2002) based their measure of FD on a dendrogram derived from a dataset that included

all species that were of interest (i.e. the entire community). For an assemblage that does not

contain all of the species in the entire community, FD is measured as the total branch length of

the dendrogram minus the branch lengths of the species that are not included in the assemblage

(see Petchey and Gaston 2002, 2007 for more details). We refer to this approach as the Petchey-

Gaston (PG) method. Alternatively, Podani and Schmera (2006) suggested that FD should be

calculated as the total branch length of a dendrogram that is unique to each assemblage, i.e.

recalculated from the reduced dataset. We refer to this measure as the PS (Podani-Schmera)

method. As this debate remains unresolved, we tested whether FD was sensitive using both

methods. To calculate FD using the PG method, we calculated a species-by-branch matrix and a

vector of branch lengths for the complete community using the code of Petchey and Gaston

(2002) for the R programming language. We then used this code to calculate FD using the PG

method for each assemblage (see Petchey and Gaston 2002 for more details). We repeated this

approach for each of the nine construction methods (i.e. three distance measures for each of the

three clustering algorthms). To calculate dendrograms using the PS method, we calculated

unique dendrograms for all assemblages and construction methods. We used MatLAB (v.7.1) to

calculate the sum of dendrogram lengths for each assemblage and construction method.

To display all of these results, we constructed image plots with the R programming language.

Image plots can be used to show how a variable changes over a two-dimensional grid. The

26

shading of each square on the grid represents the value of the variable at that grid location. In

this case, the variable of interest is the probability of sensitivity and the grid is defined by beta

diversity, β, and the difference in alpha diversity, Δα, between the two assemblages. However,

only certain combinations of β and Δα are possible. For example, for an eleven species

community, it is not possible to create two assemblages such that β = 6, Δα = 3 and all of the

species are in at least one of the two assemblages. Therefore, for identification purposes, these

impossible grid locations are plotted in white whereas all other levels of sensitivity are some

shade of grey. Higher levels of sensitivity are represented by darker shades of grey. This results

in a checkerboard pattern. However, it is important to keep in mind that the checkerboard pattern

is solely an artifact of the impossibility of certain combinations of β and Δα.

Results

We identified numerous cases for which FD had a high probability of sensitivity across all

communities; that is, it is easy to find cases for which conclusions derived from FD analyses will

be driven primarily by methodological choices. In the worst-case scenario, FD sensitivity

reached probabilities of 0.976 using the PS method and 0.594 using the PG method (Table 1).

Variation in the probabilities of sensitivity was largely driven by variation in alpha diversity,

with the highest probabilities of sensitivity found when assemblages were similar in alpha

diversity (Figures 2.2 & 2.3). In every case where the probability of sensitivity was zero, FD

was larger for the assemblage with more species; this result indicates that FD and alpha diversity

lead to identical conclusions about the diversity of assemblages in these cases. Therefore, the

hypothesis presented here concerning the relationship between alpha diversity and probability of

sensitivity is consistent with these results. Contrary to hypotheses presented here, there were no

consistent patterns in the relationship between beta diversity and probability of sensitivity

(Figures 2.2 & 2.3).

Decisions about distance measures were more important than decisions about clustering

algorithms. For example, when UPGMA clustering was kept constant and only distance

measures were compared, FD was more sensitive than when Gower’s distance was held constant

and clustering methods were compared (Figures 2.2 & 2.3). These results were not altered by

the distance measure held constant (e.g. Euclidean, cosine or Gower’s) or by the clustering

27

algorithm held constant (e.g. UPGMA, single linkage and complete linkage), and so we only

present the results for holding constant Gower’s distance and UPGMA respectively (Figure 2.2).

D

iffer

ence

in a

lpha

div

ersi

ty

0.0

0.2

0.4

0.6

0.8

1.0

Beta diversity

Overall PS Overall PG Gower PS UPGMA PS

Probability of Sensitivity

A)

B)

C)

15

5

0

10

8

4

0

6

6

2

0

4

2

118.563.50.5118.563.50.5118.563.50.5118.563.50.5

6.54.52.50.56.54.52.50.56.54.52.50.56.54.52.50.5

5.53.51.55.53.51.5 5.53.51.55.53.51.5

Figure 2.2 - The effect of alpha and beta diversity on the probability of FD sensitivity for three communities crossed with four sets of construction methods. Darker shading represents a higher probability of sensitivity. The communities are A) Insectivorous birds (Holmes et al. 1979), B) Intertidal fish (Munoz and Ojeda 1997) and C) Predatory vertebrates (Jaksic and Medel 1990). Each column is for a different set of construction methods. For the first and second columns, overall PS and PG, all nine methods of dendrogram construction (three distance measures times three clustering algorithms) were used with the PS and PG methods respectively. For the third column, all three clustering algorithms were used with Gower’s distance and the PS method. For the fourth column, all three distance measures were used with UPGMA and the PS method.

28

Diff

eren

ce in

alp

ha d

iver

sity

0.0

0.2

0.4

0.6

0.8

1.0

Beta diversity

Probability of Sensitivity

D)

E)

20

0

10

30

20

016

Overall PS Overall PG

10

2 7 122 7 12

1161161161

Figure 2.3 –The effect of alpha and beta diversity on the probability of FD sensitivity for two communities crossed with two methods of FD calculation. Darker shading represents a higher probability of sensitivity. The communities are D) Patagonian forbs (Golluscio and Sala 1993) and E) Artic vegetation (Chapin et al. 1996). For these communities, only Gower’s distance could be calculated and so only three construction methods could be compared, corresponding to the three clustering algorithms. Each column is for a different method of FD calculation. The first is for the PS method and the second is for the PG method.

29

There are some additional trends worth mentioning. The PG method of FD calculation led to

lower probabilities of sensitivity than the PS method in all cases (Figures 2.2 & 2.3, Table 2.1).

Also, where greater numbers of dendrogram construction methods are compared, the

probabilities of sensitivity increase. For example, compare the overall probabilities (nine

construction methods) with the probabilities obtained by holding the clustering method at

UPGMA (three construction methods) (Figure 2.2). This difference makes intuitive sense

because as one considers more construction methods, it becomes more likely to find a method

that leads to different conclusions regarding the ranking of the assemblages in terms of FD. Table 2.1 – The maximum probability of FD sensitivity for five communities previously used to examine FD (Petchey and Gaston 2007; Podani and Schmera 2006). The number of species and the number of assemblage pairs tested are also shown. Data sources are as follows: A) Holmes et al. (1979), B) Munoz and Ojeda (1997), C) Jaksic and Medel (1990), D) Golluscio and Sala (1993), and E) Chapin et al. (1996).

Community No.

species

Maximum

Probability of

Sensitivity: PS

Maximum

Probability of

Sensitivity: PG

No. of

Assemblage

Pairs Tested

A) Insectivorous birds 22 0.818 0.497 134

B) Intertidal fish 13 0.976 0.366 46

C) Predatory vertebrates 11 0.610 0.594 32

D) Patagonian forbs 24 0.364 0.196 159

E) Arctic vegetation 37 0.244 0.142 370

Discussion These results demonstrate that FD is sensitive to choices of distance measure and clustering

algorithm in many cases. The major factor contributing to a high probability of sensitivity is low

variation in alpha diversity between the assemblages being compared. By contrast, beta diversity

between assemblages was a very poor predictor of sensitivity. This did not support the initial

hypothesis that lower beta diversity (i.e higher redundancy between traits across assemblages)

would lead to a higher probability of sensitivity. The consistency and severity of the results

30

suggest that this sensitivity is not likely to be unique to the examples we present. Indeed, we did

not actively search for atypical data to support this position; we merely used the same data that

have been used consistently by investigators when evaluating FD (Petchey and Gaston 2002,

2007, Podani and Schmera 2006).

If our results are so clear, why did others (e.g. Petchey and Gaston 2007) conclude that decisions

regarding methodological choices have only a minor affect on FD, especially given that they

used the same data that we use here? There are two possible reasons for this discrepancy. First,

to evaluate sensitivity, previous studies have shown that FD calculated using Gower’s distance

was strongly collinear with FD calculated using the Euclidean distance across many functional

trait matrices (Petchey and Gaston 2007). However, these trait matrices differed widely in

number of species. In this analysis, we demonstrate that FD becomes more sensitive as variation

in alpha diversity becomes small. Therefore, in the light of this new work, it is not surprising

that others have found low sensitivity to methodological choices; in their case, the results

strongly suggest that variation in FD was being driven largely by differences in alpha diversity,

no matter what methodological choices were made. Second, we compared more distance

measures than previously investigated (Podani and Schmera 2007; Petchey and Gaston 2007).

We feel this is a more appropriate comparison as there are a large number of distance measures

in the multivariate literature deemed to be appropriate. Additionally, when we restricted the

analysis to comparing only Gower’s distance and Euclidean distance (with PG dendrogram

construction and UPGMA held constant), we found that rates of sensitivity remained high when

differences in alpha diversity were low (maximum probability of sensitivity: 0.260 for the bird

data, 0.162 for the fish data, and 0.319 for the mammal data). Thus, FD did not provide much

additional information in this case, beyond that provided by alpha diversity.

The preceding discussion leads to the following important conclusion regarding FD. FD is most

sensitive to methodological choices when it genuinely provides new information beyond that

provided by alpha diversity. This is because conditions under which FD is sensitive coincide

with relatively little variation in alpha diversity between assemblages. Thus in these cases, FD

could potentially provide useful information about the differences between the assemblages and

ecosystem function. Unfortunately it is precisely in these cases, where FD would genuinely be

useful, that it is expected to be highly sensitive to the choice of a distance measure or clustering

algorithm. On the other hand, FD is not sensitive to methodological choices, in those cases when

31

it provides very little information beyond that already provided by species richness (alpha

diversity). This is because, when FD is insensitive, the results show that alpha diversity is

largely redundant with FD no matter what methodological choices are made. Newer approaches

to measuring functional richness (e.g. convex hull volume or consensus dendrograms) have been

proposed that may reduce the subjectivity of multivariate decisions (Cornwell et al. 2006;

Mouchet et al. 2008; Villeger et al. 2008); however, decisions are still required that may alter

results (e.g. trait scaling and transformations or what to include in the consensus). Further

research into understanding these methodological choices will likely enhance the ability to

measure functional richness. Here we wish to raise awareness about the importance of species

richness and methodological choices for calculating functional richness and identify cases for

which sensitivity is likely to be an issue.

What can be done to minimize the impact of sensitivity? One simple approach could be to

analyze data from ecological communities using several different construction methods to ensure

that sensitivity is not an issue. However, if sensitivity is an issue, a decision must be made. The

results suggest potential approaches for reducing the probability of sensitivity. First, we found

that probabilities of sensitivity were systematically lower for the PG method of FD calculation

than for the PS method. Therefore, one might be tempted to recommend the PG method for

general use. There is an important issue with this recommendation however. The PG method

assumes that the entire community is known whereas the PS method does not. In a recent paper

(Walker et al. 2008), they emphasized the importance of assuming that there may be species in

the community that are undiscovered or undetected in the study area when estimating FD from

field data. In some cases, this might not be a problem. For example, Barnett et al. (2007) have

recently published a list of species to be used in studies of FD in zooplankton communities.

However, in the vast majority of cases, there will typically be a high degree of uncertainty about

the composition of the entire community. The PG method does not provide the same estimate as

the PS method for a subset of the community. Given that the PS method provides the correct

dendrogram length for that particular subset, as it is based on a distance matrix constructed from

this subset, such differences between the methods remain a concern. Therefore, even though the

PS method is more sensitive than the PG method, the PS method is recommended for general use

and the PG method when the species list for the entire community is known. Second, we found

that FD is much more sensitive to the choice of a distance measure than to the choice of a

32

clustering algorithm. Therefore, one might be tempted to simply adopt a particular distance

measure as a standard. However, FD is not completely sensitive to the choice of clustering

algorithm (e.g. range in maximum probability of sensitivity across communities: 0.137 to 0.260

for PG method and 0.248 to 0.364 for PS Method). Furthermore, the choice of a distance

measure must be made very carefully. It is unlikely that a single distance measure can be found

that is justifiable in all situations; indeed, the history of multivariate statistics teaches us that

there is no distance measure that can be uniformly recommended in all cases (Sneath and Sokal

1973; Legendre and Legendre 1998).

To calculate functional richness, a method for quantifying inter-specific differences in functional

traits is required. However, the flexibility to use more than one trait is often required to

understand even simple natural systems (Villeger et al. 2008). Unfortunately in these

multivariate situations, complications arise as researchers have to make several key decisions

during data analysis (e.g. choice of a distance measure, clustering algorithm, data

transformations, scaling). Ideally, these decisions should have minimal impact on scientific

conclusions. Here we demonstrate that in the case of the popular index of functional richness,

FD, decisions inherent in multivariate analyses can drastically alter conclusions of functional

diversity and that sensitivity in FD is highest when alpha diversity is low. These results suggest

that in cases where information captured by dendrogram-based functional diversity would be

most useful, it is redundant with alpha diversity.

Acknowledgements

Funding was provided by NSERC Canada and OGS Scholarships to M.S.P and S.C.W., an

NSERC Discovery Grant to D.A.J., and the University of Toronto. We thank D.A.R. Drake, J.

Podani, D. Schmera, O. Petchey, and anonymous reviewers for comments on early drafts of this

paper.

33

References Barnett, A.J., K. Finlay and B.E. Beisner. 2007. Functional diversity of crustacean zooplankton

communities: towards a trait-based classification. Freshwater Biology 52: 796-813.

Chapin, F. S. I., M.S. Bret-Harte, S.E. Hobbie, and Z. Hailan. 1996. Plant functional types as

predictors of transient responses of arctic vegetation to global change. Journal of Vegetation

Science 7: 347-358.

Cornwell, W. K., D. W. Schwilk, and D. D. Ackerly. 2006. A trait-based test for habitat filtering:

convex hull volume. Ecology 87:1465–1471.

Golluscio, R. A. and O. E. Sala. 1993. Plant functional types and ecological strategies in

Patagonian forbs. Journal of Vegetation Science 4: 839-846.

Gower, J. C. 1971. A general coefficient of similarity and some of its properties. Biometrics 27:

857–874.

Holmes, R. T., R. E. J. Bonney and S. W. Pacala. 1979. Guild structure of the Hubbard Brook

bird community: a multivariate approach. Ecology 60: 512-520.

Jackson, D. A., K. M. Somers and H. H. Harvey. 1989. Similarity coefficients: measures of co-

occurrence and association or simply measures of occurrence? American Naturalist 133:436-453.

Jaksic, F. M., and R.G. Medel. 1990. Objective recognition of guilds: testing for statistically

significant species clusters. Oecologia 82: 87-92.

Legendre, P., and L. Legendre. 1998. Numerical Ecology, 2nd ed. Elsevier B.V. Amsterdam, The

Netherlands.

Lande, R. 1996. Statistics and partitioning of species diversity, and similarity among multiple

communities. Oikos 76: 5-13.

Loreau, M., S. Naeem, P. Inchausti, J. Bengtsson, J.P. Grime, A. Hector, D. U. Hooper, M. A.


ecosystem functioning: Current knowledge and future challenges. Science 294: 804-808.

34

Mason, N.W.H., D. Mouillot, W. G. Lee, and J. B. Wilson. 2005. Functional richness, functional

evenness and functional divergence: the primary components of functional diversity. Oikos 111:

112-118.

Mouchet, M., F. Guilhaumon, S. Villeger, N. W. H. Mason, J. A. Tomasini, and D. Mouillot.

2008. Towards a consensus for calculating dendrogram-based functional diversity indices. Oikos

117: 794-800.

Munoz, A. A., and F. P. Ojeda. 1997. Feeding guild structure of a rocky intertidal fish

assemblage in central Chile. Environmental Biology of Fishes 49: 471-479.

Petchey, O. L., and K. J. Gaston. 2002. Functional diversity (FD), species richness and

community composition. Ecology Letters 5:402-411.

Petchey, O. L., and K. J. Gaston. 2006. Functional diversity: back to basics and looking forward.

Ecology Letters 9: 741-758.

Petchey O. L., and K. J. Gaston. 2007. Dendrograms and measuring functional diversity. Oikos

161: 1422-1426.

Petchey, O.L., A. Hector, and K. J. Gaston. 2004. How do different measures of functional

diversity perform. Ecology 85: 847-857.

Podani J., and D. Schmera. 2006. On dendrogram-based measures of functional diversity. Oikos

115: 179-185.

Podani J., and D. Schmera. 2007. How should a dendrogram-based measure of functional

diversity function? A rejoinder to Petchey and Gaston. Oikos 116: 1427-1430.

Sneath, P.H.A., and R. R. Sokal. 1973. Numerical Taxonomy. The principles and practice of

numerical classification. W.H. Freeman and Company, San Francisco, United States.

Walker, S.C, M. S. Poos, and D. A. Jackson. 2008. Functional rarefaction: estimating functional

diversity from field data. Oikos 117: 286-296.

35

Villeger S., N.W.H. Mason, and D. Mouillot. 2008. New multidimensional functional diversity

indices for a multifaceted framework in functional ecology. Ecology 89: 2290-2301.

36

Appendices Appendix 2.1 – Derivation of Equation 1.

Definitions

Let γ be the total number of species in both assemblages.

Let α1 be the number of species in assemblage one.

Let α2 be the number of species in assemblage two.

Let Δα = α1 – α2.

Let <α> be the average of α1 and α2.

Let β = γ – <α>.

Let ν be the number of shared species (i.e., the number of species in both assemblages).

Assumption

All of the γ species are in assemblage 1, assemblage 2, or both.

Main result (Eq. 1)

For a given γ, Δα and β, the total number of unique assemblage pairs is [γ!] / [(γ – 2β)!(0.5(2β +

Δα))!(0.5(2β – Δα))!].

Deriving this result is much easier once there are three simpler results.

Simpler result 1

X! is the total number of ways to order X objects.

Simpler result 2

(γ – 2β) is the number of shared species, ν.

37

Derivation of simpler result 2

Given that all species are in at least one of the two assemblages (see assumption above), it

follows from the inclusion-exclusion principle that γ = α1 + α2 – ν. It must also be that β = γ –

0.5(α1 + α2) from the definitions of β and <α>. Eliminating the alphas and solving for ν leads to

simpler result 2.

Corollary of simpler result 2

2β = α1 + α2 – 2ν is the total number of unshared species (i.e., species that are only in either

assemblage 1 or assemblage 2 but not in both).

Simpler result 3

0.5(2β + Δα) and 0.5(2β – Δα) are the numbers of unshared species in assemblages 1 and 2,

respectively.

Derivation of simpler result 3

It is only necessary to derive the first claim; the second follows immediately after the first. Note

that one can write Δα = (α1 – ν) – (α2 – ν), from the definition of Δα, and 2β = (α1 – ν) + (α2 – ν),

from the corollary of simpler result 2. Eliminating α2 – ν and solving for α1 – ν one can obtain α1

– ν = 0.5 (Δα + 2β). Simpler result 3 follows once it is recognized that α1 – ν is the number of

unshared species in assemblage 1.

Derivation of main result

It follows from simpler result 1 that the numerator in Eq. 1 is the number of ways that γ species

can be ordered. From Fig. 1, one can see that the numerator gives the total number of ways that

iterations from the simulations could occur. However, many of these ways result in identical

assemblage pairs. This is because the order that species are listed in Fig. 1 determines a

particular assemblage pair, only insofar as it determines whether each species is in assemblage 1

only, assemblage 2 only or in both assemblages 1 and 2. Therefore one must divide by the total

number of ways to order the species within each of these three groups. It follows from simpler

results 1 through 3 that the denominator in Eq. 1 gives this number.

38

Appendix 2.2 – MatLAB Code for testing sensitivity of FD % X is the trait matrix (S by T) % S is species richness % T is # of traits % robustness measures % 1. overall % 2. euc % 3. cos % 4. gower % 5. UPGMA % 6. single % 7. complete S = length(X(:,1)); gamma = 10; iters = 10; n_measures = 9; % number of different ways to calculate FD G = zeros(iters,n_measures); output = -999.*ones((gamma+1),(gamma+1),iters,n_measures); % sign matrix nonrobust_probs = -999.*ones((gamma+1),(gamma+1),7);

% use when all dist measures %nonrobust_probs = -999.*ones((gamma+1),(gamma+1),1);

% use when only gowers (remove % and put one on the above) for B = 4:gamma for A = max(B,(gamma-B)):gamma delta = A - B overlap = A + B - gamma unsharedA = A - overlap unsharedB = B - overlap startA = 1; endA = unsharedA; startB = unsharedA + 1; endB = unsharedA + unsharedB; startShare = endB + 1; endShare = startShare + overlap - 1; Alist = zeros(iters,A); Blist = zeros(iters,B); Slist = zeros(iters,S);

39

FDA = zeros(iters,n_measures); FDB = zeros(iters,n_measures); if overlap == 0 for i = 1:iters currlist = randperm(S); Slist(i,:) = currlist; Alist(i,:) = [currlist(startA:endA)]; Blist(i,:) = [currlist(startB:endB)]; FDA(i,1) = FD(X(Alist(i,:),:),'euclidean','average'); FDB(i,1) = FD(X(Blist(i,:),:),'euclidean','average'); FDA(i,2) = FD(X(Alist(i,:),:),'euclidean','single'); FDB(i,2) = FD(X(Blist(i,:),:),'euclidean','single'); FDA(i,3) = FD(X(Alist(i,:),:),'euclidean','complete'); FDB(i,3) = FD(X(Blist(i,:),:),'euclidean','complete'); FDA(i,4) = FD(X(Alist(i,:),:),'cosine','average'); FDB(i,4) = FD(X(Blist(i,:),:),'cosine','average'); FDA(i,5) = FD(X(Alist(i,:),:),'cosine','single'); FDB(i,5) = FD(X(Blist(i,:),:),'cosine','single'); FDA(i,6) = FD(X(Alist(i,:),:),'cosine','complete'); FDB(i,6) = FD(X(Blist(i,:),:),'cosine','complete'); FDA(i,7) = FD(X(Alist(i,:),:),'gowers','average'); % change needed FDB(i,7) = FD(X(Blist(i,:),:),'gowers','average'); % change needed FDA(i,8) = FD(X(Alist(i,:),:),'gowers','single'); % change needed FDB(i,8) = FD(X(Blist(i,:),:),'gowers','single'); % change needed FDA(i,9) = FD(X(Alist(i,:),:),'gowers','complete'); % change needed FDB(i,9) = FD(X(Blist(i,:),:),'gowers','complete'); % change needed end else for i = 1:iters currlist = randperm(S); Slist(i,:) = currlist; Alist(i,:) = [currlist(startA:endA),currlist(startShare:endShare)]; Blist(i,:) = [currlist(startB:endB),currlist(startShare:endShare)];

40

FDA(i,1) = FD(X(Alist(i,:),:),'euclidean','average'); FDB(i,1) = FD(X(Blist(i,:),:),'euclidean','average'); FDA(i,2) = FD(X(Alist(i,:),:),'euclidean','single'); FDB(i,2) = FD(X(Blist(i,:),:),'euclidean','single'); FDA(i,3) = FD(X(Alist(i,:),:),'euclidean','complete'); FDB(i,3) = FD(X(Blist(i,:),:),'euclidean','complete'); FDA(i,4) = FD(X(Alist(i,:),:),'cosine','average'); FDB(i,4) = FD(X(Blist(i,:),:),'cosine','average'); FDA(i,5) = FD(X(Alist(i,:),:),'cosine','single'); FDB(i,5) = FD(X(Blist(i,:),:),'cosine','single'); FDA(i,6) = FD(X(Alist(i,:),:),'cosine','complete'); % change needed FDB(i,6) = FD(X(Blist(i,:),:),'cosine','complete'); % change needed FDA(i,7) = FD(X(Alist(i,:),:),'gowers','average'); % change needed FDB(i,7) = FD(X(Blist(i,:),:),'gowers','average'); % change needed FDA(i,8) = FD(X(Alist(i,:),:),'gowers','single'); % change needed FDB(i,8) = FD(X(Blist(i,:),:),'gowers','single'); % change needed FDA(i,9) = FD(X(Alist(i,:),:),'gowers','complete'); % change needed FDB(i,9) = FD(X(Blist(i,:),:),'gowers','complete'); % change needed end end output(overlap+1,delta+1,:,:) = sign(FDA-FDB); G(:,:) = output(overlap+1,delta+1,:,:); nonrobust_probs(overlap+1,delta+1,1) = sum(abs(sum((G(1:iters,:))'))<n_measures)/iters; nonrobust_probs(overlap+1,delta+1,2) = sum(abs(sum((G(1:iters,1:3))'))<3)/iters; nonrobust_probs(overlap+1,delta+1,3) = sum(abs(sum((G(1:iters,4:6))'))<3)/iters; nonrobust_probs(overlap+1,delta+1,4) = sum(abs(sum((G(1:iters,7:9))'))<3)/iters; nonrobust_probs(overlap+1,delta+1,5) = sum(abs(sum((G(1:iters,[1,4,7]))'))<3)/iters; nonrobust_probs(overlap+1,delta+1,6) = sum(abs(sum((G(1:iters,[2,5,8]))'))<3)/iters; nonrobust_probs(overlap+1,delta+1,7) = sum(abs(sum((G(1:iters,[3,6,9]))'))<3)/iters; end end

41

Appendix 2.3 – MatLAB Code for calculating FD via Podani and Schmera

function output = FD(X,distance,cluster) if strcmp(distance,'gowers') Y = gowers(X); else Y = pdist(X,distance); end Z = linkage(Y,cluster); output = sum(sum(branch_lengths(Z)));

42

Appendix 2.4 – MatLAB Code for calculating probabilities of sensitivity S = length(X(:,1)); iters = 1000; n_measures = 9; % number of different ways to calculate FD Amax = (ceil(S/2)-3); Bmax = (floor(S/2)-3); Omax = (S-8); dist.probs = zeros(Bmax,Amax,Omax,7); G = zeros(iters,n_measures); for BBB = 1:Bmax for AAA = BBB:Amax for OOO = 1:(S-5-AAA-BBB) G(:,:) = output(BBB,AAA,OOO,:,:); dist.probs(BBB,AAA,OOO,1) = sum(abs(sum((G(1:iters,:))'))<n_measures)/iters; dist.probs(BBB,AAA,OOO,2) = sum(abs(sum((G(1:iters,1:3))'))<3)/iters; dist.probs(BBB,AAA,OOO,3) = sum(abs(sum((G(1:iters,4:6))'))<3)/iters; dist.probs(BBB,AAA,OOO,1) = sum(abs(sum((G(1:iters,7:9))'))<3)/iters; dist.probs(BBB,AAA,OOO,1) = sum(abs(sum((G(1:iters,[1,4,7]))'))<3)/iters; dist.probs(BBB,AAA,OOO,1) = sum(abs(sum((G(1:iters,[2,5,8]))'))<3)/iters; dist.probs(BBB,AAA,OOO,1) = sum(abs(sum((G(1:iters,[3,6,9]))'))<3)/iters; end end end

43

Chapter 3: Addressing the removal of rare species in bioassessments with other choices in multivariate analyses

Abstract The inclusion or exclusion of rare species in the bioassessment of aquatic communities has been

greatly debated. Researchers may include rare species in bioassessments as they are likely better

indicators of ecosystem stress than more common species (i.e. the biological argument).

Alternatively, researchers may exclude rare species due to the potential influence on statistical

analyses (i.e. the statistical argument). As this debate remains unresolved, the objective of this

study was to determine the impacts of removing rare species in multivariate bioassessments.

These approaches were tested independently using multivariate comparisons of fishes from a

thoroughly sampled system. The biological argument was assessed using sites-level vector

residuals across treatments of species removal and demonstrated that the removal of rare species

had important site-level implications relative to full dataset, including up to a nine-fold change in

multivariate vector residuals at sites where single species were removed. The statistical

argument was assessed using variation partitioning of multivariate decisions such as ordination

method, distance measure and removal of rare species, and found that the removal of rare species

demonstrated similar levels of multivariate variation (e.g. 24.8% variation) as other choices

inherent in multivariate bioassessments, such as the choice of ordination technique (26%

variation) and similarity measure (11%). This study demonstrates that contrary to the common

held practice of removing rare species in multivariate bioassessments, that the removal of rare

species may be less important than previously thought, whereas other multivariate decisions may

be at least as equally important. Better justification for the removal of rare species, along with all

decisions in multivariate analyses, is needed to ensure bioassessments are developed in a

rigorous manner.

Keywords: Multivariate analysis; ordination; similarity measures; rare species; community

ecology; bioassessment; procrustes analysis.

44

Introduction The use of multivariate analyses has become an important tool in the biological assessment of

aquatic communities (Norris 1995; Wright et al. 2000). Several national bioassessment

programs are based on multivariate measures, notably those in the UK (e.g. RIPVACS; Wright et

al. 2000) and Australia (AUSRIVAS; Simpson and Norris 2000; Metzeling et al. 2006); and the

use multivariate analyses has become widespread elsewhere (Reynoldson et al. 2001; Joy and

Death 2002; Bailey et al. 2004). In multivariate analyses, researchers order entities in the data

(e.g. species or observations) on the basis of the similarity of their characteristics (e.g.

observations or species) (Wartenberg et al. 1987). The goal of such analyses is to determine the

basis for the order of entities; for example, differences in species abundance or occurrence

between impacted sites versus sites with little or no impact (Barbour et al. 1999; Wright et al.

2000). From these orderings, one may be able to infer causative relationships between species

and their environment so that site-level impacts can be identified and mitigated (Hawkins et al.

2000).

The application of multivariate analyses to bioassessments of aquatic communities has been

riddled with controversy. Debates in bioassessment literature include the use of multi-metric

versus multivariate approaches (Hannaford and Resh 1995; Wallace et al. 1996; Bowman and

Somers 2006); the amount of taxonomic resolution needed to determine site level impacts

(Somers et al. 1998; Hewlett 2000); the use of rapid assessment methods (Hannaford and Resh

1995; Somers et al. 1998); and issues with using reference conditions (Norris and Thoms 1999;

Bowman and Somers 2005; Bailey et al. 2008). In particular, the debate regarding the treatment

of rare species has received much attention (e.g. Faith and Norris 1989; Norris 1995; Cao et al.

1998; Cao and Williams 1999; Marchant 1999; 2002). On one hand, researchers remove rare

species with the perceived notion that they may add noise to multivariate analyses and provide

little additional information beyond more common species (Gauch 1982; Clarke and Green 1988;

Marchant 1990; 2006; McCune and Grace 2002; Paukert and Wittig 2002). On the other hand,

researchers retain rare species in multivariate analyses because they may be better indicators of

ecosystem stress than common species (Faith and Norris 1988; Cao et al. 1998; 2001), as they

may be more sensitive to the stressor(s). In either case, the debate regarding the treatment of rare

45

species has remained unresolved and researchers need to be aware of the impact of their decision

of how to treat rare species (among others).

There are many difficulties in attempting to resolve the debate regarding the treatment of rare

species in bioassessments. For example, most multivariate approaches require several decisions

beyond the removal of rare species, and these decisions may reduce insight into the effect of rare

species on resultant analyses. Researchers using multivariate methods typically must choose a

type of similarity coefficient and ordination technique, where such choices have been shown to

significantly alter results (Podani 2000; Podani and Schmera 2006; Hirst and Jackson 2007; Poos

et al. 2009). As such, the resolution to the debate regarding the impact of rare species cannot

proceed until the effect of removing rare species is placed into a context comparable to other

decisions inherent in multivariate bioassessments. Unfortunately, relatively little effort has gone

into comparing these methods of bioassessment (Norris 1995; Marchant et al. 2006), and few

studies have viewed decisions in analyses in a holistic manner (e.g. how do all of the decisions

inherent in multivariate bioassessments affect results?).

There are two general arguments for the inclusion or exclusion of rare species in multivariate

bioassessments. The first argument for removing rare species from bioassessments is that rare

species provide limited interpretative value (Marchant 1999). Proponents of this argument

suggest that rare species may simply reflect stochastic sampling effects and, as such, add noise to

the statistical solution (Gauch 1982; Clarke and Green 1988; Bailey et al. 2004). This argument

is referred to as the “statistical argument” for excluding rare species. Support for this argument

has come from results from multivariate methods could be driven by the inclusion of rare species

alone (Webb et al. 1967, Austin and Greig-Smith 1968, Day et al. 1971, Orloci and Mukkattu

1973). To some degree this argument has been examined in the literature with analyses of data

standardizations (Jackson 1993; Cao et. al 1999), similarity coefficients (Jackson et al. 1989),

ordination method (Marchant 1990), or their combinations (e.g. data standardization and

similarity coefficients; Jackson 1993; Hirst and Jackson 2007; taxonomic resolution and rarity,

Arscott et al. 2006). Unfortunately, a quantitative evaluation of the role of rare species in

community assessments is largely absent, including a more holistic evaluation that answers the

practical question of how important rare species are relative to other decisions in multivariate

analyses (but see Faith and Norris 1988). In this context, the statistical argument can be tested as

a hypothesis, with the prediction that multivariate analyses should show greater variation among

46

one another where rare species are removed than multivariate analyses where other decisions

have been altered (e.g. similarity measure, ordination method) – specifically, does the inclusion

or exclusion of rare species lead to greater changes in multivariate analyses than those arising

due to other decisions? If not, then the inclusion of rare species may be warranted as this

decision contributes similar or lower amounts of variation versus other decisions typical in a

bioassessment.

The second argument for the inclusion or exclusion of rare species is that such species should be

included in multivariate analyses because they are better indicators of ecosystem stress than are

common species (Cao et al. 1999; 2001a), i.e. common species tend to have broad ranges of

tolerance to many conditions and, therefore, are not good indicators. This argument is referred to

as the “biological argument” for the inclusion of rare species. Support for this argument has

come from empirical studies that note the importance of including rare species for conservation

issues (Margules 1986; DeVelice et al. 1988; Norris and Hawkins 2000). Proponents of this

argument suggest that exclusion of rare species may lead to an underestimation of differences

between impacted and un-impacted sites (Cao et al. 1999; 2001a). As one of the main goals of a

bioassessment is to determine site-level impacts (Barbour et al. 1999; Wright et al. 2000), this

argument assumes that sites with rare species represent the strongest signals, such as decreases in

species diversity or changes in community composition (Cao et al. 1998). Therefore, the

biological argument can be tested also as a hypothesis, with the prediction that once rare species

are removed from multivariate analyses, sites which were chosen for the removal of rare species

should be more affected across multivariate analyses than sites that were not (i.e. when rare

species are removed from the analysis, the site-level signal will change in greater proportion at

sites with rare species than those sites without). If this result is not found, it would indicate that

the exclusion of rare species is warranted as they do not provide information beyond that

captured by more common species.

Given these hypotheses, the objective of this study is to determine the biological and statistical

effect of removing rare species relative to other methodological decisions inherent in

multivariate analyses (e.g. choice of ordination method and distance measure). For this

evaluation, the Sydenham River is used as a model system because it has the highest diversity of

aquatic fauna in Canada, as well as the highest number of species at risk in Canada (Staton et al.

2003). Also, the Sydenham River has undergone detailed sampling (Poos et al. 2007; 2008),

47

which provides a high-quality dataset. As there are many types of rarity (Raboniwitz et al. 1986;

Gaston 1994), for clarity rare species are defined as those which occur infrequently (i.e. at few

locations or low prevalence, e.g. <1%, 5% and 10% occurrence). As well, species that are rare

due to declines in population sizes or number of locations and have a conservation designation

(e.g. endangered, threatened, special concern) are referred to as species at risk.

Methods Fishes were collected from the Sydenham River using the Ontario Stream Assessment Protocol

(OMNR 2007). The Ontario Stream Assessment Protocol is a typical bioassessment protocol for

monitoring impacts to aquatic systems (e.g. Barbour et al. 1999; Wright et al. 2000) and has been

used to monitor the changes in riverine communities (Stanfield and Jones 1998; Poos et al.

2008). Fishes were used as model organisms rather than benthic macroinvertebrates as fish are

relatively easy to identify, and enumerate and have been used extensively in bioassessments (e.g.

Fausch et al. 1990; Joy and Death 2002; Boys and Thoms 2006; Kennard et al. 2006; Mugodo et

al. 2006); however, this approach is equally well suited to macroinvertebrates or any other group

of species. Further, most bioassessments using benthic macroinvertebrates (Marchant 1990;

Marchant et al. 1997; Cao et al. 1998) are sensitive to sampling method and taxonomic

resolution (Arscott et al. 2006; Nichols and Norris 2006).

Fishes were sampled using a variety of approaches (see Poos et al. 2007 for sampling protocol);

however, for this analysis only electrofishing data were used as it is the most commonly used

bio-monitoring protocol and regarded as the most effective gear type for sampling stream-fish

assemblages (Bohlin et al. 1989; Reynolds 1996). As sample representativeness may be an issue

(Cao et al. 2001a; 2002), electrofishing was assessed relative to other methods and determined to

be the most effective method for evaluating fish species at risk (Poos et al. 2007). Species were

collected at 50 sites in 2002 and 25 additional sites in 2003. Sampling sites were chosen at

random across the entire watershed, with the exception of non-wadeable sections of the river in

the lower portions of the watershed which were not sampled due to constraints with using

wadeable sampling gear.

48

Evaluating Decisions in Multivariate Bioassessments

Prior to analysis, four treatments of the removal of species were applied to the species matrix

(Table 3.1). Data transformation and standardization have been previously shown to influence

multivariate analyses (Austin and Greg-Smith 1968; Jackson et al. 1993; Cao et al. 1999) and,

not wanting to provide potential bias, data were reduced to presence/absence as it removes one

additional source of variation from the comparisons and provides focus on specific comparisons

of multivariate methods.

Traditionally, researchers decide what characteristics define a rare species within a sample (Faith

and Norris 1989; McCune and Grace 2002). Some researchers suggest eliminating species that

occur at single sites because of the inflated correlations created by attempting to relate

potentially random features at that site to its lone occurrence (Legendre and Legendre 1998).

Others suggest removing species that occur at less than five percent (McGardial et al. 2000) or

ten percent of sites (Marchant 1990; McCune and Grace 2002) or at even higher thresholds

(Boulton et al. 1992; Marchant et al. 1997). The following treatments of removing rare species

were used: all species included, single-occurrence species removed; species found at less than

five percent of sites removed; and, species found in less than 10% of sites removed. These

criteria removed 0, 2, 8, and 21 species respectively of the 67 species dataset.

49

Table 3.1 – Summary of ordination techniques, similarity coefficients and exclusion of rarely sampled species being compared. Abbreviations are indicated in parentheses and used in subsequent figures and tables. All four approaches described in the “Exclusion of Rarely Sampled Species” were used in each of the four “Similarity Coefficient” combinations with both PCoA and NMDS. As Correspondence Analysis has the implicit chi-squared distance measure, only the four approaches used in the “Exclusion of Rarely Sampled Species” were included in that set of analyses.

Ordination Technique Similarity Coefficient Exclusion of Rarely Sampled Species

Principal Co-ordinates Analysis (PCoA)

• Jaccard’s (J) • Phi (Φ) • Russell and Rao (RR) • Simple Matching (SM) *

• No species removed (All) • Single occurrences removed (M1) • < 5% occurrences removed (M5) • < 10% occurrences removed (M10)

Non-metric multidimensional scaling (NMDS)

• Jaccard’s (J) • Phi (Φ) • Russell and Rao (RR) • Simple Matching (SM)

• Same as above.

Correspondence analysis (CA)

• X2 distance (no other choice of similarity coefficient)

• Same as above.

Note: * Principal Co-ordinates Analysis (PCoA) using a simple matching similarity is identical to a Principle Components Analysis (PCA) using a correlation matrix (Gower 1966).

50

Similarity coefficients were calculated from each of the four matrices of rarely sampled species

(Table 3.1). Several dozen similarity coefficients have been developed for use with

presence/absence data (Gower 1966; Gower and Legendre 1986; Legendre and Legendre 1998;

Podani 2000). The choice of similarity coefficient has been largely subjective and is often

based on tradition or on a posteriori criteria without clear justification (Jackson et al. 1989;

Krebs 1998). As different similarity coefficients emphasize different aspects of the relation

between observations, the exclusion of rare species may alter species relationships and

subsequent results of analyses such as ordinations (McGargal et al. 2000). Treatments of

similarity coefficient included: Jaccard’s, phi (Φ), Russell and Rao, and simple matching

coefficients, and were chosen because they represent standard examples amongst the continuum

of similarity coefficients. Jaccard’s similarity does not consider joint absences; the phi

coefficient is the correlation coefficient for binary data; and, Russell and Rao and simple

matching are variations that consider joint absences (Jackson et al. 1989; Legendre and Legendre

1998; Podani 2000). All similarity coefficients were transformed into metric distances having

Euclidean properties for subsequent analysis (Jackson et al. 1989; Legendre and Legendre 1998).

Three types of ordination technique were compared for each combination of treatments

excluding rare species and using different similarity coefficients: principal co-ordinates analysis

(PCoA), non-metric multidimensional scaling (NMDS), and correspondence analysis (CA). As

PCoA measured from simple matching similarity is identical to PCA measured using a

correlation coefficient (Gower 1966), the results can be used to interpret both ordination types.

These ordination techniques were chosen because they represent typical multivariate methods

used by the majority of biologists (Legendre and Legendre 1998; Podani 2000; McCune and

Grace 2002). In addition, more current approaches (e.g. NMDS, CA) may provide robust

alternatives to previous methods (e.g. PCA) where non-linear relationships occur between

variables (Cao et al. 2001a). Whereas both NMDS and PCoA allow the user to choose a

similarity coefficient, CA do not provide the same option given its inherent resemblance measure

(chi-square distance). For NMDS, a random set of 20 starting configurations were used as input

configurations, and the solution having the lowest stress was retained. In NMDS, stress was

measured as an objective function of a regression analysis where the goodness of between the

fitted values and forecasted values was fit using a least square criterion (Legendre and Legendre

1998). A broken-stick model was used to compare the eigenvalues from PCoA and CA to those

51

expected from random relationships. This method has been shown useful in identifying non-

random patterns of association in multivariate analyses (Jackson 1993; King and Jackson 1999;

Peres-Neto et al. 2003). Axes not representing a meaningful contribution of the variation were

removed from resultant analyses. All analyses were completed using the R programming

language v2.70 plus statistics libraries simba (Jurasinski 2007), vegan (Oksanen et al. 2008) and

ecodist (Goslee and Urban 2007).

Assessing the Statistical Argument

The statistical argument in multivariate bioassessments was assessed in several ways. First, all

variants in ordination method, similarity coefficients and exclusion of rare species were

compared using Procrustes analysis (Gower 1971; Jackson 1995). Procrustes analysis is

appropriate for comparing separate ordination results because it is an orthogonal rotation that

best matches two or more ordinations (Olden et al. 2001; Peres-Neto and Jackson 2001; Paavola

et al. 2006). The first three dimensions from each ordination solution were retained for

comparisons as they represented the greatest portion of variance explained using a broken-stick

model (Legendre and Legendre 1998) and the majority of the ordination methods were best

represented by three-dimensional solutions, including the NMDS results. The sum-of-squared-

deviations (i.e. m2 statistic) was used as a metric of association, with lower sum-of-squared-

deviations representing greater similarity of multivariate configurations (Gower 1971; Jackson

1993; Peres-Neto and Jackson 2000) and was calculated between each pair of three-dimensional

ordination solutions to produce a matrix of m2 distances between all 36 exclusion-distance-

ordination combinations. The resultant 36-by-36 matrix of m2 distances was analyzed using a

PCoA to determine the relative effect of each methodological choice. This type of “ordination of

ordinations” (see Digby and Kempton 1987; Jackson 1993; Hirst and Jackson 2007) provides a

useful characterization of methodological decisions, where larger distances between objects

represent more dissimilar associations. A minimum spanning tree was calculated to determine

the most similar groups and super-imposed onto the first two axes of the ordination diagram.

Partitioning of variation of multivariate data can provide quantitative and objective

determination of the influence of methodological choices. For multivariate datasets, partitioning

of variation is often thought of in a spatial or temporal context, where the influence of variables

can be partitioned across various spatial or temporal scales (e.g. Borcard et al. 1992; Borcard et

52

al. 2004; Dray et al. 2006). Yet, partitioning of variation in multivariate data is also possible

through the analysis of residuals across various matrix comparisons (e.g. Rundle and Jackson

1996; Olden et al. 2001; Paavola et al. 2006). For example, the total of among-group variation of

removing rare species can be summarized relative to all treatments using the sum of squared

deviations from a Procruste’s analysis, and represents the amount of multivariate variation

explained. In this study, the variation of all treatments was quantified using a partitioning method

of multivariate matrices (see Rundle and Jackson 1996), which separated the sum-of-squared-

deviations for within- and among-treatment groups (e.g. removal of rare species, ordination

technique, and similarity measure).

Assessing the Biological Argument

To assess the biological argument of removing rare species from bioassessments, changes at sites

where rare species were removed were evaluated across the various multivariate analyses. Site-

level differences were calculated for each pair-wise Procrustes analysis using vector residuals

from PROtest (Jackson 1995). Vector residuals provide a means of investigating deviations in

position of individual samples between two superimposed ordinations (Olden et al. 2001;

Paavola et al. 2006), i.e. the degree to which any given observation changes from one ordination

to another ordination. The length of the vector residual represents a lack of fit of scores for an

individual sample between two ordinations, with low values indicating close agreement between

multivariate methods. Vector residuals were separated between sites where rare species were

removed with sites where rare species where no species were removed. For example in Figure

3.1, a typical example of site level vector residuals is shown across a comparison of multivariate

analyses (e.g. PCoA with Jaccard’s distance) between all data and with species occurring at 5%

of sites removed. From this comparison, the effect of the removal of rare species can be assessed

as the ratio of site-level vector residuals for sites where rare species were removed (i.e. grey bars;

Fig. 3.1) versus the sites where species were not removed (i.e. black bars; Fig. 3.1). Ratios over

1 indicate situations where site-specific differences are more related to the removal of rare

species than those that not (i.e. bioassessments may be affected by species removal as sites

where rare species occur, change in greater magnitude than sites where rare species do not occur

when compared to a full dataset).

53

Figure 3.1 – Example of rank-ordered, site-level vector residuals of Procrustean multivariate comparison. The length of a vector residual indicates an overall lack of fit for a site between two multivariate analyses. Shown is a comparison of full dataset of Principal Coordinates with Jacaard’s distance and the same dataset where species occurring at 5% of sites were removed. Vectors shown in grey indicate those sites where at least one species was removed, whereas vectors in black indicate sites where no species were removed. The ratio of mean vector residuals between sites where species were removed versus those sites that did not have species removed indicates the distribution of impacts of the removal of rare species across multivariate analyses. Where most vector residuals for sites having species removed are largest, they indicate that these observations (sites) have been changed the most in their position between two ordinations.

0

0.006

0.012

1 7

Vect

or R

esid

ual

Ordered Sites

5

54

Results The multivariate analyses used in this study provided generally good representation of the data.

Variance explained by the first three axes from all combinations of multivariate analyses ranged

from 24.6% (PCoA with Jaccard’s distance and all species included) to 38.4% (PCoA with

simple matching similarity and species occurring at less than 10% sites removed). In all cases,

the variance explained by each multivariate method increased with the exclusion of more

species.

For fish species in the Sydenham River, the removal of rare species had similar effects on

resultant analyses to those arising from other decisions in multivariate analyses, such as the

choice of ordination type or similarity coefficient. Recall that in considering the relative role of

the various decisions to be made, the partitioning of variation provides a measure summarizing

the relative importance. Variation across multivariate analyses was highest for ordination

method (26.15%) across all comparisons, followed by the removal of rare species (24.81%) and

similarity measure (10.99%; Table 3.2). These results were also evident by the well-defined

clustering of treatments of rarely sampled species (All, M1, M5) in close proximity to one

another relative to the clustering of the differences in ordination technique (PCoA, NMDS, CA,

PCA) or in similarity coefficients (J, Φ RR, SM) in the ordination of m2 distances, i.e. the

comparison of the various ordination results (Fig. 3.2). One clear exception to this result was the

removal of 10% of the least occurring species, which showed deviations from the general

multivariate groupings (Fig. 3.2a,b), and variation that exceeded most other choices (6.82%;

Table 3.2).

There was large variation between individual choices across multivariate methods. Whereas CA

and PCA demonstrated overall low amounts of variation among analyses (0.19%; 0.039%

respectively), there was large variation among analyses based on NMDS (21.60%; Table 3.2).

These differences may be influenced by the smaller number of comparisons for correspondence

analysis (as a choice of similarity measure is implicit and not selected); however, NMDS also

showed an almost seven-fold increase in variation over PCoA, which involved same number of

55

a) Axis 1 (47.3%)

-0.45 0 0.45

Axi

s 2 (2

0.7%

)

-0.3

0

0.3

M10

M10

All, M1, M5

M10

M10

NMDS-RR

All, M1, M5

NMDS-J

NMDS-Φ

All, M1, M5

NMDS-SM

CA

All, M1, M5

M10

PCoA-J

PCoA-Φ

M10M1 M5

All, M1, M5

PCoA-SM

PCoA-RR

All, M1, M5M10

All, M1, M5M10

M10All

AllM1M5

PCA

b)

-0.3 0.30-0.3

0.15

M10

M5,M1,AllM10

M10

All,M1,M5

M10

CA

NMDS-Φ

NMDS-SMAllM1

M5

M5M1All NMDS-J

M10

NMDS-RRAllM1M5

PCoA-SM

PCoA-Φ

M10 M10

PCoA-J

PCAPCoA-RR

M10 M10

M1M5

All

M1M5

All

M5M1All

M5M1All

Axis 2 (20.7%)

Axi

s 3 (1

5.6%

)

0

0

Figure 3.2 –Principal Coordinates Analysis (PCoA) of the sum of squares deviations (m2 statistic) comparing the concordance between solutions based on different ordination techniques, similarity coefficients and treatments of excluding rarely sampled species. A minimum-spanning tree was overlaid on Axes 1 and 2 to highlight connections between groups of points. Dashed lines indicate deviations from group membership in cases where clear groupings do not exist (e.g. M10 for Axes 2 and 3). Short forms are continued from Table 3.2.

56

comparisons (Table 3. 2). Decisions, such as, which ordination method to choose, may be as

important (or more important) than other choices like the removal of rare species. In fact, the

inclusion or exclusion of rare species did not impact the resulting multivariate analyses any more

than the choice of ordination method (i.e. subtotals of 24.8% vs 26.2%, respectively) and likely

less given that four comparisons regarding species inclusion are used and only three comparisons

for ordination method. Levels of variation were similar between the removal of single

occurrence species (6.03%) and species occurring at 5% of the sites (5.93%) as they were for

using the entire dataset (6.03%; Table 3.2). Finally, the choice of similarity measure showed

lower levels of variation in general. Variation in X2 values was lowest (0.19%), followed by

simple matching (1.53%), Jaccard (2.03%) and the Phi (2.16%) coefficients (Table 3.2). Table 3.2 –Partitioning of variation in sum of squared deviations of Procrustes analyses (m2 statistic) across various choices in multivariate analyses, including: i) removal of rare species; ii) ordination technique; and, iii) choice of similarity measure. Abbreviations are those noted in Table 1.

Variation component %

I) Removal of Rare Species All 6.03 M1 6.03 M5 5.93 M10 6.82 Subtotal 24.81 II) Ordination Technique PCoA 4.37 NMDS 21.60 CA 0.19 Subtotal 26.15 III) Dissimilarity Measure J 2.03 Phi 2.16 RR 5.08 SM 1.53 X2 0.19 Subtotal 10.99

57

There was high site-level effect of removing rare species. In most cases, when rare species were

removed from the analysis, the effects were driven by differences in sites that contained species

that were removed. For example, the site-level residuals were much higher in sites impacted by

the removal of species than the full data (Figure 3.3; Appendix 3.1). Recall that site-level vector

residuals represent the degree to which any given observation changes from one ordination to

another (Olden et al. 2001; Paavola et al. 2006). Therefore, the ratio of site-level residuals

between sites impacted by species removal and those sites that did not have species removed

provides an indication as to magnitude that rare species may alter site-level assessments.

Overall, sites impacted by the removal of rare species changed in multivariate space over nine

fold when single-occurrence species were removed from the analysis, relative to the full data set,

and over two fold when species having prevalence less than 5% were removed from the analysis.

Interestingly, once species that occurred at less than 10% of sites were removed from the

analyses, there was virtually no difference between the two categories of sites (1.13 difference;

Figure 3.3), and in some cases represented less of an impact (e.g. NMDS-J, NMDS-RR;

Appendix 3.1).

0

3

6

9

12

15

M1 M5 M10

Proc

rust

e's v

ecto

r res

idua

l

Removal of Missing Species

Figure 3.3 – Site-level impact of the removal of rare species. Shown are box and whisker plots of the ratios of Procrustes vector residuals between sites for which rare species were removed and those sites that did not have any species removed. All comparisons were done by comparing site-level Procrustes vector residuals from the full datasets and with the removal of rare species across all similarity coefficients and ordination methods.

58

Discussion The treatment of rare species in multivariate bioassessments has been debated widely.

Advocates for the inclusion of rare species argue that rare species are useful indicators of

environmental stressors and their removal may result in the unnecessary loss of ecological

information (i.e. biological argument; Cao et al. 1999; 2001). On the other hand, advocates for

the exclusion of rare species argue that when rare species are removed from analysis, the

resultant analyses (and conclusions) are not altered (i.e. statistical argument; Gauch 1982;

Marchant 1999; 2002). To date, the debate surrounding the removal of rare species in

bioassessments has remained controversial and resolution is needed.

One of the difficulties with assessing the importance of removing rare species in bioassessments

is the lack of context from which to judge the consequences of the decision. For example, how

can one evaluate whether the inclusion of rare species provides redundant information with more

common species or provides undue influence (Gauch 1982; Marchant 1999; 2002; Bailey et al.

2004)? Alternatively, how can one determine whether rare species are more sensitive to

ecosystem stress than more common species (Faith and Norris 1989; Cao et al. 1999; 2001)?

Here, both the statistical and biological arguments were tested as separate hypotheses using data

collected from fish species in a well-sampled aquatic system. This study demonstrates contrary

to the previously held notions that rare species provide redundant information as compared to

more common species, or unduly influence multivariate analyses (Marchant 1999; Marchant et

al. 2006); neither was supported. In the case of fish species in the Sydenham River, the

hypothesis for the biological argument for the inclusion of rare species in bioassessments was

supported, while the hypothesis for the statistical argument was not.

The removal of rare species may have large biological consequences for bioassessments. First,

as rare species may not be as rare as perceived simply as a result of sampling bias (Preston 1948,

Resh et al. 2005; Arscott et al. 2006), the removal of rare species may limit the number of

species from which to assess the biological community. Second, and perhaps more importantly,

the removal of rare species may fundamentally change conclusions of multivariate

bioassessments. In this study, when rare species were removed from the analysis, sites impacted

by this removal shifted in multivariate space to a greater degree than those not directly changed

by this decision (e.g. nine fold change between the full data set and when single occurring

59

species were removed; Fig. 3.3). Interestingly, these results were reduced as more species were

removed (Fig. 3.3), demonstrating that as species are removed from the analyses, site-level

species assemblages become more homogenized and differences across multivariate analyses are

minimized. Therefore, the removal of rare species may also remove important indicators of

ecosystem stress. In the case of the Sydenham River, large-scale agricultural activity and

increases in turbidity has led to declines in several species (Staton et al. 2003; Poos et al. 2007),

with issues of turbidity shown to be more related to rare species (e.g. species at risk; Poos et al.

2008). Despite claims to the contrary (e.g. Marchant et al. 2006) this study demonstrates a

scenario where the choice of removing rare species can have limited statistical effect on the

overall multivariate analyses but, at the same time, have large biological effect on particular

observations (i.e. sampling sites). Therefore, the assumption that more common species can

sufficiently define impact or reflect the response of the whole community may not be justified

(Marchant 2002; Marchant et al. 2006).

The importance of choices in multivariate analyses need to be better justified for bioassessments.

For example, this study demonstrates that the removal of rare species had similar (and often less)

influence in multivariate analyses as other choices inherent in its calculation. Previous research

has noted that differences in similarity coefficients (Jackson et al. 1989; Jackson 1997; Cao et al.

1998; Legendre and Gallagher 2001; Podani 2005; Podani and Schmera 2007; Poos et al. 2009)

and ordination techniques (Jackson 1993; Podani 2000; Heino et al. 2003) can lead to divergent

results. Researchers often select methods based on past experience and assume that the resultant

summary adequately models the underlying data, or they choose solutions that are most

interpretable with regard to a priori hypotheses (Jackson et al. 1989; Jackson 1997; Podani

2000). This approach may have severe consequences for the ultimate goal of inferring

community responses for bioassessment. Here, comparisons of choices inherent in multivariate

analyses demonstrated that choices, such as ordination methods (e.g. NMDS), can provide

largely divergent results (Fig. 3.2; Table 3.2). As a result, the removal of rare species may be

less of a concern than previously noted (e.g. Marchant 1999; Marchant et al. 2006), whereas

other choices (e.g. type of ordination) may be more important. These results indicate that

researchers must be mindful of the statistical decisions they make including ordination

technique, similarity coefficient and the exclusion of rarely sampled species, as each choice may

have potential to influence community responses and meaningful conclusions. Other issues,

60

such as sample size (Cao et al. 2001b; 2002), seasonal effects (Furse et al. 1984), experience

(Metzeling et al. 2003), data standardization (Jackson 1993; Cao et al. 1999), taxonomic

resolution (Arscott et al. 2006) and data quality (Cao et al. 2003; Nichols and Norris 2006), also

require adequate justification.

There are issues with the inclusion of rare species in multivariate analyses that should be taken

into account for the bioassessment of aquatic communities that this study could not assess.

Species found infrequently, but with varying abundances have been shown strongly to influence

multivariate analyses (Legendre and Legendre 1998). Researchers who wish to minimize the

impacts of rare species can choose from a variety of data transformations that can downweight

the influence of rare species, although caution is warranted when using small sample sizes

(Jackson et al. 1989; Jackson 1993; Cao et al. 1999). For example, Legendre and Gallagher

(2001) have suggested the use of Helinger transformation for reducing the impacts of rare

species. In addition, authors can choose to reduce their data from abundance to presence-

absence (as done here), which represents the strongest form of standardization and is less likely

to influence analyses (Cao et al. 1999) than many other decisions. The decision to include or

remove rare species will be context dependant. This study demonstrates that, contrary to the

common practice of removing rare species in multivariate analysis, that the impact of leaving

rare species in may be minimal relative to other methodological choices while maintaining

important site-level information on species, including some with species at risk.

Ultimately, the decision to include or remove rare species should be justified by the goals of the

bioassessment. In cases, such as the Sydenham River, where rare species are used as targets for

evaluating ecosystem recovery, rare species should remain in the analyses as their inclusion did

not alter the results (e.g. levels of variation were similar among All, M1 and M5 treatment

groups; Table 2) and they also represent components of the community being assessed.

Naturally, researchers wish to limit their data to reflect the most appropriate number of species,

the most practical similarity coefficient and the most useful ordination technique; however, no

such set of criteria exists. One alternative is to use a consensus approach where several methods

(and choices within methods) are used and compared (Green 1979; Jackson et al. 1989; Jackson

1993). If the methods produce similar results then one can have greater confidence that the

results are more robust and representative rather than being dominated by the set of choices used

61

in the analysis (Jackson 1993). In summary, better justifications of all of decisions in analyses

are needed to ensure bioassessments are rigorous.

Acknowledgements

Funding was provided by NSERC and OGS Scholarships to M.S.P, an NSERC Discovery Grant

to D.A.J, and funding from the Ontario Ministry of Natural Resources and University of Toronto.

This manuscript was greatly enhanced by conversations with C. Harpur.

References

Arscott, D. B., J. K. Jackson, and E. B. Kratzer. 2006. Role of rarity and taxonomic resolution in

a regional and spatial analysis of stream macroinvertebrates. Journal of the North


Austin, M.P. and P. Greig-Smith. 1968. The application of quantitative methods to vegetation

survey. II. Some methodological problems of data from rain forests. Journal of Ecology

56:827–844

Bailey, R. C., R. H. Norris, and T. B. Reynoldson. 2004. Bioassessment of freshwater

ecosystems: Using the reference condition approach. Springer, New York.

Bailey, R. C., M. G. Kennedy, M. Z. Dervish, and R. M. Taylor. 2008. Biological assessment of

freshwater ecosystems using a reference condition approach: comparing predicted and

actual benthic invertebrate communities in Yukon streams. Freshwater Biology 39:765-

774.

Barbour, M.T., J. Gerritsen, B.D. Snyder, and J.B. Stribling. 1999. Rapid bioassessment

protocols for use in streams and wadeable rivers: Periphyton, benthic macroinvertebrates

and fish. 2nd edition. EPA-841-B-99-002. Environmental Protection Agency.

Borcard, D., P. Legendre, and P. Drapeau. 1992. Partialling out the Spatial Component of

Ecological Variation. Ecology 73:1045-1055.

62

Borcard, D., P. Legendre, C. Avois-Jacquet, and H. Tuomisto. 2004. Dissecting the spatial

structure of ecological data at multiple scales. Ecology 85:1826-1832.

Bohlin, T., S. Hamrin, T.G. Heggberget, G. Rasmussen, and S.J. Saltveit. 1989. Electrofishing -

theory and practice with special emphasis on salmonids. Hydrobiologia 173: 9-43.

Boulton, A. J., 1999. An overview of river health assessment: philosophies, practice, problems

and prognosis. Freshwater Biology 41: 469–479.

Bowman, M. F., and K. M. Somers. 2005. Considerations when using the reference condition

approach for bioassessment of freshwater ecosystems. Water Quality Research Journal of

Canada 40:347-360.

Bowman, M. F., and K. M. Somers. 2006. Evaluating a novel Test Site Analysis (TSA)

bioassessment approach. Journal of the North American Benthological Society 25:712-

727.

Boys, C. A. and M. C. Thoms, 2006. A large-scale, hierarchical approach for assessing habitat

associations of fish assemblages in large dryland rivers. Hydrobiologia 572: 11–31.

Cao, Y., and D.D. Williams. 1999. Rare species are important in bioassessment (Reply to the

comment by Marchant). Limnology and Oceanography 44:1841-1842.

Cao Y., D.D. Williams and N.E. Williams. 1998. How important are rare species in community

ecology and bioassessment. Limnology and Oceanography 43: 1403–1409.

Cao, Y., D. D. Williams, and N. E. Wiliams. 1999. Data transformation and standardization in

the multivariate analysis of river water quality. Ecological Applications 9:669-677.

Cao Y., D.P. Larsen, and R.S. Thorne. 2001a. Rare species in multivariate analysis for

bioassessment: some consideration. Journal of the North American Benthological Society

20: 144–153.

Cao Y., D.P. Larsen, and R.M. Hughes. 2001b. Evaluating sampling sufficiency in fish

assemblage survey: a similarity-based approach. Canadian Journal of Fisheries and Aquatic

Sciences 58: 1782–1793.

63

Cao Y., D.D. Williams, and D.P. Larsen. 2002. Comparison of ecological communities-the

problem of sample representativeness. Ecological Monographs 72: 41–56.

Cao, Y., C.P. Hawkins, and M.R. Vinson. 2003. Measuring and controlling data quality in

biological assemblage surveys with special reference to stream benthic macroinvertebrates.

Freshwater Biology 48: 1898–1911

Clarke, K. R., and R. H. Green. 1988. Statistical design and analysis for a 'biological effects'

study. Marine Ecology Progress Series 46: 213-226.

COSEWIC. 2007. Canadian Species at Risk. Committee on the Status of Endangered Wildlife in

Canada, Ottawa, Ontario.

Day, J.H., J.G. Field, and M.P. Montgomery. 1971. The use of numerical methods to determine

the distribution of the benthic fauna across the continental shelf of North Carolina. Journal

of Animal Ecology 40: 93–125.

DeVelice, R. L., J. W. DeVelice, and G. N. Park. 1988. Gradient analysis in nature reserve

design: a New Zealand example. Conservation Biology 2: 206-217.

Digby, P. G. N. and R.A. Kempton. 1987. Multivariate analysis of ecological communities.

Chapman & Hall, London.

Dray, S., P. Legendre, and P. R. Peres-Neto. 2006. Spatial modelling: a comprehensive

framework for principal coordinate analysis of neighbour matrices (PCNM). Ecological

Modelling 196:483-493.

Faith, D. P., and R. H. Norris. 1989. Correlation of environmental variables with patterns of

distribution and abundance of common and rare freshwater macroinvertebrates. Biological

Conservation 50:77-98.

Fausch, K. D., J. Lyons, J. R. Karr, and P. L. Angermeir. 1990. Fish communities as indicators of

environmental degradation. American Fisheries Society Symposium 8:123-144.

Furse, M. T., D. Moss, J.F. Wright, and P.D. Armitage. 1984. The influence of seasonal and

taxonomic factors on the ordination and classification of running-water sites in Great

64

Britain and on the prediction of their macroinvertebrate communities. Freshwater Biology

14: 257-80.

Gaston, K.J. 1994. Rarity. Chapman and Hall Press, London.

Gauch, H. G. 1982. Multivariate analysis in community ecology. Cambridge University Press,

Cambridge.

Goslee, S. and D. Urban. 2007. Ecodist: Dissimilarity-based functions for ecological analysis. R

package version 1.1.3.

Gower, J.C. 1966. Some distance properties of latent root and vector methods used in

multivariate analysis. Biometrika 53 : 325-338.

Gower J. C. 1971. A general coefficient of similarity and some of its properties. Biometrics 27:

857–874.

Gower, J. C., and P. Legendre. 1986. Metric and Euclidean properties of dissimilarity

coefficients Journal of Classification 3: 5-48.

Green, R. H. 1979. Sampling design and statistical methods for environmental biologists. Wiley,

New York.

Hannaford, M. J., and V. H. Resh. 1995. Variability in macroinvertebrate rapid-bioassessment

surveys and habitat assessments in a Northern California stream. Journal of the North


Hawkins, C. P., R. H. Norris, J. N. Hogue, and J. W. Feminella. 2000. Development and

evaluation of predictive models for measuring biological integrity of streams. Ecological

Applications 10:1456-1477.

Heino, J., T. Muotka, H. Mykra, R. Paavola, H. Hamalainen, and E. Koskenniemi. 2003.

Defining macroinvertebrate assemblage types of headwater streams: implications for

bioassessment and conservation. Ecological Applications 13:842-852.

65

Hewlett, R. 2000. Implications of taxonomic resolution and sample habitat for stream

classification at a broad geographic scale. Journal of the North American Benthological

Society 19:352-361.

Hirst, C. N., and D. A. Jackson. 2007. Reconstructing community relationships: the impact of

sampling error, ordination approach, and gradient length. Diversity and Distributions

13:361-371.

Jackson, D.A. 1993. Multivariate analysis of benthic invertebrate communities: the implication


methods.

Jackson, D.A. 1995. PROTEST: A PROcrustean randomization test of community environment

concordance. Ecoscience 2: 297-303.

Jackson, D.A. 1997. Compositional data in community ecology: the paragidm or peril of

proportions? Ecology 78: 929-940.


occurrence and association or simply measures of occurrence? American Naturalist 133:

436-453.

Joy, M. K., and R. G. Death. 2000. Development and application of a predictive model of

riverine fish community assemblages in the Taranaki region of the North Island, New

Zealand. New Zealand Journal of Marine and Freshwater Research 34:241-252.

Jurasinski, G. 2007. Simba: A collection of functions for similarity calculation of binary data. R

package version 0.2-5.

Kennard, M. J., B. J. Pusey, A. H. Arthington, B. D. Harch and S. J. Mackay, 2006.

Development and application of a predictive model of freshwater fish assemblage

composition to evaluate river health in eastern Australia. Hydrobiologia 572: 33–57.

King, J.R., and D.A. Jackson. 1999. Variable selection in large environmental data sets using

principal components analysis. Environmetrics 10: 67-77.

66

Krebs, C.J. 1998. Similarity coefficients and cluster analysis. Pages 293-323 in Ecological

methodology. Harper and Row, New York.

Legendre, P., and E.D. Gallagher. 2001. Ecologically meaningful transformations for ordination


Legendre, P. and L. Legendre. 1998. Numerical ecology. Elsevier Science B.V., Amsterdam.

853 pp.

Mace, G.M. 1994. Classifying threatened species: means and ends. Philosophical Proceedings of

the Royal Society of London. Series B. 344: 91-97.

Marchant, R. 1990. Robustness of classification and ordination techniques applied to macro-

invertebarte communities from the La Trobe River, Victoria. Australian Jouranl of Marine

and Freshwater Research 41: 493-504

Marchant, R., A. Hirst, R.H. Norris, R. Butcher, L. Mixzeling, and D. Tiller. 1997. Classification

and ordination of macroinvertebrate assemblages from running waters in Victoria,

Australia. Journal of North American Benthological Society 16: 664-681.

Marchant, R. 1999. How important are rare species in aquatic ecology and bioassessment? A

comment on the conclusions of Cao et al. Limnology and Oceanography 44: 1840–1841.

Marchant, R. 2002. Do rare species have any place in multivariate analysis for bioassessment?

Journal of the North American Benthological Society 21: 311-313

Marchant, R, R.H. Norris, and A. Milligan. 2006. Evaluation and application of methods for

biological assessment of streams: summary of papers. Hydrobiologia 572: 1-7.

Margules, C. R. 1986. Conservation evaluation in practice. Pages 297-314 in Wildlife

Conservation Evaluation. Chapman and Hall, London.

McCune, B. and J.B. Grace. 2002. Analysis of ecological communities MjM Software Design,

Gleneden Beach, Oregon.

67

McGarigal, K., S. Cushman, and S. Stafford. 2000. Multivariate statistics for wildlife and

ecology research. New York: Springer-Verlag.

Metzeling, L., D. Tiller, P. Newall, F. Wells, and J. Ree. 2006. Biological objectives for the

protection of rivers and streams in Victoria, Australia. Hydrobiologia 572:287-299.

Mugodo, J., M. Kennard, P. Liston, S. Nichols, S. Linke, R. H. Norris and M. Lintermans. 2006.

Local stream habitat variables predicted from catchment scale characteristics are useful for

predicting fish distribution. Hydrobiologia 572: 59–70.

Nichols, S. and R. H. Norris, 2006. River condition assessment may depend on the sub-sampling

method: field live-sort versus laboratory sub-sampling of invertebrates for bioassessment.

Hydrobiologia 572: 195–213.

Norris, R. H. 1995. Biological monitoring: the dilemma of data analysis. Journal of the North


Norris, R. H., and C. P. Hawkins. 2000. Monitoring river health. Hydrobiologia 435:5-17.

Norris, R. H., and M. C. Thoms. 1999. What is river health? Freshwater Biology 41:197-209.

Paavola, R., T, R. Muotka, R. Virtanen, J. Heino, D.A. Jackson, and A. Maki-Petays. 2006.

Spatial scale affects community concordance among fishes, benthic macroinvertebrates

and bryophytes in boreal streams. Ecological Applications 16:368-379.

Paukert, C.P., and T.A. Wittig. 2002. Applications of multivariate statistical methods in fisheries.

Fisheries 27: 16-22.

Peres-Neto, P. R., and D. A. Jackson. 2001. How well do multivariate data sets match?

Evaluating the association of multivariate biological data sets: comparing the robustness of

Mantel test and a Procrustean superimposition approach. Oecologia 129:169-178.

Peres-Neto, P. R., D. A. Jackson, and K. M. Somers. 2003. Giving meaningful interpretation to

ordination axes: assessing loading significance in principal component analysis. Ecology

84: 2347-2363.

68

Podani, J. 2000. Introduction to the exploration of multivariate biological data. Backhuys

Publishers, Leiden, Netherlands.

Podani, J. 2005. Multivariate exploratory analysis of ordinal data in ecology: pitfalls, problems

and solutions. Journal of Vegetation Science 16: 497-510.


115: 179-185.

Poos, M.S., N.E. Mandrak, and R.L. McLaughlin. 2008. A practical framework for selecting

among single species, multi-species and ecosystem-based recovery plans. Canadian Journal

for Fisheries and Aquatic Science 65: 2656-2666.

Poos, M.S., N.E. Mandrak, and R.L. McLaughlin. 2007. The effectiveness of two common

sampling methods for sampling imperiled freshwater fishes. Journal of Fish Biology 70:

691-708.

Poos, M.S., S.C. Walker, and D.A. Jackson. 2009. Functional diversity indices can be driven by

methodological choices and species richness. Ecology 90: 341-347.

Preston, F. W. 1948. The commonness, and rarity, of species. Ecology 29:254-283.

Oksanen, J., R. Kindt, P. Legendre, B. O'hara, G.L. Simpson, M. Henry, and H. Stevens. 2008.

Vegan: Community ecology package. R package version 1.11-0.

Olden, J. D., D. A. Jackson, and P. R. Peres-Neto. 2001. Spatial isolation and fish communities

in drainage lakes. Oecologia 127:572-585.

OMNR 2007.Stream assessment protocol for southern Ontario. Picton, Ontario: Ontario Ministry

of Natural Resources.

Orloci, L. and M.M. Mukkattu. 1973. The effect of species number and type of data on the

resemblance structure of a phytosociological collection. Journal of Ecology 61:37–46.

69

Rabinowitz, D., S. Cairns, and T. Dillon. 1986. Seven forms of rarity and their frequency in the

flora of the British Isles. Pages 182–204 in M. E. Soule (ed.). Conservation biology: the

science of scarcity and diversity. Sinauer Associates, Sunderland,Massachusetts

Resh, V. H., L. A. Beche, and E. P. McElravy. 2005. How common are rare taxa in long-term

benthic macroinvertebrate surveys? Journal of the North American Benthological Society

24:976-989.

Reynolds, J. B. 1996. Electrofishing. Pages 221-253 in B. R. Murphy and D. W. Willis (ed.).

Fisheries techniques. American Fisheries Society, Bethesda, Maryland.

Reynoldson, T. B., D. M. Rosenberg, and V. H. Resh. 2001. Comparison of models predicting

invertebrate assemblages for biomonitoring in the Fraser River catchment, British

Columbia. Canadian Journal of Fisheries and Aquatic Sciences 58:1395-1410.

Roberts, D.W. 2008. Ordist: Ordination and Multivariate Analysis for Ecology. R package

version 1.3.

Rundle, H. D., and D. A. Jackson. 1996. Spatial and temporal variation in littoral-zone fish

communities. Canadian Journal of Fisheries and Aquatic Sciences 53:2167-2176.

Simpson, J. C., and R. H. Norris. 2000. Biological assessment of river quality: development of

AUSRIVAS models and outputs. Pages 125-142 in J. F. Wright, D. W. Sutcliffe, and M. T.

Furse (ed.). RIVPACS and Similar Techniques for Assessing the Biological Quality of

Freshwaters. Biological Association and Environment Agency, Ambleside, Cumbria, U.K.

Sloane, P. I. W. and R. H. Norris, 2003. Relationship of AUSRIVAS-based macroinvertebrate

predictive model outputs to a metal pollution gradient. Journal of the North American

Benthological Society 22: 457–471.

Somers, K. M., R. A. Reid, and S. M. David. 1998. Rapid biological assessments: how many

animals are enough? Journal of the North American Benthological Society 17:348-358.

Stanfield, L.W. and M.L. Jones. 1998. A comparison of full-station visual and transect-based

methods of conducting habitat surveys in support of habitat suitability index models for

southern Ontario. North American Journal of Fisheries Management, 18: 657-675.

70

Staton, S.K., A. Dextrase, J.L. Metcalfe-Smith, J. DiMaio, M. Nelson, J. Parish, and E. Holm.

2003. Status and trends of Ontario's Sydenham River ecosystem in relation to aquatic

species at risk. Environmental Monitoring and Assessment 88: 283-310.

Wallace, J. B., J. W. Grubaugh, and M. R. Whiles. 1996. Biotic indices and stream ecosystem

processes: results from an experimental study Ecological Applications 6:140-151.

Wartenberg, D., S. Ferson, and F.J. Rohlf. 1987. Putting things in order: A critique of detrended

correspondence analysis. American Naturalist 129: 434-448.

Webb, L. J., J.G. Tracey, W.T. Williams, and G.N. Lance. 1967. Studies in the numerical

analysis of complex rain forest communities. II. The problem of species sampling. Journal

of Ecology 55: 525– 538.

Wright, J. F., D. W. Sutcliffe, and M. T. Furse. 2000. Assessing the biological quality of fresh

water: RIVPACS and other techniques. Freshwater Biological Association, Cumbria, UK.

71

Appendices Appendix 3.1 – Site-level effects of methodological choices in bioassessments. Shown are the ratios between mean site-level vector residuals from Procrustes analyses of sites having species removed and those sites having no species removed. Mean site-level vector residual values were separated by sites which had rare species removed (M1: n=2; M5: n=19; and M10: n=63); and compared with those sites that not.

M1 M5 M10 PCoA-J 13.83 2.58 1.10PCoA- Φ 2.40 1.45 1.03PCoA-RR 9.64 1.70 1.26PCoA-SM 12.61 3.47 1.11NMDS-J 8.66 2.41 0.93NMDS- Φ 9.25 2.00 1.03NMDS-RR 7.91 1.91 1.27NMDS-SM 9.58 1.21 0.79CA 13.97 2.46 1.61PCA* 12.61 3.47 1.11 Overall Average 9.76 2.13 1.13

Note: Abbreviations of various treatments are carried forward from Table 1. *PCA values are shown for comparison, but are not included in overall averages.

72

Appendix 3.2 – Summary of three-dimensional ordination results. Shown are eigenvalues for Principal Coordinates Analyses (PCA) and Correspondence Analyses (CA), with percent variance explained shown in parentheses. Stress values are shown for Non-Metric Multidimensional Scaling (NMDS).

Ordination Technique Axis 1 Axis 2 Axis 3

A-PCoA-J 3.43 (10.67) 2.79 (19.37) 1.67 (24.58) 1-PCoA-J 3.43 (10.67) 2.83 (19.48) 1.68 (24.71) 5-PCoA-J 3.49 (10.99) 2.89 (20.05) 1.70 (25.40) 10-PCoA-J 3.69 (12.01) 3.11 (22.12) 1.70 (27.62) A-PCoA- Φ 4.87 (15.05) 3.97 (27.31) 2.14 (33.93) 1-PCoA- Φ 4.93 (15.09) 4.01 (27.37) 2.17 (34.00) 5-PCoA- Φ 5.23 (15.51) 4.21 (28.00) 2.30 (34.82) 10-PCoA- Φ 6.19 (17.25) 4.40 (29.51) 2.61 (36.79) A-PCoA-RR 2.21 (13.00) 2.06 (25.12) 1.62 (27.02) 1-PCoA-RR 2.23 (13.12) 2.07 (25.29) 1.63 (27.21) 5-PCoA-RR 2.32 (13.65) 2.15 (26.29) 1.65 (28.23) 10-PCoA-RR 2.58 (15.18) 2.30 (28.71) 1.68 (30.68) A-PCoA-SM 2.03 (15.63) 1.65 (28.36) 0.86 (34.95) 1-PCoA-SM 2.10 (15.67) 1.71 (28.41) 0.89 (35.02) 5-PCoA-SM 2.36 (16.04) 1.92 (29.07) 1.00 (35.85) 10-PCoA-SM 3.06 (17.56) 2.37 (31.16) 1.25 (38.36) A-NMDS-J 0.4158 --- --- 1-NMDS-J 0.4183 --- --- 5-NMDS-J 0.4179 --- --- 10-NMDS-J 0.4235 --- --- A-NMDS- Φ 0.4186 --- --- 1-NMDS- Φ 0.4219 --- --- 5-NMDS- Φ 0.4245 --- --- 10-NMDS- Φ 0.4492 --- --- A-NMDS-RR 0.3994 --- --- 1-NMDS-RR 0.3990 --- --- 5-NMDS-RR 0.4012 --- --- 10-NMDS-RR 0.3950 --- --- A-NMDS-SM 0.3830 --- --- 1-NMDS-SM 0.3870 --- --- 5-NMDS-SM 0.3882 --- --- 10-NMDS-SM 0.4120 --- --- ACA--- 0.32 (12.17) 0.22 (20.62) 0.15 (26.17 1CA--- 0.31 (12.24) 0.22 (21.10) 0.14 (26.83) 5-CA--- 0.30 (14.03) 0.22 (24.31) 0.14 (30.99) 10-CA--- 0.23 (14.88) 0.21 (28.98) 0.12 (36.62)

73

Chapter 4: Contrasting direct versus indirect dispersal in metapopulation viability analyses

Abstract Species dispersal is a central component of metapopulation models. Spatially realistic

metapopulation models, such as stochastic patch-occupancy models (SPOMs), quantify species

dispersal using indirect estimates of colonization potential based on inter-patch distance. In this

study, indirect parameterization of SPOMs was compared with dispersal and patch dynamics

quantified directly from empirical data. For this purpose two metapopulations of an endangered

minnow, redside dace (Clinostomus elongatus), were monitored using mark-recapture techniques

across 43 patches, re-sampled across a one year period. More than 2,000 fish were marked with

visible implant elastomer tags coded for patch location and dispersal and patch dynamics were

monitored. Direct and indirect parameterization of SPOMs provided qualitatively similar

rankings of viable patches; however, there were differences of several orders of magnitude in the

estimated intrinsic mean times to extinction, from 24 and 148 years to 362 and >100,000 years,

depending on the population. In several cases, patches that were in close proximity (high

colonization potential) that were not used by redside dace. This study demonstrates the

importance of incorporating species and patch-specific data directly into metapopulation models,

especially given heterogeneous landscapes.

Keywords: metapopulations, dispersal, population viability analysis, stochastic patch-occupancy

models, parameterization.

74

Introduction

Species dispersal is a central component in the study of spatially structured populations. At a

landscape scale, population viability strongly depends on individual dispersal allowing re-

colonisation of empty habitats or patches (Hanski 1999a). For this reason, species dispersal is

considered the ‘glue’ for maintaining local populations within a network of suitable habitats

(Hansson 1991; Vandewoestijne et al. 2004). The degree of dispersal has an impact on local

population dynamics, on gene flow, and on adaptation to local conditions. For example, low

dispersal can foster isolation and local adaptations (Thomas and Hanski 1997; Resetarits et al.

2005). Alternatively, high species dispersal can have a stabilizing effect on metapopulation

dynamics (Hanski 1999a).

Many species with spatially structured populations are in decline, and population viability

models provide a statistical evaluation of species viability to facilitate informed management

decisions (Ackakaya 2000; Nowicki et al. 2007). Metapopulation viability analyses provide a

spatially realistic evaluation of the local population structure (Hanski 2001; March 2008). A

metapopulation is defined as a system of local populations (patches) connected by dispersing

individuals (Hanski and Gilpin 1991). By quantifying patch dynamics, metapopulation viability

analyses can be used to understand better the importance of ecological processes such as species-

specific dispersal, patch quality and landscape influences (Moilanen and Hanski 1998), and to

enhance management through evaluation of minimum amount of habitat or population size

needed to maintain viability (Hanski 1999b; Robert 2009).

Understanding how the parameterization of metapopulation viability analyses may impact results

can inform managers as to the potential areas of concern when developing management

decisions. One popular type of metapopulation viability analysis is stochastic patch-occupancy

models (i.e. SPOMs), which have been used extensively to model the viability of spatially

structured populations (Hanksi 1999; Moilanen 1999). For example, SPOMs were used in

studies of species with conservation concern, such as capercaillie (Grimm and Storch 2000),

American pika (Moilanen et al. 1998) and Glanville fritillary, and silver spotted skipper

butterflies (Hanski et al. 1994). As SPOMs provide a simplification over traditional population-

viability analyses (Akayaka and Sjögren-Gulve 2000), they do not require demographic or stage

75

data, but only occupancy, colonization and extinction rates, which can be estimated readily from

empirical data (Hanski 1994,1999; Moilanen 1999, 2004; Grimm et al. 2004).

The influence of dispersal on metapopulation viability is often evaluated using some

approximation of colonization potential (Verbroom et al. 1993; Hanski and Gilpin 1997; Frank

and Wissel 2002; Heinz et al. 2005). The easiest approach to describe colonization potential (i.e.

patch accessibility) is as a function of distance between a starting patch to a target patch and the

ability of species to disperse (Hanksi 1994; Hanski et al. 1996; Heniz et al. 2005). This

relationship can be quantified in several ways; however, most often this estimation is done by

assuming that colonization potential declines exponentially with distance (i.e. exponential decay;

Hanski 1994; Vos et al. 2001; Frank and Wissel 2002). It is uncertain how well the assumption

of exponential decay can model species-specific dispersal (Hill et al. 1996; Baguette et al.2000;

Heinz et al. 2005). Whether simple formulae are adequate in describing species- and patch-

specific movement in metapopulation models remains an open question (Heinz et al. 2005;

Marsh 2008).

The overall aim of this study is to assess whether direct parameterization of species dispersal can

impact estimates of metapopulation viability. For this assessment, a detailed mark-recapture

survey of the endangered fish, redside dace (Clinostomus elongatus), was conducted in the

Greater Toronto Area, Ontario, Canada. The redside dace is a spatially structured, pool-dwelling

species that is undergoing declines in the majority of its range due to impacts from urbanization

(COSEWIC 2007; Poos and Jackson, submitted). Two locations on the Rouge River were

monitored, including one location on Leslie Tributary, and the other location on Berczy Creek

(Figure 4.1). These locations were shown previously to have among the highest abundances of

redside dace recently sampled across its entire Canadian range (Reid et al. 2008).

76

A) Leslie Tributary

L6

L8L9

L10

L12 L11L13L17

L18L19

L20

L5

L3 L2

L4

L1

L14

L7

100m

N

N

L15

B1 N100m

B) Berczy Creek

B3

B4

B2B5B6

B7

B8

B9

B10B11

B12B13B14

B15B16B17

B18 B19

B20B21

B23B22

L16

0 2 4 8

Figure 4.1 – Study sites on Rouge River, Ontario where redside dace (Clinostomus elongatus) dispersal and patch dynamics were monitored. Study locations: A) Leslie Tributary, and B) Berczy Creek, were sub-divided into extensive sites (black), where redside dace were tagged with a color-coded visual implant elastomer tag, and extended sites (grey), which were monitored for tag movement.

77

Methods The metapopulation dynamics of redside dace were monitored by enumerating the dispersal of

tagged individuals on monthly intervals during a one-year period. For this study, each location

was sub-divided into two areas: intensively monitored sites where individuals were tagged; and,

extended sites, that were beyond those areas where fish were not tagged but where tagged fish

could have potentially moved (Figure 4.1). As meta-populations can be defined in a number of

ways (Hanski 1999a), a metapopulation was defined as an assemblage of local populations

inhabiting spatially distinct habitat patches (Moilanen and Hanski 1998). Redside dace live

primarily in clear water with well-defined pools (COSEWIC 2007); therefore, each spatially

distinct pool segregated by a well-defined riffle (e.g. a passable, but natural migratory barrier)

was selected as a habitat patch. Leslie Tributary was sub-divided into 20, connected and distinct

patches, with 10 intensive sites and five extended sites on both upstream and downstream ends.

Similarly, Berczy Tributary was sub-divided into 13 intensive sites with five extended sites on

each of the upstream and downstream ends (Figure 4.1).

Sampling was conducted using multiple-pass depletion surveys at each pool. Using a twenty-

foot bag seine (1/4” mesh), each site was surveyed until depletion of redside dace, with a

minimum of three sampling events conducted at each site per time period. At each pool, redside

dace were implanted with visual implant elastomer (VIE) tags colour-coded for their location

(Plates 1 and 2). VIE tags were chosen because they had good tag retention and negligible

effects on survival, growth and behavior when used on other species (Dewey and Ziegler 1996;

Goldsmith et al. 2003; Walsh and Winkelman 2004). Tags were injected subcutaneously near

the anal fin on the ventral surface (See Plates 1 and 2). All redside dace were held in well-

oxygenated flow-through bins for 2-4 hours to monitor for potential physiological stress, and

then returned to the river at the site of capture. Both intensive and extended sites were re-

sampled for redside dace at monthly intervals, except under winter-ice conditions (November-

March) and when redside dace were spawning (June) to not disrupt this important life stage for

an endangered species. All redside dace sampled in a recapture event were examined for the

presence of a VIE tag. Redside dace dispersal and metapopulation dynamics were tracked and

mark-recapture data were recorded. If redside dace were re-captured at a new location, they

were subsequently tagged posterior to the existing tag, with a new colour code for the recapture

78

location. The dispersal patterns, such as average distance dispersed, and proportion of stationary

tags, of the metapopulations were compared using non-parametric Mann-Whitney U-tests and

log-linear models (G-tests with Yates continuity correction; Zar 1999) respectively, in the R

language v2.80 (R Development Team 2008).

Determining Metapopulation Viability

The viability of the two stream metapopulations was quantified using direct and indirect

parameterization of stochastic patch-occupancy model (SPOM). SPOMs use a time-continuous

Markov-chain model (Hanski 1999; Grimm et al. 2004). Each patch (i) is assumed to be in one

of two states, vacant (xi =0) or occupied (xi=1). Changes in these states can occur from a patch

becoming vacant due to local extinction (xi: 1 0) or correlated extinction (i.e. regional

stochasticity) from another patch (xj, xi: 1 0). Alternatively, a vacant patch can become

occupied (xi: 0 1) via colonization from another patch (j). The state of the whole

metapopulation (xi, … xn) is given by a vector of states xi of these individual patches. The

metapopulation models were parameterized using a combination of indirect (i.e. dispersal ability

from empirical data and incidence functions; Hanski 1999) and direct parameterization of

SPOMs (using rates of actual patch colonization from empirical data alone). The models were

quantified as follows.

Colonization Rate:

Colonization between two patches i and j (bij) was defined using an incidence-function model

(Hanski 1994): · · exp / I , where y is a parameter, and Mi is the number of

emigrants from pool i. The mean number of emigrants leaving a pool was estimated using data

from the tagging study. To account for the potential uncertainty with missing emigrants leaving a

patch, the probability of detection at each pool (PDi) was quantified using maximum likelihood

from the n-pass depletion surveys (Zippin 1956, 1958) with the Bayesian modification from

Carle and Strub (1978). These were coded in the R v2.80 (R Development Team 2008) using the

fisheries-assessment package FAS (Ogle 2009). The total number of emigrants leaving each

patch per year was measured as ∑ … 1 1 , where is the

uncorrected number of emigrants. Similar to most metapopulation models, a distance-based

dispersal kernel using a negative exponential decay was used, where exp / , and dij is the

im

79

distance from patch i to patch j and d0 is the mean dispersal ability of redside dace. This type of

dispersal kernel has been used extensively in metapopulation models and assumes that patch

accessibility is dependent on distance (Hansson 1991; Hanski et al. 1996; Hokit et al. 1999;

Moilanen 2004). As dispersal data are quantified indirectly using patch area or depth, hereafter,

this approach is referred to as indirect parameterization of the metapopulation model. This type

of indirect parameterization allows researchers to extrapolate relationships in patch occupancy,

often by using species life-history characteristics, without the need of labor-intensive field

studies (Hanski, 1994; Moilanen 2004; Heinz et al. 2005).

A non-linear (polynomial) regression of mean distance of dispersal of recaptured fish through

time was used to identify potential dispersal across the patches. The fits of these non-linear

regressions were highly significant (Leslie Tributary; r2 = 0.92, p < 0.01, Berczy Creek; r2 =

0.88, p < 0.01) and the average distance dispersed (dI) for a one-year period (one time step in

final SPOM model) was 210m for redside dace in Berczy Creek and 150m for Leslie Tributary

(Table 1). As there were several consecutive surveys, it was possible to estimate y from the

number of transitions (i.e. an empty patch becoming occupied and vice versa; Hanski 1999a).

For this, a GLM procedure was used which considered multiple snapshots of the sampling events

using a binomial distribution and logistic-link function developed in the R programming

language v2.80 (R Development Team 2008) using the incidence function (see Oksanen 2004 for

details). The value of the y parameter for Leslie Tributary was 0.0816 and Berczy Creek was

0.0713. Finally, for dij, a distance matrix was measured using the river distance between patches

as calculated by a geographic information system.

Extinction Rate:

Extinction rates can be quantified in many ways (Hanksi 1999a). The simplest form of

determining extinction rate (Ei) is using the area of the patch (Ai), and given by ,

where e defines the extinction probability of a patch of unit size, and x defines the scaling of the

extinction risk with patch area (Hanski 1998; Moilanen 2004). This model assumes that

probability of extinction generally depends on population size, which, in turn, is usually given by

a simple linear or power function of patch area. This relationship has been demonstrated on both

empirical and theoretical grounds (Lande 1993; Foley 1994; Hanski 1994; 1999a; Hanski et al.

1996). Here extinction rates were calculated using patch (i.e pool) depth (d). Patch depth was

80

calculated by taking the average of 60 equidistant point measurements as suggested by the

Ontario Stream Assessment Protocol (OMNR 2007). Patch depth was used because redside dace

are known to be pool-dwelling species (COSEWIC 2007) and, therefore, depth may be more

relevant to model patch dynamics. Indeed, redside dace abundances were more correlated with

patch depth (r=0.44, p=0.0018) than patch area (r = 0.39, p = 0.0048; Poos and Jackson,

unpublished data). The extinction rate was fitted using an incidence function relating species

presence in the patches over time and depth. For this modeling, a GLM procedure was used that

considered multiple snapshots of sampling events using a binomial distribution developed in the

R programming language v2.80 (R Development Team 2008) using the incidence function

(Oksanen 2004). The parameters of the incidence function for Leslie Tributary and Berczy

Creek were x = 0.4926, 0.5652, and e = 3.685, 4.187, respectively.

Incorporating Dispersal Directly into the Metapopulation Model:

Recent theoretical studies on the impact of species movement have found that it can alter

metapopulation viability (Heinz et al. 2005; 2006; Revilla and Wiegand 2008). Therefore, the

incidence function models were extended by incorporating species dispersal directly into the

metapopulation model using empirical data of patch-specific movement. As dispersal data was

directly used to quantify the metapopulation model, hereafter, this is referred to as direct

parameterization of the metapopulation model. For colonization rate, a model developed by

Frank and Wissel (1998; 2002) was used, which (in this case) is identical to the incidence

function model and allows the incorporation of patch dynamics (Grimm et al. 2004; Heinz et al.

2006). This model took into account three processes; emigration of individuals from occupied

patches; dispersal to a target patch; and, the establishment of a new subpopulation on the target

patch. The rate of colonization, bij, was defined as

· · .

0 where, Mi was the number of emigrants leaving the occupied patch i per year

(previously defined), ni was the number of connections from patch i to other patches, rij was the

probability of an individual started at patch i successfully dispersing to patch j, and Ij was the

number of immigrants needed to establish a new subpopulation (Frank and Wissel 1998; 2002).

For the probability of dispersal between patches (rij), a patch-colonization matrix was developed

using the empirical tagging results for each time period. As the tags were colour coded for patch

81

location (at each time period), rij was defined empirically as the ratio of fish that started at patch i

that dispersed to location j, across all recaptured fish. In this instance, tag loss was not accounted

for as no tag-related behavioral response or tagging related mortality was assumed (i.e. this ratio

was adequate given equal likelihood of mortality of a tagged fish versus non-tagged fish). Given

the monitoring data (>75 hours), only minor amounts of tag-related mortality occurred (<0.01%),

and all occurred on the first sampling day (likely due to experience bias). Finally, to quantify Ij,

an incidence-function model was developing using the probability a patch persistence (across all

time periods) given the starting population size (StPopn) of each patch at the start of the study

(t1). For this approach a GLM function was used with binomial distribution and logit link in the

R programming language v2.80 (R Development Team 2008; Appendix 4.1).

Regional Stochasticity

Regional stochasticity refers to the level of correlated extinctions caused by factors influencing a

shared geographic location, such as weather or disease (Lande et al. 1988; Lande 1993; Foley

1994). Regional stochasticity has the ability to impact metapopulation viability by incorporating

the influence of the fate of proximal patches. The influence of regional stochasticity was

quantified at three levels: 0 (no influence of regional stochasticity); 0.1 (a moderate level of

regional stochasticity); and, 0.2 (more severe regional stochasticity).

Comparing Viability of Metapopulations Using Direct versus Indirect Parameterization

The ultimate viability of patch (i) was defined using the intrinsic mean time to extinction Tm =

1/λ, determined using the reciprocal value of the overall extinction rate λ calculated using a plot

of −ln(1 − P0(t)), where P0 is the probability of extinction at a given time (t) (Verbroom et al.

1991; Grimm et al. 2004; Grimm and Wissel 2004). Intrinsic mean time to extinction has been

previously shown to be an adequate currency in assessing the viability of metapopulations and

can be easily extracted from simulation data (Frank and Wissel 1998; Grimm and Wissel 2004;

Heinz et al. 2006). Transitions in metapopulations were simulated 10,000 times using ‘stochastic

time steps’ (Frank et al. 2004; Grimm et al. 2004) of transition probabilities of extinction and

colonization rates. For this estimation a manually created sub-routines was created in the

software program Meta-X (Frank et al. 2003; Grimm et al. 2004), a metapopulation program

flexible for incorporating behavior into metapopulation-viability analysis (Heinz et al. 2006).

82

Results In total, 2,141 redside dace were tagged and monitored across 43 patches during in a one-year

period from 2007-2008. Due to logistical issues, the stream systems were not sampled during

winter-ice conditions (November-March) or when redside dace were spawning in June given the

potential to disrupt spawning activities. Recapture rates for redside dace - calculated as the

proportion of fish marked during the preceding marking period that were recaptured - were

generally high (>25%) during the initial four monitoring events, ending in October 2007. These

numbers were greatly reduced by the following spring, with recapture rates < 10% likely due to

high over-winter mortality or the re-distribution of tagged fishes to areas beyond the study

location (Table 4.1). In addition, the capture efficiency – as determined by probability of

detection using n-pass depletion surveys (Zippin et al. 1954; 1956; Carle and Strub 1978) – was

also very high for both study systems: Leslie Tributary (mean 71%); and, Berczy Creek (mean

65.6%, Appendix 4.1).

Table 4.1 – Summary of mark-recapture information of the endangered fish the reside dace (Clinostomus elongatus) used to directly parameterize stochastic patch occupancy metapopulation models. Shown are two locations in the Greater Toronto Area, Ontario, Canada: A) Leslie Tributary, and B) Berczy Creek. Note: items denoted with a single asterisk (*) represent a significant difference between populations (Mann-Whitney U for average dispersal, G-test for stationary tags, p<0.05).

A) Leslie Tributary B) Berczy Creek

Time Cum. Tags

Recap. (%)

Avg. (m)

Max. (m)

Stationary Tags (%)

Cum. Tags

Recap. (%)

Avg. (m)

Max. (m)

Stationary Tags (%)

T1: July 133 0.46 21* 175 0.67 342 0.61 9* 125 0.74 T2: Aug. 305 0.26 91 227 0.19 511 0.26 84 290 0.23 T3: Sept. 404 0.28 71 547 0.52 770 0.26 66 357 0.44 T4: Oct. 483 0.26 139 680 0.03* 1,045 0.25 77 315 0.41* T5: Apr. 503 0.04 196* 680 0.00 1,137 0.03 129* 411 0.00 T6: May 542 0.09 182 649 0.14 1,376 0.03 174 411 0.10 T7: July 662 0.10 180* 547 0.17 1,479 0.06 82* 275 0.29 Avg. 0.14 105 0.31 0.18 53 0.49

Legend: Cum. Tags refers to the cumulative number of tags released in the population; Recap (%) refers to the recapture rate for cumulative tags, Avg. (m) refers to the average distance the recaptured tags were captured at, Max. (m) refers to the maximum distance recaptured tags were captures at, and Stationary Tags (%) refers to the percentage of tags, at each time interval, that were recaptured in the same location as tagged.

Metapopulation Dynamics

In all cases, dispersal was higher in Leslie Tributary as compared with the Berczy Creek;

however, this difference was only statistically significant in time periods 1, 5, 7 (Mann-Whitney

U-test; p=0.015, 0.004, 0.033, respectively; Table 4.1). The difference in dispersal was not due

83

to more tags dispersing as there was no significant difference in the proportion of stationary tags,

except for time period 4 (G-test, p=0.0003; Table 4.1). More likely, the increase in dispersal was

due to larger average dispersal per individual, as indicated by mean and maximum dispersal

through time (Table 4.1).

Metapopulation and Patch Viability

Metapopulation viability, as indicated by both probability of extinction through time and the

intrinsic mean time to extinction, were orders of magnitude different depending on whether the

patch-occupancy model was parameterized indirectly using colonization potential or directly

using observed colonization (Figure 4.2; Table 4.2). For example, when the patch-occupancy

models were parameterized directly using observed colonization, the Leslie Tributary population

was inviable long-term and the Berczy Creek showed much longer viability. The intrinsic mean

time to extinction for Leslie Tributary was 24 years, and occurred in as little as 12 years

(regional stochasicity set at 0, 0.2, respectively). The probability of extinction was plotted over

time and showed that 95% of the simulations were extinct in less than 100 years. Similarly, the

intrinsic mean time to extinction for Berczy Creek was calculated as a maximum of 148 years

and occurred in as little as 32 years (regional stochasticity = 0, 0.2 respectively), with 95% of

simulations showing metapopulation extinction in under 1000 years (Figure 4.2; Table 4.2).

84

Table 4.2 – Intrinsic mean time (in years) to extinction (Grimm and Wissel 2004) of two metapopulations of the endangered fish the redside dace (Clinostomus elongatus) for different levels of regional stochasticity.

A) Leslie Tributary B) Berczy Creek

Regional Stochasicity 0 0.1 0.2 0 0.1 0.2

Intrinsic mean time to extinction (Indirect Parameterization)

362 95 48 109,594 3,417 764

Intrinsic mean time to extinction (Direct Parameterization)

24 17 12 148 54 32

When the patch-occupancy models were parameterized using indirect colonization, based on

estimated dispersal ability, one population (Berczy Creek) was deemed as viable and quasi-

stationary (regional stochasticity = 0; Figure 4.2; Table 4.2), with 95% of simulations showing

viability beyond 250,000 years (Figure 4.2). The remaining population estimates varied

considerably in their viability, ranging in intrinsic mean times to extinction from 48 to 348 years

in Leslie Tributary and from 764 to >109,000 years in Berczy Creek (regional stochasticity 0.2,

0, respectively).

85

I) Direct parameterization of dispersal II) Indirect parameterization of dispersal

A)

B)

YEARS

Figure 4.2 – Metapopulation viability of the endangered species the redside dace (Clinostomus elongatus) in two stream metapopulations: A) Leslie Tributary, and B) Berczy Creek. Shown are the probabilities of extinction (y-axis) in years (x-axis) of a stochastic patch-based metapopulation model. Models were parameterized using: I) indirect parameterization of colonization and dispersal via patch distance, and; II) direct parameterization of colonization and dispersal using empirical estimates from a mark-recapture study. Legend: Vertical hashes represent a time interval of 100 years, solid lines indicate population trajectories where regional stochasticity was set to 0, dashed lines set to 0.1 and dotted lines set at 0.2.

86

Specific-patch viability mirrored overall metapopulation viability, with all patches showing

reduced viability when parameterized directly from empirical data (Figure 4.3). In all cases

(except L12) patch viability was over-estimated with the indirect parameterization relative to

direct parameterization of the SPOM (Figure 4.3). Mean patch viability was significantly higher

with indirect parameterization of the SPOM for both Leslie Tributary (mean indirect patch

viability = 0.51 ± 0.10, mean direct patch viability = 0.35 ± 0.08; Welch’s t-test; t= 26.85, p-

value << 0.0001) and Berczy Creek (mean indirect patch viability = 0.69 ± 0.09, mean direct

patch viability = 0.39 ± 0.08; Welch’s t-test; t = 11.204, p-value << 0.0001).

0

0.5

1

0 0.5

B15L13

B14

L14

L12

L9

B16B12

B9

B10 B8L10

L6

B7B13

B6B18 B11

L11

L7

1

Patc

h V

iabi

lity

(Ind

irect

Par

amet

eriz

atio

n)

Patch Viability (Direct Parameterization)

L15B17

L8

Figure 4.3 – Differences in patch viability parameterized using indirect (y-axis) and direct (x-axis) patch dynamics of the endangered species the redside dace (Clinostomus elongatus) in two stream metapopulations: A) Leslie Tributary (L6-L15), and B) Berczy Creek (B6-B18). Shown are the mean probabilities of persistence of a given patch across 10,000 simulations. To demonstrate the variability in patch viability, 25% quantiles are overlaid as the negative of both the vertical and horizontal axes, while 75% quantiles are overlaid as the positive vertical and horizontal axes. The dashed line is a 1:1 line.

87

Interestingly, the rankings of patch viability did not markedly differ based on the

parameterization of the SPOM. For example, both indirect and direct parameterization of the

SPOM identified the same five most-viable patches per population (overall) as: L6, L8, L10,

L11, L9; and, B7, B13, B6, B11, B18. One notable difference in patch viability was that several

patches that were in close proximity to good-quality patches had significantly lower viability

when parameterized using direct parameterization (Figure 4.3). Patches such as L9, L11, B6 and

B9 had reduced viabilities when directly parameterized (mean indirect viability = 0.57, 0.84,

0.86, 0.90, respectively; mean direct viability = 0.28, 0.46, 0.63, 0.17; Figure 4.3).

Discussion Metapopulation-viability models have a long history of use (e.g. Levins 1969, 1970) and provide

advantages over traditional population-viability analyses (PVAs). One clear advantage of using

stochastic patch-occupancy metapopulation models (SPOMs) over traditional PVAs (e.g.

structured models, demographic models; (Akcakaya and Sjorgen-Gulve 2000; Morris and Doak

2002) is that they require the parameterization of fewer variables (Ovaskainen and Hanski 2004).

This reduction is especially advantageous for modeling endangered species, where enumeration

is complicated by rarity and where greater uncertainty exists. Simplification is often needed as

incorporating species-specific or demographic data into ecological studies can be difficult, time

consuming, or not economically possible. Stochastic patch-occupancy models allow for

simplification of metapopulations, as only patch occupancy, colonization and extinction rates are

needed, even within a single snapshot (Moilanen et al. 2004; Marsh 2008).

There is a tradeoff between simplification of PVAs by using less parameters and with the added

value and information that those parameters may have (Shreeve et al. 2004). This study

demonstrates that differences in species dispersal patterns, a key component of metapopulation

models such as SPOMs, have the ability to dramatically impact estimates of metapopulation

viability. Further, this study demonstrates that direct parameterization of species-specific

dispersal can reduce the overall estimates of viability of redside dace metapopulations by several

orders of magnitude over estimates using indirect estimates (e.g. exponential-decay kernels). In

several cases (e.g. L9, L11, B6 and B9), patches that were in close proximity to good-quality

patches (i.e. had high dispersal potential and high viability with indirect parameterization) but

had significantly lower viability when parameterized using direct parameterization (Figure 4.3).

88

From these results it may be inferred that the study patches are likely distributed across a

heterogeneous landscape where occupancy rates or habitats may not be equal, as assumed by

SPOMs. Landscape and patch heterogeneity has been shown to impact colonization potential,

which alters metapopulation dynamics (Gustafson and Gardner 1996; Heinz et al. 2005). Even if

habitats were equal, studies of the rosyside dace (Clinostomus funduloides), a sister species,

demonstrated that dispersal could not be predicted by suitable habitats (Freeman and Grossman

1993). These conclusions are in agreement with others which indicate that better integration of

species-specific behaviour is needed into the analyses of metapopulations (Tischendorf 2001;

Tischendorf and Fahrig 2001; Vos et al. 2001; Heinz et al. 2005, 2006; Baguette and vanDyck

2007; Marsh 2008).

There is an ongoing debate whether SPOMs are useful in cases where the assumptions of classic

(i.e. Levin’s type) metapopulations are not met (see Levins 1969; Harrison 1994; Baguette 2004;

Hanski 2004; Shreeve et al. 2004). For example, empirical studies have demonstrated large

temporal variation in patch dynamics (i.e. colonization and extinction rates) can lead to

sensitivity in SPOMs (Crone et al. 2001; Thomas et al. 2002). In addition, there are few

empirical examples of metapopulations that meet the assumption of a constant pulse of

extinction-colonization (e.g. pool frog, Sjogren-Gulve 1991; Glanville fritillary butterfly, Hanski

et al. 1994; but see Nowicki et al. 2007); whereas, the vast majority do not (Harrison 1994;

Baguette et al 2004). However, recently SPOMs have been shown to be appropriate for use over

a range of spatially structured populations from classic metapopulations to species found in

fragmented landscapes with patchy distributions (Ovaskainen and Hanski 2004). For example,

using a SPOM with individual-based background, Ovaskainen and Hanski (2004) demonstrated a

unifying framework for incorporating metapopulation dynamics into SPOMs. Their study

suggested that instead of attempting to identify metapopulation types, research should focus on

relevant processes, such as dispersal. By understanding processes behind the variability shown

between (and within) patch dynamics, the conservation and management of endangered species

can be improved (Revilla and Wiegand 2008).

Comparative simulations of how metapopulation analyses perform when altered are important

tools for ensuring appropriate management action. Interestingly, despite finding large

differences in the prediction of intrinsic mean time to extinction (Table 4.2), this study

demonstrates that regardless of how the SPOMs were parameterized (directly or indirectly), they

89

identified qualitatively similar patches as having the highest viability (Figure 4.2). Such results

are reassuring given that SPOMs rarely are used for exact quantitative analyses, but rather used

to compare among several scenarios to develop decision support (Lindenmayer and Possingham

1996; Heinz et al. 2006). These results suggest that qualitative comparisons of SPOMs may still

be a fruitful management option; however, care should be taken with estimates of probability of

extinction through time, which may be over-estimated (as shown here). Indeed, others have

shown SPOMs to be comparable to other spatially realistic models (Kindvall et al. 2000; Keeling

2002; Ovaskainen and Hanski 2004). Modifications to SPOMs, such as consideration of patch

quality (Hanski and Moilanen 1998; Ovaskainen and Hanski 2002), improved dispersal metrics

(Ovaskainen 2004; Heinz et al. 2005), incorporation of transition states (Thomas and Hanski

2004), rigorous parameter-estimation techniques (Moilanen 1999; Dreschler et al. 2003), and/or

direct parameterization of dispersal data, will only ensure better integration of biological data

into SPOMs.

Studying metapopulations in stream settings is challenging and has limitations which should be

noted. Fagan (2002) demonstrated how a dendritic network can provide additional isolation of

patches not encountered in terrestrial landscapes. Further, Gotelli and Taylor (1999)

demonstrated how stream fishes may not fit Levin’s type metapopulation models as migration

may be asynchronous in upstream versus downstream movement. Finally, mark-recapture

studies have shown that tagged individuals may travel outside recapture territory, thereby,

reducing estimates of overall colonization (Ovaskainen 2004). All the above examples may

explain the lower viability using direct parameterization of SPOMs in this study. In response, the

study design accounted for such potential shortcomings in several ways. First, the sampling was

done sequentially, across the entire stream network (using block nets during sampling to

eliminate movement induced by the sampling of each pool). Therefore, unlike most studies in

terrestrial systems, this study was able to monitor the stream metapopulations within defined

boundaries, sampling the entire patch, as well as neighboring patches and connections. Second,

this study re-sampled the patches at 7 time intervals to determine intra-and inter- annual

asynchrony in movement and rate of patch fidelity. In general asynchrony in movement was

identified at a single time period (T4; G-test, p<0.05); however, there was also variation in patch

fidelity (Table 4.1). This variation in patch fidelity indicates that redside dace may be more

consistent with the patchy population model (Harrison 1991) than the with a Levin’s type

90

metapopulation model (Levins 1969). Temporal variation in patch fidelity was also shown in

metpopulations of rosyside dace (Freeman and Grossman 1993). Third, in this study the

boundaries of the recapture locations were extended to include patches beyond the marking

locations (~ 350 meters on upstream and downstream ends per location; Figure 4.1). These

extended patches allowed both average dispersal of tagging individuals and the rates of dispersal

out of their marking locations to be determined. Regardless of time period, > 75% of tags were

recaptured within 350 m meters of their starting pool for Leslie Tributary and > 90% of tags in

Berczy Creek, suggesting that the monitoring was done at the appropriate scale. Finally, this

study accounted for the probability of missing tags due to sampling bias by correcting for

abundance counts using probability of detection. These approaches suggest that, although

asynchrony (and perhaps over-winter mortality) may be an issue, the study design was adequate

to monitor species and patch specific movement.

Understanding how species- and patch-specific qualities can alter SPOMs is an important area

for advancing metapopulation-viability analyses, which are a crucial tool for the management of

endangered species (Ackakaya 2000; Morris and Doak 2002; McCoy and Mushinsky 2007).

Species- and patch-specific processes can alter metapopulation dynamics in many ways

(Roitberg and Mangel 1997; Hanski and Moilanen 1998; Schtickzelle et al. 2006). Patch quality

can affect both the probabilities of colonization and extinction of an empty patch (Hanski and

Moilanen 1998; Thomas et al. 2002). Changes in landscape structure can alter migration

pathways between patches, which may impact dispersal (Roitberg and Mangel 1997;

Schtickzelle et al. 2006; North and Ovaskainen 2007). Finally, the configuration of patches may

play a role in population viability (Robert 2009). Maintenance not only of high-quality patches,

but their connectivity, should be an important aspect of endangered species management,

including redside dace.

Accounting for regional and stochastic processes are important considerations for the

management of endangered species. In this study, when the rate of regional stochasticity was

altered, the intrinisic mean time to extinction quickly decreased (Table 4.2). The impact of

regional stochasticity was small in cases where populations were already considered to be

inviable (e.g. Leslie Tributary with 0 stochasticity), but it had a large impact in cases where a

population was viable or quasi-stationary (e.g. Berczy Creek with 0 stochasticity; Figure 4.2).

For example, the Berczy Creek population was considered to be viable if regional stochasticity

91

was ignored. Altering rates of regional stochasticity from 0 to 0.1 and 0.2 caused the populations

to move towards extinction (intrinsic mean time to extinction from >109,000 to 54 and 32 years,

respectively). These results indicate that regional stochastic factors (e.g. weather, drought) that

may alter patch dynamics and have undue influence on metapopulation viability, as others also

have shown (Grossman et al. 1982; 1985; Lande 1993; Foley 1994; Robert 2009).

The conservation applications of species-specific dispersal may help inform management

decisions. By incorporating species- and patch-specific data directly into metapopulation

models, managers may be better apt at determining the relative importance of spatial and

temporal factors, such patch connectivity and seasonal impacts. In this study, empirical

estimation of patch viability was shown to be qualitatively similar when parameterizing SPOMs,

but that estimates of metapopulation viability were shown to be significantly higher when using

indirect parameterization of dispersal. These results indicate that care is needed in ensuring that

even simplified metapopulation models, such as SPOMs, are consistent with biological data.

Comparisons of how species- and patch-specific data directly impact metapopulation models, as

done here, may be one way to accomplish this cautionary aspect (Dreschler et al. 2003; Grimm et

al. 2004). Further study into the impact of species-specific behavior and patch dynamics on

metapopulation viability will provide additional insight for both management and conservation

issues.

92

Acknowledgements

Funding was provided by NSERC Canada and OGS Scholarships to M.S.P., an NSERC

Discovery Grant to D.A.J., Interdepartmental Recovery Fund #1410 provided by Fisheries and

Oceans (DFO), the Ontario Ministry of Natural Resources (OMNR), and the University of

Toronto. All field work was conducted under an approved Animal Care Protocol (# 20006805)

from the University of Toronto Animal Care Committee; and under the approval of the redside

dace recovery team (chair A. Dextrase) and with the guidance of the Ontario Ministry of Natural

Resources (J. Pisapio). The Toronto Region Conservation Authority (TRCA) Aquatics Group

(Christine Tu, David Lawrie, Tim Rance) provided logistical support and help during this

project. Field work was conducted by a dedicated group of volunteers from- the University of

Toronto – A. Drake, C. Harpur, B. Edwards, M. St. John, M. Neff, P. Venturelli, A. Manning, M.

Granados, J. Ruppert, S. Sharma, N. Puckett, C. Hart, C. Howard, M. Luksenberg, J. Brett, S.

Walker – and the Toronto Region Conservation Authority – D. Lawrie, C. Tu, T. Parker, T.

Rance, B. Paul, E. Elton, B. Stephens, C. Hart, L. DelGiudice, B. Moyle, and M. Parish – and P.

Ng, M. Ken, C. Zehr, B. Edwards, Y. Nozoe, D. Trim, K. Lee, D. Morodvanschi and D. Forder

(Ontario Streams). This work was improved by discussion with Marie-Josee Fortin and review of

earlier drafts of this manuscript. Finally reviewers were helpful for in providing valuable

suggestions.

References

Akçakaya, H. R. 2000. Viability analyses with habitat-based metapopulation models Population

Ecology 42:45-53.

Akçakaya, H. R., and P. Sjögren-Gulve. 2000. Population viability analysis in conservation

planning: an overview. Ecological Bulletins 48:9-21.

Baguette, M. 2004. The classical metapopulation theory and the real, natural world: a critical

appraisal. Basic and Applied Ecology 5:213-224.

Baguette, M., S. Petit, and F. Queva. 2000. Population spatial structure and migration of three

butterfly species within the same habitat network: consequences for conservation. Journal

of Animal Ecology 37:100-108.

93

Baguette, M., and H. VanDyck. 2007. Landscape connectivity and animal behavior: functional

grain as a key determinant for dispersal. Landscape Ecology 22:1117-1129.

Carle, F. L., and M. R. Strub. 1978. A new method for estimating population size from removal

data. Biometrics 34:621-630.

COSEWIC. 2007. COSEWIC Assessment and update status report on the redside dace

Clinostomus elongatus in Canada. Committee on the Status of Endangered Wildlife in

Canada, Ottawa, ON.

Crone, E. E., D. Doak, and J. Pokki. 2001. Ecological influences on the dynamics of a field vole

metapopulation. Ecology 82:831-843.

Dewey, M. R., and S. J. Ziegler. 1996. An evaluation of flourescent elastomer for marking

bluegills in experimental studies. Progressive Fish-Culturist 58:219-220.

Dreschler, M., K. Frank, I. Hanski, R. B. O’Hara, and C. Wissel. 2003. Ranking metapopulation

extinction risk: From patterns in data to conservation management decisions. Ecological

Applications 13:990-998.

Fagan, W. F. 2002. Connectivity, fragmentation, and extinction risk in dendritic

metapopulations. Ecology 83:3243-3249.

Fahrig, L. 1992. Relative importance of spatial and temporal scales in a patchy environment.

Theoretical Population Biology 41:300-314.

Foley, P. 1994. Predicting extinction times from environmental stochasticity and carrying

capacity. Conservation Biology 8:124-137.

Frank, K., and C. Wissel. 1998. Spatial aspects of metapopulation survival: from model results to

rules of thumb for landscape management. Landscape Ecology 13:363-379.

Frank, K., and C. Wissel. 2002. A formula for the mean lifetime of metapopulations in

heterogeneous landscapes. The American Naturalist 159:530-552.

94

Freeman, M. C., and G. D. Grossman. 1993. Effects of habitat availability on dispersion of a

stream cyprinid Environmental Biology of Fishes 37:121-130.

Ghimire, S. K., O. Gimenez, R. Pradel, D. McKey, and Y. Aumeeruddy-Thomas. 2008.

Demographic variation and population viability in a threatened Himalayan medicinal and

aromatic herb Nardostachys grandiflora: matrix modelling of harvesting effects in two

contrasting habitats. Journal of Applied Ecology 45:41-51.

Goldsmith, R. J., G. P. Closs, and H. Steen. 2003. Evaluation of visible implant elastomer for

individual marking of small perch and common bully. Journal of Fish Biology 63:631-

636.

Gotelli, N. J., and C. M. Taylor. 1999. Testing macroecology models with stream-fish

assemblages. Evolutionary Ecology Research 1:847-858.

Grimm, V., and I. Storch. 2000. Minimum viable population size of capercaillie Terao urogallus:

Results from a stochastic model. Wildlife Biology 5:219-225.

Grimm, V., and C. Wissel. 2004. The intrinsic mean time to extinction: a unifying approach to

analysing persistence and viability of populations. Oikos 105:501-511.

Grimm, V., H. Lorek, J. Finke, F. Koester, M. Malachinski, M. Sonnenschein, A. Moilanen, I.

Storch, A. Singer, C. Wissel, and K. Frank. 2004. META-X: generic software for

metapopulation viability analysis. Biodiversity and Conservation 13:165-188.

Grossman, G. D., M. C. Freeman, P. B. Moyle, and J. O. Whittakers. 1985. Stochasticity and

assemblage organization in an Indian stream fish assemblage. American Naturalist

126:275-285.

Grossman, G. D., P. B. Moyle, and J. O. Whittakers. 1982. Stochasticity in structural and

functional characteristics of an Indiana stream fish assemblage: A test of community

theory. American Naturalist 120:423-454.

Gustafson, E. J., and R. H. Gardner. 1996. The effect of landscape heterogeneity on the

probability of patch colonization. Ecology 77:94-107.

95

Hanski, I. 1994. A practical model of metapopulation dynamics. The Journal of Animal Ecology

63:151-162.

Hanski, I. 1998. Connecting the parameters of local extinction and metapopulation dynamics.

Oikos 83:390-396.

Hanski, I. 1999a. Metapopulation Ecology. Oxford University Press, New York.

Hanski, I. 1999b. Habitat connectivity, habitat continuity, and metapopulations in dynamic

landscapes. Oikos 87:209-219.

Hanski, I. 2004. Metapopulation theory, its use and misuse. Basic and Applied Ecology 5:225-

229.

Hanski, I., and M. E. Gilpin. 1997. Metapopulation biology: Ecology, genetics, and evolution.

Academic Press, San Diego, California.

Hanski, I., M. Kuussaari, and M. Nieminen. 1994. Metapopulation structure and migration in the

butterfly Melitaea Cinxia. Ecology 75:747-762.

Hanski, I., A. Moilanen, T. Pakkala, and M. Kuussaari. 1996. The quantitative incidence

function model and persistence of an endangered butterfly metapopulation. Conservation

Biololgy 10:578-590.

Hansson, L. 1991. Dispersal and connectivity in metapopulations. Biological Journal of the

Linnean Society 42:89-103.

Harrison, S. 1991. Local extinction in a metapopulation context. Biological Journal of the

Linnean Society 42:73-888.

Heinz, S. K., L. Conradt, C. Wissel, and K. Frank. 2005. Dispersal in fragmented landscapes:

Deriving a practical formula for patch accessibility. Landscape Ecology 20:83-99.

Heinz, S. K., C. Wissel, and K. Frank. 2006. The viability of metapopulations: individual

dispersal behaviour matters. Landscape Ecology 21:77-89.

96

Hill, J. K., C. D. Thomas, and O. T. Lewis. 1996. Effects of habitat patch size and isolation on

dispersal by Hesperia comma butterflies: implications for metapopulation structure.

Journal of Animal Ecology 65:725-735.

Hokit, D. G., B. M. Stith, and L. C. Branch. 1999. Effects of landscape structure in Florida scrub:

a population perspective. Ecological Applications 9:124-134.

Keeling, M. J. 2002. Using individual-based simulations to test the Levins metapopulation

paradigm. Journal of Animal Ecology 71:270-279.

Kindvall, O. 2000. Comparative precision of three spatially realistic simulation models of

metapopulation dynamics. Ecological Bulletins 48:101-110.

Lande, R. 1993. Risks of population extinction from demographic and environmental

stochasticity and random catastrophes. The American Naturalist 142:911.

Lande, R., S. Engen, and B. E. Saether. 1988. Extinction times in finite metapopulation models

with stochastic local dynamics. Oikos 83:383-389.

Levins, R. 1969. Some demographic and genetic consequences of environmental heterogeneity

for biological control. Bulletin of the Entomological Society of America 71:237-240.

Marsh, D. 2008. Metapopulation viability analysis for amphibians. Animal Conservation 11:463-

465.

McCoy, E. G., and H. R. Mushinsky. 2007. Estimates of minimum patch size depend on the

method of estimation and the condition of habitat. Ecology 88:1401-1407.

Moilanen, A. 1999. Patch occupancy models of metapopulation dynamics: Efficient parameter

estimation using implicit statistical inference. Ecology 80:1031-1043.

Moilanen, A. 2004. SPOMSIM: software for stochastic patch occupancy models of

metapopulation dynamics. Ecological Modelling 179:533-550.

Moilanen, A., and I. Hanski. 1998. Metapopulation dynamics: effects of habitat quality and

landscape structure. Ecology 79:2503-2515.

97

Moilanen, A., A. T. Smith, and I. Hanski. 1998. Long-term dynamics in a metapopulation of the

American pika. The American Naturalist 152:530-542.

Morris, W. F., and D. F. Doak. 2002. Quantitative conservation biology: Theory and practice of

population viability analysis. Sinauer Associates Inc., Sutherland, MA.

Nowicki, P., A. Pepkowska, J. Kudlek, P. Skorka, M. Witek, J. Settele, and M. Woyciechowski.

2007. From metapopulation theory to conservation recommendations: Lessons from

spatial occurrence and abundance patterns of Maculinea butterflies. Biological


North, A., and O. Ovaskainen. 2007. Interactions between dispersal, competition and landscape

heterogeneity. Oikos 116:1106-1119.

Ogle, D.H. 2009. Functions to support fish stock assessment textbook (v.0.0-8). Accessed online

(January 12, 2009): http://www.ncfaculty.net/dogle/

Oksanen, J. 2004. Incidence function model in R. Accessed online (February 12, 2009):

http://cc.oulu.fi/~jarioksa/opetus/openmeta/metafit.pdf

OMNR (Ontario Ministry of Natural Resources). 2007. Stream assessment protocol for southern

Ontario. Ontario Ministry of Natural Resources, Picton, Ontario.

Ovaskainen, O. 2004. Habitat-specific movement parameters estimated using mark-recapture

data and a diffusion model. Ecology 85:242-257.

Ovaskainen, O., and I. Hanski. 2002. Transient dynamics in metapopulation response to

perturbation. Theoretical Population Biology 61:285-295.

Ovaskainen, O., and I. Hanski. 2004. From individual behavior to metapopulation dynamics:

Unifying the patchy population and classic metapopulation models. American Naturalist

164:364-377.

Poos, M.S., and D.A. Jackson. 2009. Conservation by consensus: Reducing uncertainties in

modeling the distribution of an endangered species using habitat-based ensemble models.

Ecological Applications (Submitted 2009-06-08: #09-1012).

http://www.ncfaculty.net/dogle/

http://cc.oulu.fi/%7Ejarioksa/opetus/openmeta/metafit.pdf

98

R Development Core Team. 2008. A language and environment for statistical computing. R

Foundation for Statistical Computing, Vienna, Austria. (ISBN: 3-900051-07-0).

Reid, S. M., N. E. Jones, and G. Yunker. 2008. Evaluation of single-pass electrofishing and rapid

habitat assessment for monitoring redside dace. North American Journal of Fisheries

Management 28:50-56.

Resetarits, W. J., C. A. Binckley, and D. R. Chalcraftet. 2005. Habitat selection, species

interactions, and processes of community assembly in complex landscapes: a

metacommunity perspective. Pages 374-398 in M. Holyoak, M. A. Leibold, and R. D.

Holt, editors. Metacommunities - spatial dynamics and ecological communities. The

University of Chicago Pressz, Chicago, Illinois.

Revilla, E., and T. Wiegand. 2008. Movement ecology special feature: Individual movement

behavior, matrix heterogeneity, and the dynamics of spatially structured populations.

Proceedings of the National Academy of Science 105:19120-19125.

Robert, A. 2009. The effects of spatially correlated perturbations and habitat configuration on

metapopulation persistence. Oikos (Online Early, doi: 10.1111/j.1600-

0706.2009.17818.x).

Roitberg, B. D., and M. Mangel. 1997. Individuals on the landscape: Behavior can mitigate

landscape differences among habitats. Oikos 80:234-240.

Schtickzelle, N., G. Mennechez, and M. Baguette. 2006 Dispersal depression with habitat

fragmentation in the bog fritillary butterfly. Ecology 87:1057-1065.

Shreeve, T. G., R. L. H. Dennis, and H. VanDyck. 2004. Resources, habitats and

metapopulations - whither reality? Oikos 106:404-408.

Sjogren-Gulve, P. 1991. Extinctions and isolation gradients in metapopulations: the case of the

pool frog (Rana lessonae). Biological Journal of the Linnean Society 42:135-147.

Thomas, C. D., and I. Hanski. 1997. Butterfly metapopulations. Pages 359-386 in I. Hanski and

M. E. Gilpin, editors. Metapopulation Biology: Ecology, Genetics and Evolution.

Academic Press, London.

99

Thomas, C. D., R. J. Wilson, and O. T. Lewis. 2002. Short-term studies underestimate 30-

generation changes in a butterfly metapopulation. Proceedings of the Royal Society of

London B 269:563-569.

Tischendorf, L. 2001. Can landscape indices predict ecological processes consistently?

Landscape Ecology 16:235-254.

Tischendorf, L., and L. Fahrig. 2001. On the use of connectivity measures in spatial ecology. A

reply. Oikos 95:152-155.

Vandewoestijne, S., T. Martin, S. Liégeois, and M. Baguette. 2004. Dispersal, landscape

occupancy and population structure in the butterfly Melanargia galathea. Basic and

Applied Ecology 5:581-591.

Verboom, J., J. A. J. Metz, and E. Meelis. 1993. Metapopulation models for impact assessment

of fragmentation. Pages 172-192 in C. C. Vos and P. Opdam, editors. Landscape ecology

of a stressed environment. Chapman and Hall, London.

Verboom, J., K. Lankester, and J. A. J. Metz. 1991. Linking local and regional dynamics in

stochastic metapopulation models. Biological Journal of the Linnean Society 42:39-55.

Vos, C. C., J. Verboom, P. F. M. Opdam, and C. J. F. ter Braak. 2001. Toward ecologically

scaled landscape indices. American Naturalist 157:24-41.

Walsh, M. G., and D. L. Winkelman. 2004. Anchor and visible implant elastomer tag retention

by hatchery rainbow trout stocked into an Ozark stream. North AMerican Journal of

Aquaculture 24:1435-1439.

Zar, J.H. 1999. Biostatistical analysis. Upper Saddle River, New Jersey: Prentice Hall.

100

Appendices

Appendix 4.1 –The endangered redside dace (Clinostomus elongatus). Photo credit: Mark Poos.

Appendix 4.2 –Visual implant elastomer (VIE) tag inserted subcutaneously on the ventral surface of the endangered redside dace (Clinostomus elongatus). Photo credit: Mark Poos.

101

Appendix 4.3 – Model parameters used in the stochastic patch-occupancy models. Shown are the number of emigrants (mi), the mean probability of detection (PD), the number of emigrants adjusted for potentially missed tags (Mi), the number of immigrants needed to start a new sub-population, and the rate of extinction (Ei). Other parameters include: the incidence function for Leslie Tributary (dI = 210, x = 0.4926, e = 3.685, y = 6.12), and Berczy Creek (dI = 150, x = 0.5652, e = 4.187, y = 7.01). Not shown: dij given it is a pairwise estimate rather than unique for each pool.

Patch mi PD Mi Ij, 0.5 Ei

Leslie Tributary L6 35 0.710 60 4.89 0.2593 L7 7 0.619 11 2.48 0.2901 L8 38 0.767 67 5.48 0.1956 L9 10 0.740 17 8.21 0.2724 L10 37 0.647 61 5.10 0.2583 L11 38 0.780 68 5.74 0.1870 L12 14 0.727 24 8.03 0.3167 L13 8 0.792 14 2.88 0.2912 L14 0 0.000 0 7.69 0.3383 L15 3 0.800 3 7.69 0.3304 Overall 190 0.718 326

Berczy Creek B6 48 0.676 80 1.91 0.2645 B7 116 0.564 181 7.43 0.1918 B8 23 0.582 36 2.78 0.2231 B9 8 0.667 13 9.20 0.1394 B10 17 0.836 31 2.85 0.1639 B11 34 0.741 59 4.91 0.1683 B12 9 0.800 16 7.43 0.2026 B13 88 0.715 151 5.91 0.2025 B14 2 0.882 4 10.94 0.2110 B15 5 0.800 9 10.94 0.2726 B16 8 0.744 14 1.91 0.1569 B17 0 0.667 0 7.43 0.2388 B18 49 0.661 81 2.78 0.1417 Overall 407 0.656 670

102

Section II:

Reducing uncertainty from methodological choices using consensus methods

103

Chapter 5: Reducing uncertainties in modeling the distribution of endangered species using habitat-based ensemble models

Abstract Modeling the distribution of endangered species is often problematic due to their rarity and the

consequential statistical issues. Recent studies have demonstrated the utility of using ensemble

models to reduce the uncertainty of singular predictions when modeling species distributions.

This study provides a quantitative evaluation of the efficacy of ensemble models for improving

the prediction of endangered species and their habitats using the endangered fish, the redside

dace (Clinostomus elongatus), as a model organism. Specifically this study asks: 1) how well do

ensemble models improve modeling metrics (e.g. specificity, sensitivity and overall

classification); 2) how many singular methods are needed to build the optimal ensemble; and, 3)

what scale(s) and type(s) of habitat are most important for modeling this species. For this

evaluation, five ensemble models were compared based on seven singular approaches. Habitat

variables were derived from 200 sites measured from 1997-2007 and divided hierarchically into

fine-, intermediate-, and broad-scale habitat. In all cases, the ensemble models were equal to or

better than any singular method across all modeling metrics, although there was large variation

with certain combinations of initial models. This study demonstrates how comparative analysis

of modeling types, ensemble approaches and scales can be useful for reducing uncertainty in the

modeling of endangered species and their habitats.

Keywords: ensemble model, consensus model, conservation, biodiversity, biogeography, species

distributions.

104

Introduction Predictive models of species distributions have become important tools in modeling changes in

biogeography and biodiversity (Guisan & Zimmermann, 2000; Guisan & Thuiller, 2005; Sharma

and Jackson 2008). Studies have used predictive models in a wide range of applications

including to evaluate species distributions in relation to climate change (Thuiller 2004; Araújo et

al. 2005), evaluate the establishment and spread of expanding invasive species (Hartley et al.

2006), and model habitat suitability of endangered species (Rodriguez et al. 2007; Marmion et al.

2009; Franklin et al. 2009). With the increasing availability of remote sensing data and advances

of geographic information systems, researchers often are no longer data limited, and are

expanding the use of predictive models to include more applications (Guisan and Zimmerman

2000; Marmion et al. 2009).

Modeling existing and future habitats of endangered species has become a popular application of

predictive models. One reason for this popularity is that faced with limited data on the

distribution, abundance and dynamics of endangered species (Mace et al., 2005; Rodríguez,

2007), predictive models allow for the extrapolation of relatively few field samples to the entire

potential range of a species. However, the application of predictive models to endangered

species remains controversial (Loiselle 2003; Wilson et al. 2005; Thompson et al. 2007). One

difficulty with the standard sets of predictive models used by the majority of ecologists, is that

they are often inappropriate when analyzing data limited to a few sites and scales (Ellison and

Agrawal 2005; Araújo & Guisán, 2006). This, in turn, produces data sets that have many

statistical limitations, including zero-inflated bias, increased collinearity between variables, and

inflated coefficient of variation (Graham 2003; Dixon et al. 2005; Edwards et al. 2006; Guisan et

al. 2006; Dormann et al. 2008). However, uncertainties in predictive models of endangered

species may arise during all stages of modeling including obtaining species level data, obtaining

accurate species counts, and, linking species to landscapes/habitats (Elith et al. 2002; Loiselle

2003; Heikkinen et al. 2006). Therefore, approaches that reduce uncertainty in predictive models

are continually being sought, as are improvements to current approaches (Elith et al. 2006;

Austin 2007).

One way to overcome the difficulties of using singular predictive models has been the use of

ensemble-based approaches, which combine several predictive models (Aruajo and New 2007;

105

Marimon et al. 2008). The goal of ensemble models is to reduce the uncertainty in any singular

approach by combining their predictions. The idea of ensemble approaches dates back to

Laplace (1820) who demonstrated that the probability of error will rapidly decline with the

inclusion of additional predictors. However, in comparison to other disciplines, the application

of ensemble models to ecological issues remains in its infancy (Bates and Granger 1969; Arujo

et al. 2005b). In particular, the use of ensemble models have been restricted to determining

distributions of species under climate-change scenarios (Thuiller 2004; Araujo et al. 2005b;

Thuiller et al 2005; Buisson and Grenouillet 2008), with less emphasis on establishing habitat

relationships. However, as the first step of building an ensemble model is developing singular

predictive models of initial conditions as filters, studies regarding uncertainties with the existing

singular approaches from which the ensembles are based remain relatively understudied.

The use of ensemble models for improving model predictions of species distributions and their

habitats has received little attention. Comparative studies of efficiencies of ensemble models are

scarce (Thuiller 2004; Marimon et al. 2008), and few studies provide guidance on how to build

the most suitable ensemble (e.g. how do models improve prediction, how many initial filters are

needed?). Given the paucity of both ensemble models and data with most endangered species

(Mace et al. 2005; Rodriguez et al. 2007), an evaluation is needed of the application of ensemble

models for identifying habitat characteristics of endangered species. The objective of this paper

is to provide a quantitative comparative analysis of both singular predictive models and

ensemble approaches for identifying existing habitat for the endangered species, the redside dace

(Clinostomus elongatus). Specifically, this study seeks to determine: 1) whether habitat-based

ensemble models improve modeling metrics (e.g. sensitivity, specificity, and overall

classification); 2) to what degree ensemble models actually improve predictive success; and, 3)

how many models should be used to build an optimal ensemble?

106

Methods

Study Area and Species

The redside dace is a pool-dwelling minnow found only in Ontario within its Canadian range

(Figure 5.1). Redside dace is in decline due to changes in land-use through urbanization,

especially in the Greater Toronto Area, which includes 80% of its Canadian range, (COSEWIC

2007). To evaluate the efficacy of predictive models for correctly classifying redside dace

locations and their habitats, a dataset of 100 known redside dace locations sampled from 1997-

2007 was compiled from historical records from various government agencies, universities and

conservation authorities. Absence data were obtained from a larger dataset of 560 locations

obtained from the same agencies and randomly reduced to 100 locations as a means of

developing a balanced approach for assessing modeling metrics (Olden et al. 2004).

A hierarchical approach was used to predict the presence and absence of redside dace. In total

28 variables were used across multiple scales; fine-scale (i.e. site level habitat), intermediate-

scale (i.e. landscape level) and broad-scale (i.e. geologic and geomorphic variables) to help

quantify the importance of redside dace habitat (Table 5.1). These variables included factors

thought to influence redside dace, such as urban land cover (Scott and Crossman 1973; McKee

and Parker 1982; Daniels and Wisniewski 1994; Novinger and Coon 2000; COSEWIC 2007),

ranging to factors thought to influence stream fish in general (Grossman and Freeman 1987).

Fine-scale habitat features were derived using the Ontario Stream Assessment Protocol (OMNR

2007) and included: habitat type (e.g. percent pool, riffle, run); substrate type (e.g. percent

cobble, gravel or fines); stream depth (m); type of in-stream cover (e.g. percent flat rock, round

rock, wood, bank, or macrophytes); characteristics of the stream bank (e.g. percent eroding,

vulnerable, protected or depositional); overland temperature (degrees Celsius), and amount of

adjacent riparian cover (Table 5.1). Intermediate-scale habitat variables included percent

coverage by urban, forest, cropland, pasture and wetland landcover and were obtained from most

recent satellite imagery (2001) that was converted in shapefiles using a geographic information

system. Various spatial buffer sizes were used to characterize adjacent land-use; however a one

kilometer buffer was deemed most appropriate given a sensitivity analysis using various buffer

sizes. Broad-scale habitat variables were assessed using various buffer sizes in a similar fashion

107

to landscape variables. The broad-scale variables were derived from The Canadian System of

Soil Classification and included type of soil geology in the vicinity (Newmarket Till, Elma Till,

Halton Till), as well as slope (Soil Landscapes of Canada Working Group 2007). Soil categories

that occurred at < 5% of sites were removed in order to not over-parameterize the models.

New York,

. U.S.A78°20’5

42°41’2

Ontario, CANADA

Lake Ontario

Greater Toronto Area

Lake Erie

Lake Huron

Figure 5.1 – Distribution of sampling locations between 1997-2007. Closed circles indicate redside dace occurrences, whereas, open circles indicate where redside dace were absent.

108

Table 5.1 – Summary of the hierarchical habitat-based model used for predicting the presence of endangered minnow, redside dace (Clinostomus elongatus). Seven predictive models were used including: LR (logistic regression), CT (classification trees), MARS (multivariate adaptative regression splines), ANN (artificial neural networks), DA (discriminant analysis), RF (random forest), and BR (boosted regression trees). Variables were derived using forward-selection procedures on five independent datasets and are shown as a percentage of datasets where each variable was selected (parentheses indicate negative associations). Variables selected from only one dataset were omitted. A prioi predictions (ap) based on habitat predictors thought to influence the decline of the species are shown for reference, where + indicates a positive correlation, - negative correlation, 0 none.

Scale Variable ap LR CT MARS ANN DA RF BR Fine (Site)

Width - [100] [80] [100] [80] Pool Glides Fast Riffles Slow Riffles

+ 0 0 0

100

60

100

80

[80] [80]

100

[60]

100

[80]

100

[40]

Fine substrate Gravel substrate Cobble substrate

0 0 0

40

Shallow depth Intermediate depth Deep

0 0 0

80

80 Flat rock cover

Round rock cover Wood cover Bank cover Macrophyte cover

0 0 0 + 0

Eroding banks Vulnerable banks Protected banks Depositional banks

+ 0 0 0

[80]

40

40

Intermediate Urban - [100] [60] [100] [100] [100] [100] [80] (Landscape)

Cropland 0 80 Pasture 0 40

Forest 0 80 Wetland 0 Broad Newmarket Till 0 (Geologic) Halton Till 0 Elma Till 0 40 80 60 Slope 0 Temperature 0

109

Building Individual Models

Previous comparisons of individual models have demonstrated that model comparisons

are needed for understanding the relative strengths and weakness of each modeling approach,

and this is especially true for modeling endangered species where greater uncertainty exists

(Olden and Jackson 2001; 2002a). In this stud, five ensemble methods were compared for

improving the predictive success of the redside dace and its habitat. For this assessment, seven

modeling approaches were used as initial filters to identify environmental variables linked with

the occurrence of redside dace (Figure 5.2). These modeling approaches were: logistic

regression (LR); classification trees (CT); multivariate adaptive regression splines (MARS);

artificial neural networks (ANN); discriminant analysis (DA); random forest (RF); and, boosted

regression trees (BR). Each of these modeling approaches has been shown to be useful for

describing species occurrences (Thuiller 2003; Elith et al. 2006; Thuiller et al. 2006; Marimon et

al. 2008) and were used to develop the output models (Figure 5.2). The modeling methods used

represent a continuum of use from logistic regression, the most prevalent and widespread

statistical method for modeling binary data (Hosmer and Lemeshow 1989; Pampel 2000), to

newer methodologies that may perform better, such as artificial neural networks, boosted

regression trees and random forest (Olden and Jackson 2001; Olden et al. 2004; Thuiller et al.

2004). In general, the modeling methods can be broadly broken down into three groups: two

regression-based methods (LR, MARS); three machine-learning methods (BR, RF, ANN); and,

two classification methods (CT and DA; Elith et al. 2006; Marrimon et al. 2009).

110

Step 2: Analyze Training Data

Step 4: Build Ensemble Forecast (e.g. Mean, PCA, WA)

Multivariate Adaptive

Regression Splines

(MARS)

Boosted Regression

(BR)

Discriminant

Analysis (DA)

Random Forest (RF)

Artificial Neural

Networks (ANN)

A 1 2 3 B 4 5 6

Step 1: Partition Data C 7 8 9

Step 3: Produce Output Models Using Predictions of Independent Test Data

A = 1 B = 0 C = 1

A = 1 B = 1 C = 0

A = 1 B = 0 C = 0

A = 0 B = 0 C = 1

A = 1 B = 1 C = 0

A = 0 B = 0 C = 0

A = 0 B = 1 C = 1

Logistic Regression

(LG)

Classification Trees (CT)

A = 1 B = 0 C = 0

Figure 5.2 – Summary of ensemble forecasting approach. Five-fold cross validation was used by partitioning data into five individual test and validation datasets. Using each training dataset, data was analyzed individually across seven initial approaches: logistic regression (LR), classification trees (CT), multivariate adaptive regression splines (MARS), artificial neural networks (ANN), discriminant analysis (DA), random forest (RF) and boosted regression (BR). Five ensemble models were built from initial seven predictions, including: a consensus model (CM), principal components analysis (PCA), weighted average using overall classification (WA), mean (Mn) and median (Md). Modeling metrics (specificity, sensitivity and overall classification) were obtained by comparing the individual or ensemble predictions on the independent test dataset.

111

Models were configured in the following ways. LRs were run using a logit function using

maximum likelihood (Allison 1999; Olden and Jackson 2002a). CTs used a chi-square distance

of 0.05 to determine significant group cut-offs. The number of terminal nodes in the leaf

structure were optimized by iteratively running all terminal nodes and choosing the leaf structure

with the smallest error (in this case a terminal structure of 5 was determined to have the lowest

error). This step was important as too many terminal nodes may artificially inflate correct

classification, whereas too few terminal nodes may not provide meaningful species groupings

(Vayssiéres et al. 2000). These models were developed in the SAS programming language v.9.1.

MARS provides an alternative regression based method using piecewise linear fits rather than a

smoothing parameter (e.g. as in general additive models; Friedman 1991; Elith et al. 2006).

ANNs used a one-hidden layer feed-forward network trained by the back-propagation algorithm

(Bishop 1995). This type of network is considered a universal approximator of any continuous

function, has low associated rates of error and is used most often in ecological studies

(Rumelhart et al. 1986; Hornik and White 1989; Olden and Jackson 2002b, Olden et al. 2004).

ANNs were optimized (optimal referring to minimizing the trade-off between network bias and

variance) for the number of hidden neurons in the neural network by determining empirically the

number of hidden neurons that produces the lowest misclassification rate (Bishop 1995); which

in this case was a layer that contained seven nodes. Random forests were run for each node of

the tree, randomly using m variables on which to base the decision at that node. The best split

was calculated based on these m variables in the training set. Two group (presence/absence)

discriminant function analysis was developed using a linear model. Models 3-6 were built using

the statistical program Statistica v.7. BR prediction was based on an accuracy-weighted vote

across estimated classifiers (Ridgeway 1999) and run using code provided by Elith et al. (2008)

in the R programming language v.2.8.0 (R Development Core Team, 2008)

Habitat variables were chosen using forward step-wise selection methods; except for the

machine-learning methods (ANN, RF, BR), which iteratively fit their own responses to the

habitat data (Olden and Jackson 2002b; Dormann et al. 2008). This approach was necessary to

compare the ability of individual models for selecting habitat variables with high explanatory

power. Although bias can be introduced by variable selection methods, recent studies have

demonstrated that selection procedures may actually have very small impact on resultant models

(Maggini et al. 2006; Meynard and Quinn 2007; Dormann et al. 2008). A forward-selection

112

procedure (p < 0.05) was used to determine variables with high explanatory power. In the cases

of machine-learning methods, the single ANN model with the highest overall classification (e.g.

highest area under the curve) was retained. With the RF and BR approaches the importance of

variables was determined by estimating the relative influence of each variable reducing the loss

function, based on a square error algorithm (Thuiller, 2003).

Model thresholds were calculated using receiver operator characteristic curves (ROC) with

thresholds balanced between misclassification of species presence and species absence (Olden

and Jackson 2001). It should be noted that the choice of model thresholds can impact the results

(Jiménez-Valverde & Lobo 2007; Lobo et al. 2008). For example, researchers must weigh the

need for correct classification of either presence or absence at the expense of inflated error (i.e.

misclassification). One could choose model-selection thresholds that prioritize correct

classification, at the expense of increased misclassification of species absences (or vice versa).

Such a decision may be well suited for conservation applications, where one would wish to

emphasize correct predictions of the true occurrence of an endangered species, but were willing

to accept higher commission rates (Loiselle et al. 2003). For the purpose of this study, a

balanced approach was used, where models were select that had equal likelihood of

misclassification of species presence and absence. This approach was well suited for this study

as model were evaluated based on their specificity, sensitivity and overall classification, which

require equal consideration (Hartley et al. 2006). In addition, previous studies (e.g. Marrimon et

al. 2008), which have used area-under-the-curve operations to evaluate predictive performance

of ensemble models may inappropriately produce models that have high overall classification

due solely to model specificity, which would not otherwise be known (Loiselle et al 2003;

Jiménez-Valverde. & Lobo 2007; Lobo et al. 2008).

For all of the model comparisons, data were split into two types: 80% of the data were used to

train each model, and 20% of the data were used to validate and test the predictive capability of

each model (Figure 5.2). A 5-fold cross validation procedure was used where five separate

models were calculated for each statistical method to build a complete validation data set (Figure

5.2). The validation data is an important component of model evaluation, as it provides an

independent unbiased evaluation of the predictive capability of each trained model (Olden et al.

2002a).

113

Evaluating Individual Models

Predictive models were compared in a number of ways. First, model outputs were compared to a

set of a prioi predictions based on habitat predictors thought to influence the decline of the

species (Scott and Crossman 1973; McKee and Parker 1982; Daniels and Wisniewski 1994;

Novinger and Coon 2000). Second, predictive models were evaluated by three modeling

metrics: model sensitivity (the ability of each model to correctly predict species presence); model

specificity (the ability of each model to correctly predict species absence); and, overall

classification (the ability of each model to correctly classify both species presence and absence).

The use of these metrics provides an alternative means to evaluate each model, their comparative

successes, their associated errors, and their relationship to ecological relevant variables, for both

species presence and absence (Olden and Jackson 2002a). The distinction between modeling

metrics is an important one as models for predicting imperiled species are often difficult to

assess because they tend to artificially inflate correct classification of species absence (model

specificity), where the majority of sites are associated and the habitats do not reflect the ecology

of the model species, whereas, model sensitivity measures the ability to classify species

presence, which may be more relevant for conservation applications. Finally, predictive outputs

were compared to the observed data using unweighted pair-group method (UPGMA) cluster

analysis using phi similarity. Cluster analysis has been used widely in ecological literature

(Legendre and Legendre 1997) as the resultant output produces a dendrogram which connects

closely matching neighbors. For example, statistical methods connected by branches proximal to

one another match closer than methods connected by branches more distal (Podani 2000). The

phi coefficient is the binary correlation coefficient and is not biased by frequency of occurrence

as has been shown for other coefficients (Jackson et al 1989).

Building Ensemble Models

Ensemble models allow for a decrease in predictive uncertainty of singular models, by using a

combination of their predictions (Figure 5.2). Various ensemble models have been proposed,

including those that use selective algorithms or basic mathematical functions such as the mean

and median predictions (Araújo et al. 2005; Marrimon et al. 2008). Here, five ensemble

approaches currently in favor were compared: a consensus model (CM), Principal Components

Analysis (PCA), weighted average using overall classification (WA), Mean (Mn) and Median

(Md) ensemble approaches. Each of these approaches have been used extensively in building

114

ensemble models (e.g. Gregory et al., 2001; Johnson and Omland 2004; Thuiller, 2004; Araújo et

al 2005a; 2005b, 2006; Thuiller et al 2005; Araújo and New 2007; Goswami and O’Connor,

2007; Marimon et al. 2008)

All ensemble models were built from reducing a matrix of model predictions (j) by sites (i) to a

vector of ensemble predictions. For example, the consensus ensemble model used a majority

rules criterion, where all model outputs were compared and the majority output (presence or

absence) was retained. For the mean ensemble model, all continuous model outputs were

averaged and the final output was rounded to either presence or absence. Similarly, the median

ensemble model took the median value of all seven individual outputs (either presence or

absence). The principal components analysis used the dominant axis of an eigenvalue

decomposition of a covariance matrix programmed in the R v.2.8.0 (R Development Core Team,

2008). The resultant eigenvalue was scaled to presence absence using the ROC approach

highlighted earlier. Finally, the weighted average ensemble model used the overall classification

(across all predictions) and multiplied it to each prediction to produce a singular ensemble

prediction.

Evaluating Ensemble Models

Ensemble models were evaluated using identical metrics as the individual models: specificity,

sensitivity and overall classification, and cluster analysis. The number of initial models needed

to build the most appropriate ensemble model was also evaluated. For this, a re-sampling

approach was developed using a sample-based rarefaction routine (Colwell et al. 2004), which

was coded using R v.2.8.0 (R Development Core Team, 2008). This re-sampling approach

randomly selected predictions at a given site from a matrix of model predictions by site. Each

prediction was matched to the observed data and model specificity, sensitivity and overall

classification was calculated across study sites. The set of predictions was then randomly

permuted and a new prediction was added from a competing model and evaluated as to whether

this additional prediction increased overall classification (Figure 5.2). This randomization

procedure was repeated 1000 times to determine 95% confidence intervals for all combinations

of ensemble models.

115

Results

I) Individual Models

The majority of the initial predictive models identified similar fine-scale habitat features,

including: positive associations with deep, pool habitats with gravel substrate, and negative

associations with stream width and fast riffles (Table 5.1). Only a negative association with

urban land cover and a positive association with Elma till, came out as a strong indicator at a

broader scale. These findings are in agreement with previous studies of current knowledge of

this species and its habitat (Scott and Crossman 1973; McKee and Parker 1982; Daniels and

Wisniewski 1994; Novinger and Coon 2000; COSEWIC 2007; Table 5.1).

All individual models generally performed well (> 80% overall classification). Performance was

generally better for model specificity (range 84-90%; Table 5.2) than for model sensitivity (75-

87%). When individual models were compared, logistic regression was most closely related to

the observed data, followed by the learning-based approaches, such as boosted regression,

random forest, and artificial neural networks (Fig. 5.3A). These methods also had the highest

rates of model specificity and generally the highest rates of model sensitivity, although artificial

neural networks dropped off in this area (Table 5.2). Alternatively, classification-based methods,

such as classification trees and discriminant analysis were least associated with the observed

data, followed by multivariate adaptive regression splines.

116

Table 5.2 – A comparison of model sensitivity (correct classification of species presence), specificity (correct classification of species absence), and overall classification (correct classification of both species presence and absence) for redside dace (Clinostomus elongatus). Single models are: LR (logistic regression); CT (classification trees); BR (boosted regression trees); MARS (multivariate adaptive regression splines); ANN (artificial neural networks); DA (discriminant analysis); and, RF (random forest). Ensemble forecasts are: consensus model (CM); principal component analysis (PCA); weighted average using overall classification (WA); mean (Mn); and, median (Md).

Singular Models Ensemble Models

LG CT BR MARS ANN DA RF CM PCA WA Mn Md

Sensitivity (%) 87 75 83 81 75 84 83 87 88 90 88 87

Specificity (%) 90 86 90 88 89 84 89 94 91 90 89 92

Overall Classification (%) 88 81 87 85 82 84 86 91 90 90 89 90

117

II) Ensemble Models

Ensemble models provided as good as or improved model performance over singular

methods. Overall classification increased from 1-10% with ensemble models (Table 5.2). On

average, ensemble models improved model sensitivity over model specificity. For example,

whereas, only logistic regression had model sensitivity above 85%, all ensemble models had

rates of model sensitivity at 87% or above and had model sensitivity equal or above to all the

individual models. With the exception of the mean ensemble result, all ensemble models

provided equal or superior specificity of 89% rather than the 90% achieved by the best individual

model. Overall, the various ensemble methods produced similar levels of model sensitivity,

model specificity and overall classification; however, the consensus ensemble model provided

the highest rate of classification, due to having both higher values of model sensitivity and

specificity. The various ensemble models show close association (i.e. phi correlation measure) to

one another (Fig. 5.3) and to the observed data, which were similar to the association between

the observed data and the best individual model (i.e. logistic regression).

Figure 5.3- Cluster analysis showing the relationship with the observed distribution (Obs.) of redside dace (Clinostomus elongatus) and: A) the seven individual modeling approaches alone, and B) with ensemble forecasts included. Model short forms are carried over from Table 5.2.

118

There was large variation when comparing all possible combinations of consensus ensemble

models. Whereas, model sensitivity and overall classification increased (on average) with the

addition of more initial models, the average model specificity decreased for combinations of

three and five initial models, before improving at the final seven-input ensemble model (Fig.

5.4). In addition, the variation (e.g. 25th and 75th percentiles, as well as minima and maxima) in

model sensitivity, specificity and overall classification increased when larger combinations of

ensembles were considered (e.g. there were 35 combinations of ensembles for three-input

models, and 21 for five-ensemble models). For the three-model consensus, the combination of

LG, MARS, RF had the highest model specificity, CT, ANN, BR had the highest model

sensitivity and LG, RF, BR had the highest overall classification. Alternatively, the combination

of learning-based methods ANN, RF, BR had the lowest model specificity and overall

classification, whereas, the combination of LG, CT, RF showed the lowest model sensitivity

(Appendix 5.1). Similarly, for the five-model consensus models, combinations with all three

learning-based methods performed the worst, where the combination of MARS, ANN, RF, BR,

DA had the lowest sensitivity and overall classification and LG, MARS, ANN, RF, BR had the

lowest specificity (Appendix 5.1).

119

A)

B)

C) Figure 5.4 – Box and whisker plots showing the variability in consensus ensemble forecasts for predicting the presence of an endangered redside dace. Consensus ensemble models were compared across all combinations of one (n =7), three (n =35), five (n =21) and seven (n=1) input models (x-axis). Boxes are 25th and 75th percentiles, horizontal lines indicate the median, vertical lines indicate the upper and lower values, diamonds indicate the mean and are connected by dashed lines. Modeling metrics were: A) model sensitivity (i.e. correct classification of species presence); B) model specificity (i.e. correct classification of species absence); and, C) overall classification. Dashed lines indicate 95% confidence intervals.

120

Discussion Ensemble models provide a useful method for reducing uncertainty when modeling distributions

of endangered species. Endangered species are not only rare, but they are also often difficult to

capture and enumerate, thereby, complicating the evaluation of their habitat. Thus methods

which can be used to reduce the uncertainty of model type should allow for improvement in the

management of endangered species. In this study, ensemble models improved model specificity

for the endangered redside dace (Clinostomus elongatus) by 6.9%, model sensitivity by 3.2% and

overall classification by 5.3% (on average); over individual models. Previous studies have

demonstrated that ensemble approaches can reduce uncertainty and improve model fit for

predicting future distributions (Thuiller, 2004; Thuiller et al., 2005; Araujo and New 2007;

Marrimon et al. 2008). Here, this study provides the first evidence that ensemble models can be

equally applied to modeling current distribution and provides increased model performance over

singular approaches.

Ensemble models may be especially useful for identifying the potential importance of habitat for

endangered species, and at the scale in which they function. Modeling the distributions of

endangered species and their habitats is an activity filled with uncertainty. Using the endangered

redside dace as an example, this study demonstrates that although predictive models varied in

their ability to identify correctly existing habitat, in all cases an ensemble approach improved

model prediction. In addition, whereas previous studies have demonstrated the utility of

ensemble models for identifying habitats at broader scales (e.g. landscape), this study

demonstrates the utility in developing a multi-scaled approach. With the increase in availability

of remote-sensing data and advances in geographic information systems, researchers are left with

a plethora of data from which to model distributions (Guisan and Zimmerman 2000). Often

climatic, topographic and landuse data are available readily (Guralnick et al. 2007), whereas

information on fine-scale variables is more scarce (Austin 2007; Dorman et al. 2008). This study

demonstrates a situation where fine-scale habitat models had a greater ability to describe factors

relevant to the redside dace than broader scale factors (Table 5.1).

The need for comparative approaches using many statistical methods has also been highlighted

as another way of reducing uncertainty with modeling endangered species (Guisan and

Zimmerman 2000; Olden and Jackson 2002a). The choice of modeling approach has been

121

shown to have severe consequences for the application to conservation decisions (Loiselle 2003;

Wilson et al. 2005; Rodriguez et al. 2007; Marrimon et al. 2008). For example, Pearson et al.

(2006) showed that distribution changes of South African plant species varied from 92% loss to a

322% gain depending on the model they used. In addition, Dorman et al (2008) demonstrated

that of several uncertainties in modeling species distributions, including variable selection and

collinearity between variables, the choice of model type had the largest impact. In this study,

there was variation in how each model type fitted predictions, with logistic regression most

closely resembled the observed distribution, followed closely by the three machine learning

methods: boosted regression, random forest and artificial neural networks. These results are in

general agreement with other studies showing the benefits of machine-learning methods such as

random forest, and general boosted regression (Cutler et al., 2007). One reason for the superior

performance of logistic regression is that more complex models (which iteratively fit a solution)

can be prone to overfitting (Olden and Jackson 2002a). As previous quantitative comparative

analyses have demonstrated, the success of predictive modeling approaches can be largely data

dependent and there is no clear indication of the preeminence of any singular approach (Olden

and Jackson 2002a; Araujo and New 2007). Comparative analyses can be used to identify

problems related to modeling approaches given the available data and to determine situations

where comparative analysis may work better (Guisan and Zimmerman 2000; Olden and Jackson

2002a).

Comparative approaches are needed not just to ensure that appropriate models are being

developed, but to ensure the best ensemble models are being built. Comparisons of ensemble

models are rare (e.g. Araujo et al. 2005; Marrimon et al. 2008) and only one (Marimon et al.

2008) has evaluated the relationship between ensemble models and predictive performance. In

this study, there was variation in the ability of ensemble models to improve prediction over

singular approaches. The consensus ensemble model (i.e. vote counting) provided the highest

model specificity and the best estimated overall classification, whereas the weighted average

ensemble model provided the highest model sensitivity (Table 5.2). The remaining ensemble

models performed similarly, including those based on principal components analysis, which have

been used most frequently in previous studies (Thuiller 2004; Araujo et al. 2005b). The reason

for this similarity is likely due to the lack of independence in the information provided by the

initial models. Future research is needed into understanding the study settings in which different

ensemble methods are likely to perform best (Marrimon et al. 2007).

122

The evaluation of the efficacy of combining initial models into ensemble models has received

little attention and remains an important knowledge gap (Loiselle et al. 2003; Wilson et al. 2005;

Marrimon et al. 2008). Decisions such as how many initial models are needed to build the most

useful ensemble model have not been addressed. Ensemble models work under the assumption

that the initial models provide some independent information (Araujo and New 2007): at some

stage, adding more initial models beyond those that provide independent information may

actually decrease the utility of the ensemble approach. Here a simple sub-sampling procedure

(Colwell et al. 2004) was used to demonstrate that an optimal ensemble model could be

developed using as few as three initial models (Figure 5.3). As understanding the contributions

of initial models to ensemble forecasts remains a limitation to the ensemble approach (Araujo

and New 2007), the use of sample based rarefaction techniques may allow substantial insight

into ensuring that appropriate ensemble models are being produced.

The application of ensemble models to ecological issues remains in its infancy (Araujo et al.

2005b). Previous studies on ensemble models have demonstrated their utility in reducing

uncertainty with singular methods (Thuiller et al. 2004; Araujo et al. 2005b; Marrimon et al.

2007), however several challenges remain to ensure ensemble models are applied appropriately,

especially for modeling distributions of endangered species. First, researchers should have

thorough understanding of predictive modeling, their uncertainties and conditions under which

they should be applied (Elith et al. 2002; Loiselle et al. 2003). Second, researchers should be

mindful of the limitations of working with endangered species, which provide additional

statistical problems (Thompson 2003; Ellison and Agrawal 2005). Third, comparative analyses

of not only singular methods, but ensemble methods are needed to ensure that the most

appropriate models are being constructed. The development of appropriate validation data

(Araujo and New 2007), decision of modeling metrics (e.g. AUC, sensitivity, specificity), and

consideration of how many initial models are needed to build the ensemble, are all needed to

ensure that ensemble models are actually improvements to singular methods. Fourth, the goals

of the ensemble model need to be explicitly testable. Such testing should include the use of

models to corroborate hypothesis with either a priori prediction or existing knowledge and

ensuring that habitat is modeled using appropriate scale/s.

123

Conclusion

This study demonstrates that ensemble models can provide marked improvements to singular

approaches in situations where there is uncertainty exists, such as modeling suitable habitat of

endangered species. Using the endangered redside dace as a model organism, this study shows

that an ensemble approach can improve model sensitivity, specificity and overall classification.

These findings have important consequences for improving species distribution models of

endangered species. In addition, this study demonstrates the utility of sub-sampling procedures

for determining how many initial models are needed to build an optimal ensemble. As ensemble

models are dependent on their initial inputs, improvements to future ensemble models can be

expected with greater consideration of how initial models perform best. This study provides an

example of how comparative analyses across many scales, initial modeling types and ensemble

approaches can be used to improve the prediction of endangered species and their habitats.

Acknowledgements

Funding was provided by NSERC Canada and OGS Scholarships to M.S.P., an NSERC

Discovery Grant to D.A.J., Interdepartmental Recovery Fund #1410 provided by Fisheries and

Oceans (DFO), the Ontario Ministry of Natural Resources (OMNR), and the University of

Toronto. The following agencies and individuals were helpful for provided data for this research:

Royal Ontario Museum, Fisheries and Oceans, Ontario Ministry of Natural Resources, Toronto

Region Conservation Authority (TRCA), Conservation Halton (CH), Lower Lake Simcoe

Conservation Authority (LLSCA), Credit Valley Conservation Authority, D Forder (Ontario

Streams), J Anderson (LLSCA), L Stanfield (OMNR), E Holm (ROM), D Lawrie (TRCA), S

Jarvie (TRCA), S Watson-Leung (CH), and S Reid (DFO). In addition, M Neff and G Rawnsley

provided field assistance. This manuscript benefited from discussions with C Harpur and A

Drake. Finally anonymous reviewers were helpful for providing comments on early drafts of

this paper.

124

References

Allison, P. D. 1999. Logistic Regression Using the SAS System. Cary, N.C. SAS Institute.

Araujo, M. B., and M. New. 2007. Ensemble forecasting of species distributions. TRENDS in


Araújo, M. B., R. G. Pearson, W. Thuiller, and M. Erhard. 2005a. Validation of species–climate

impact models under climate change Global Change Biology 11:1504-1513.

Araújo, M. B., R. J. Whittaker, R. J. Ladle, and M. Erhard. 2005b. Reducing uncertainty in


14:529-538.

Araújo, M. B., and A. Guisán. 2006. Five (or so) challenges for species distribution modelling.


Austin, M. 2007. Species distribution models and ecological theory: a critical assessment and

some possible new approaches. Ecological Modelling 200:1-19.

Bishop, C. M. 1995 Neural Networks for Pattern Recognition. Oxford University Press, New

York, U.S.A.

Burgman, M., D. B. Lindenmayer, and J. Eltih. 2005. Managing landscapes for conservation

under uncertainty. Ecology 86:2007-2017.

Buisson, L., and G. Grenouillet. 2009. Contrasted impacts of climate change on stream fish

assemblages along an environmental gradient. Diversity and Distributions 15:613-626.

Burnham, K. P., and D. R. Anderson. 2002. Model selection and multimodel inference: a

practical information-theoretic approach, 2nd edition. Springer-Verlag, New York, New

York.

Cade, B. S., B. R. Noon, and C. H. Flather. 2005. Quantile regression reveals hidden bias and

uncertainty in habitat models. Ecology 86:786-800.

125

Colwell, R. K., C. X. Mao, and J. Chang. 2004. Interpolating, extrapolating, and comparing

incidence-based species accumulation curves. Ecology 85:2717-2727.

COSEWIC. 2007. COSEWIC Assessment and update status report on the redside dace

Clinostomus elongatus in Canada. Committee on the Status of Endangered Wildlife in

Canada, Ottawa, ON.

Culter, D. R., T. C. Edwards, K. H. Beard, A. Cutler, K. T. Hess, J. Gibson, and J. J. Lawler.

2007. Random forests for classification in ecology. Ecology 88:2783-2792.

Cunningham, R. B., and D. B. Lindenmayer. 2005. Modeling count data of rare species: Some

statistical issues. Ecology 86:1135-1142.

Daniels, R. A., and S. J. Wisniewski. 1994. Feeding ecology of redside dace, Clinostomus

elongatus. Ecology of Freshwater Fish 3:176-183.

Dixon, P. M., A. M. Ellison, and N. J. Gotelli. 2005. Improving the precision of estimates of the

frequency of rare events. Ecology 86:1114–1123.


Components of uncertainty in species distribution analysis: A case study of the great grey


Edwards, T. C., D. R. Cutler, N. E. Zimmermann, L. Geiser, and J. Alegriae. 2005. Model-based

stratifications for enhancing the detection of rare ecological events. Ecology 86:1081-

1090.

Elith, J., M. A. Burgman, and H. M. Regan. 2002. Mapping epistemic uncertainties and vague

concepts in predictions of species distributions. Ecological Modelling 157:313-329.







126

Elith, J., J. R. Leathwick, and T. Hastie. 2008. A working guide to boosted regression trees.

Journal of Animal Ecology 77:802-813.

Ellison, A. M., and A. A. Agrawal. 2005. The statistics of rarity. Ecology 86:1079-1080.

Goswami, M., and K. M. O’Connor. 2007. Real-time flow forecasting in the absence of

quantitative precipitation forecasts: a multi-model approach. Journal of Hydrology

334:125-140.

Graham, M. H. 2003. Confronting multicollinearity in ecological multiple regression. Ecology

84:2809-2815.

Gregory, A. W., G. W. Smith, and J. Yetman. 2001. Testing for forecast consensus. Journal of

Business and Economic Statistics 19:34-43.

Grossman, G. D., and M. C. Freeman. 1987. Microhabitat use in a stream fish assemblage.

Journal of Zoology 212:151-176.

Guisan, A., O. Broennimann, R. Engler, M. Vust, N. G. Yoccoz, A. Lehmann, and N. E.

Zimmermann. 2006. Using niche-based distribution models to improve the sampling of

rare species. Conservation Biology 20:501-511

Guisan, A., and W. Thuiller. 2005. Predicting species distribution: offering more than simple

habitat models. Ecology Letters 8:993-1009.

Guisan, A., and N. E. Zimmermann. 2000 Predictive habitat distribution models in ecology.


Guralnick, R. P., A. W. Hill, and M. Lane. 2007. Towards a collaborative, global infrastructure

for biodiversity assessment. Ecology Letters 10:663-672.

Hartley, S., R. Harris, and P. J. Lester. 2006. Quantifying uncertainty in the potential distribution

of an invasive species: climate and the Argentine ant. Ecology Letters 9:1068-1079.

127

Heikkinen, R. K., M. Luoto, M. B. Araújo, R. Virkkala, W. Thuiller, and M. T. Sykes. 2006.

Methods and uncertainties in bioclimatic envelope modelling under climate change.

Progress in Physical Geography 30:751-777.

Hornik, K., and H. White. 1989. Multilayer feedforward networks are universal approximators.

Neural Networks 2:359-366.

Hosmer, D. W., and S. Lemeshow. 1989. Applied Logistic Regression. John Wiley and Sons,

New York.

Jiménez-Valverde, A., and J. M. Lobo. 2007. Threshold criteria for conversion of probability of

species presence to either–or presence–absence. Acta Oecologica 31:361-369.

Johnson, J. B., and K. S. Omland. 2004. Model selection in ecology and evolution. TRENDS in



Legendre, P., and L. Legendre. 1998. Numerical Ecology, 2nd edition. Elsevier Science BV,

Amsterdam, Neitherlands (853 pp.).

Lobo, J. M., A. Jiménez-Valverde, and R. Real. 2008. AUC: a misleading measure of the

performance of predictive distribution models. Global Ecology and Biogeography

17:145-151.

Loiselle, B. A., C. A. Howell, C. H. Graham, J. M. Goerck, T. Brooks, K. G. Smith, and P. H.

Williams. 2003. Avoiding pitfalls of using species distribution models in conservation

planning. Conservation Biology 17:1591-1600.

Mace, G. M., H. P. Possingham, and N. Leader-Williams. 2005. Prioritizing choices in

conservation. Pages 17-34 In D. MacDonald and K. Services (ed.). Key Topics in

Conservation Biology. Blackwell Publishers, Oxford.

MacNally, R. 2000. Regression and model-building in conservation biology, biogeography and

ecology: The distinction between - and reconciliation of – ‘predictive’ and ‘explanatory’

models. Biodiversity and Conservation 9:655-671.

128

Maggini, R., A. Lehmann, N. E. Zimmermann, and A. Guisan. 2006. Improving generalized

regression analysis for the spatial prediction of forest communities. Journal of

Biogeography 33:1729-1749.


consensus methods in predictive species distribution modelling. Diversity and

Distributions 15: 59-69.

McKee, P. M., and B. J. Parker. 1982. The distribution biology and status of the fishes

Campostoma anomalum Clinostomus elongatus Notropis photogenis Cyprinidae and

Fundulus notatus Cyprinodontidae in Canada. Canadian Journal of Zoology 60:1347-

1358.

McCullagh, P., and J. A. Nelder. 1989. Generalized Linear Models. Chapman and Hall, London,

England.

Meynard, C. N., and J. F. Quinn. 2007. Predicting species distributions: a critical comparison of

the most common statistical models using artificial species. Journal of Biogeography

34:1455-1469.

Moisen, G. G., E. A. Freeman, J. A. Blackard, T. S. Frescino, N. E. Zimmermann, and T. C.

Edwards. 2006. Predicting tree species presence and basal area in Utah: A comparison of

stochastic gradient boosting, generalized additive models, and tree-based methods.

Ecological Modelling 199: 176-187.

Novinger, D. C., and T. G. Coon. 2000. Behavior and physiology of the redside dace,

Clinostomus elongatus, a threatened species in Michigan. Environmental Biology of

Fishes 57:315-326.

Olden, J. D., and D. A. Jackson. 2001. Fish habitat relationships in lakes: gaining predictive and

explanatory insight by using artificial neural networks. Transactions of the American

Fisheries Society 130:878-897.

Olden, J. D., and D. A. Jackson. 2002a. A comparison of statistical approaches for modelling

fish species distributions. Freshwater Biology 47:1976-1995.

129

Olden, J. D., and D. A. Jackson. 2002b. Illuminating the 'black box': Understanding variable

contributions in artificial neural networks. Ecological Modelling 154:135-150.

Olden, J. D., M. K. Joy, and R. G. Death. 2004. An accurate comparison of methods for

quantifying variable importance in artificial neural networks using simulated data.


OMNR. 2007. Stream assessment protocol for southern Ontario. Ontario Ministry of Natural

Resources, Picton, Ontario.

Pampel, F. C. 2000. Logistic regression: A primer. Sage Publications, London.

Pearson, R. G., W. Thuiller, M. B. Araújo, E. Martinez-Meyer, L. Brotons, C. McClean, L.

Miles, P. Segurado, T. P. Dawson, and D. C. Lees. 2006. Model-based uncertainty in

species’ range prediction. Journal of Biogeography 33:1704-1711.

Podani, J. 2000. Introduction to the exploration of multivariate biological data. Backhuys

Publishers, Leiden, The Netherlands.

R Development Core Team. 2008. A Language and Environment for Statistical Computing. R

Foundation for Statistical Computing, Vienna, Austria. (ISBN: 3-900051-07-0).

Ridgeway, G. 1999. The state of boosting. Computing Sciences and Statistics 31:172-181.

Rodríguez, J. P., L. Brotons, J. Bustamante, and J. Seoane. 2007. The application of predictive

modelling of species distribution to biodiversity conservation. Diversity and Distributions

13:243–251.

Rumelhart, D. E., G. E. Hinton, and R. J. Williams. 1986. Learning representations by

backpropagation errors. Nature 323:533-536.

Scott, W. B., and E. J. Crossman. 1973. Freshwater fishes of Canada. Bulletin 184 Fisheries

Research Board of Canada, Ottawa, ON.

130




Soil Landscapes of Canada Working Group, 2007. Soil Landscapes of Canada v3.1.1 Agriculture

and Agri-Food Canada. (digital map and database at 1:1 million scale).

Thomson, J. R., E. Fleishman, R. MacNally, and D. S. Dobkin. 2007. Comparison of predictor

sets for species richness and the number of rare species of butterflies and birds. Journal of

Biogeography 34:90-101.

Thuiller, W. 2003. BIOMOD: optimizing predictions of species distributions and projecting

potential future shifts under global change. Global Change Biology 9:1353-1362.


Global Change Biology 10:2020-2027.



102:8245-8250.

Thuiller, W. F., G. Midgley, M. Rougeti, and R. Cowling. 2006. Predicting patterns of plant

species richness in megadiverse South Africa. Ecography 29:733-744.

Vayssiéres, M. P., R. E. Plant, and B. H. Allen-Diaz. 2000. Classification Trees: An Alternative

Non-Parametric Approach for Predicting Species Distributions. Journal of Vegetation

Science 11:679-694.

Whittingham, M. J., P. A. Stephens, R. B. Bradbury, and R. P. Freckleton. 2006. Why do we still

use stepwise modelling in ecology and behaviour? Journal of Animal Ecology 75:1182-

1189.

Wilson, K. A., M. I. Westphal, H. P. Possingham, and J. Elith. 2005. Sensitivity of conservation

planning to different approaches to using predicted species distribution data. Biological


131

Appendices Appendix 5.1 – Model metrics for all combinations of consensus ensemble models. Models are: LR (logistic regression), CT (classification trees), MARS (multivariate adaptive regression splines), RF (random forest), ANN (artificial neural networks), BR (boosted regression trees, and DA (discriminant analysis).

Model/s Included Sensitivity Specificity Overall Classification

LG 0.8700 0.9000 0.8800 CT 0.7500 0.8600 0.8100 MARS 0.8100 0.8800 0.8500 RF 0.8300 0.8900 0.8600 ANN 0.7500 0.8900 0.8200 BR 0.8300 0.9000 0.8700 DA 0.8400 0.8400 0.8400 LG,CT,MARS 0.8822 0.8516 0.8669 LG,CT,ANN 0.8428 0.8650 0.8539 LG,CT,RF 0.8878 0.7922 0.8400 LG,CT,BR 0.8784 0.8254 0.8519 LG,CT,DA 0.9350 0.8198 0.8774 LG,MARS,ANN 0.8640 0.8796 0.8718 LG,MARS,RF 0.9318 0.8188 0.8749 LG,MARS,BR 0.9222 0.8250 0.8736 LG,MARS,DA 0.9088 0.8228 0.8658 LG,ANN,RF 0.8222 0.8646 0.8434 LG,ANN,BR 0.7900 0.8730 0.8315 LG,ANN,DA 0.8594 0.8396 0.8495 LG,RF,BR 0.9164 0.8656 0.8909 LG,RF,DA 0.9286 0.8102 0.8694 LG,BR,DA 0.9118 0.8610 0.8864 CT,MARS,ANN 0.9058 0.8532 0.8795 CT,MARS,RF 0.9150 0.8292 0.8721 CT,MARS,BR 0.8836 0.8408 0.8622 CT,MARS,DA 0.9296 0.8400 0.8848 CT,ANN,RF 0.9056 0.8626 0.8841 CT,ANN,BR 0.8688 0.8894 0.8791 CT,ANN,DA 0.8980 0.8804 0.8892 CT,RF,BR 0.8670 0.8280 0.8475 CT,RF,DA 0.9174 0.8106 0.8640 CT,BR,DA 0.8970 0.8372 0.8671 MARS,ANN,RF 0.7368 0.8718 0.8043

132

MARS,ANN,BR 0.7182 0.8446 0.7814 MARS,ANN,DA 0.7430 0.8284 0.7857 MARS,RF,BR 0.7808 0.8068 0.7938 MARS,RF,DA 0.7982 0.8018 0.8000 MARS,BR,DA 0.8052 0.8274 0.8163 ANN,RF,BR 0.6836 0.8766 0.7801 ANN,RF,DA 0.7234 0.8670 0.7952 ANN,BR,DA 0.7018 0.8750 0.7884 RF,BR,DA 0.7818 0.8424 0.8121 LG,CT,MARS,ANN,RF 0.8744 0.8826 0.8785 LG,CT,MARS,ANN,BR 0.8654 0.8892 0.8773 LG,CT,MARS,ANN,DA 0.8624 0.8558 0.8591 LG,CT,MARS,RF,BR 0.9064 0.8502 0.8783 LG,CT,MARS,RF,DA 0.9194 0.8344 0.8769 LG,CT,MARS,BR,DA 0.8908 0.8730 0.8819 LG,CT,ANN,RF,BR 0.8574 0.9074 0.8824 LG,CT,ANN,RF,DA 0.8656 0.8660 0.8658 LG,CT,ANN,BR,DA 0.8432 0.9006 0.8719 LG,CT,RF,BR,DA 0.8888 0.8256 0.8572 LG,MARS,ANN,RF,BR 0.8232 0.8244 0.8238 LG,MARS,ANN,RF,DA 0.8792 0.8366 0.8529 LG,MARS,ANN,BR,DA 0.8238 0.8686 0.8462 LG,MARS,RF,BR,DA 0.9072 0.8344 0.8708 LG,ANN,RF,BR,DA 0.8094 0.8720 0.8407 CT,MARS,ANN,RF,BR 0.8762 0.8834 0.8798 CT,MARS,ANN,RF,DA 0.9140 0.8562 0.8851 CT,MARS,ANN,BR,DA 0.8804 0.8944 0.8874 CT,MARS,RF,BR,DA 0.9078 0.8462 0.8770 CT,ANN,RF,BR,DA 0.8504 0.8802 0.8653 MARS,ANN,RF,BR,DA 0.7368 0.8424 0.7896

133

Appendix 5.2 – R Code for testing configurations of 1,3, 5 and 7 prediction consensus models data<-read.table ("F:\\R\\Predict.txt",h=T) # load prediction matrix iters<-10000 # define number of iterations model<-matrix(nrow=iters,ncol=8) for (i in 1:iters){ # start loop (data[1:200,1:8][sample(200,1,replace=TRUE),])->temp # from sample of 200, randomly pick 1 row (with replacement) as.matrix(temp)->temp2 # make temporary matrix temp2->model[i,] # fill this matrix with model outputs from n number of random sites } # end loop model[1:iters,1]->OBS # define where obs is (e.g. col 2) model[1:iters,2]->LG # name model in column 3 model[1:iters,3]->CT # name nmodel in column 4 model[1:iters,4]->MARS # name model in column 5 model[1:iters,5]->RF # name model in column 6 model[1:iters,6]->ANN # name model in column 7 model[1:iters,7]->BR # name model in column 8 model[1:iters,8]->DA # name model in column 9 # define combination # THREE MODEL COMBINATIONS combo<-cbind(LG,CT,GLM) #combo<-cbind(LG,CT,ANN) #combo<-cbind(LG,CT,RF) #combo<-cbind(LG,CT,BR) #combo<-cbind(LG,CT,DA) #combo<-cbind(LG,MARS,ANN) #combo<-cbind(LG,MARS,RF) #combo<-cbind(LG,MARS,BR) #combo<-cbind(LG,MARS,DA) #combo<-cbind(LG,ANN,RF) #combo<-cbind(LG,ANN,BR) #combo<-cbind(LG,ANN,DA) #combo<-cbind(LG,RF,BR) #combo<-cbind(LG,RF,DA) #combo<-cbind(LG,BR,DA)

134

#combo<-cbind(CT,MARS,ANN) #combo<-cbind(CT,MARS,RF) #combo<-cbind(CT,MARS,BR) #combo<-cbind(CT,MARS,DA) #combo<-cbind(CT,ANN,RF) #combo<-cbind(CT,ANN,BR) #combo<-cbind(CT,ANN,DA) #combo<-cbind(CT,RF,BR) #combo<-cbind(CT,RF,DA) #combo<-cbind(CT,BR,DA) #combo<-cbind(MARS,ANN,RF) #combo<-cbind(MARS,ANN,BR) #combo<-cbind(MARS,ANN,DA) #combo<-cbind(MARS,RF,BR) #combo<-cbind(MARS,RF,DA) #combo<-cbind(MARS,BR,DA) #combo<-cbind(ANN,RF,BR) #combo<-cbind(ANN,RF,DA) #combo<-cbind(ANN,BR,DA) #combo<-cbind(RF,BR,DA) # FIVE MODEL COMBINATIONS #combo<-cbind(LG,CT,MARS,ANN,RF) #combo<-cbind(LG,CT,MARS,ANN,BR) #combo<-cbind(LG,CT,MARS,ANN,DA) #combo<-cbind(LG,CT,MARS,RF,BR) #combo<-cbind(LG,CT,MARS,RF,DA) #combo<-cbind(LG,CT,MARS,BR,DA) #combo<-cbind(LG,CT,ANN,RF,BR) #combo<-cbind(LG,CT,ANN,RF,DA) #combo<-cbind(LG,CT,ANN,BR,DA) #combo<-cbind(LG,CT,RF,BR,DA) #combo<-cbind(LG,MARS,ANN,RF,BR) #combo<-cbind(LG,MARS,ANN,RF,DA) #combo<-cbind(LG,MARS,ANN,BR,DA) #combo<-cbind(LG,MARS,RF,BR,DA) #combo<-cbind(LG,ANN,RF,BR,DA) #combo<-cbind(CT,MARS,ANN,RF,BR) #combo<-cbind(CT,MARS,ANN,RF,DA) #combo<-cbind(CT,MARS,ANN,BR,DA) #combo<-cbind(CT,MARS,RF,BR,DA) #combo<-cbind(CT,ANN,RF,BR,DA) #combo<-cbind(MARS,ANN,RF,BR,DA) # SEVEN MODEL COMBINATION

135

#combo<-cbind(LG,CT,MARS,RF,ANN,BR,DA) combo1<-apply(combo,1,mean) # take mean of models being combined (e.g. combo#) as.matrix(combo1)->t # load mean predictions from combos as matrix round(t)->RoundAll # round predictions (consensus) matrix(nrow=iters,ncol=1)->OC # make matrix for overall classificaiton (OC) matrix(nrow=iters,ncol=1)->presence # make matrix for overall presence matrix(nrow=iters,ncol=1)->absence # make matrix for overall absence cbind(model[,1],RoundAll)->temp3 # combine obs with ensemble rowSums(temp3)->temp4 # sum rows (obs + ensemble), if = 2 then true presence, if = 1 then disagree, if = 0 then true absence as.matrix(temp4)->tempsum # place into matrix called tempsum for (k in 1:iters){ # start loop OC[k,]<-if (RoundAll[k,]==model[k,1]) 1 else 0 # compare combo (ie. ensemble) to obs for each observation in matrix (iters by predictions), if they agree then 1, otherwise 0 presence[k,]<- if (tempsum[k,]==2) 1 else 0 # make vector of true presences absence[k,]<-if (tempsum[k,]==0) 1 else 0 # make vector of true absences } (sum(OC/iters))->TrueOC # rate of overall classification sd(combo1)->SDModels # SD of ensemble #error <- qt(0.95,df=length(combo1$vals)-1)*sd(combo1$vals)/sqrt(length(combo1$vals)) (sum(presence/iters*2))->Truepr # sensitivity rate (sum(absence/iters*2))->Trueab # specificity rate Truepr Trueab TrueOC

136

Chapter 6: Using consensus methods to identify (and reduce) sensitivity from methodological choices when measuring functional diversity

Abstract Functional diversity indices have become important tools for measuring variation in species

characteristics that are relevant for ecosystem services. Recently, a popular method for

measuring functional diversity, FD, was shown to be sensitive to methodological choices in its

calculation. The objective of this study was to determine whether consensus methods can be used

to identify situations where methodological choices may be an issue when measuring

dendrogram-based functional diversity, FD. To calculate FD, a distance measure and a

clustering method must be chosen. Using data from natural communities, this study

demonstrates that consensus methods were able to determine instances where the choice of

distance measure (Euclidean and cosine) and clustering method (UPGMA, complete and single

linkage) produced qualitatively different relationships across communities and markedly

different dendrogram topologies. In particular this study highlights how consensus methods

may aid in the choice of a particular index of functional diversity with the hope that such

discussions may improve biodiversity-ecosystem studies.

Keywords: functional diversity, clustering, distance measures, community composition,

biodiversity, ecosystem productivity, ecological organization; index.

137

Introduction The association between ecosystem properties and levels of species diversity is well studied

(Tilman 1997, Symstad 1998, Loreau et al. 2001). This association was thought to be driven by

the tendency for species-rich communities to have wider variation in functional traits (Diaz and

Cabido 2001, Heemsbergen et al. 2004, Hooper et al. 2005). The importance of functional traits

has led to the development of indices of functional diversity aimed at quantifying functional trait

variation; the evaluation of functional diversity indices continues to be an area of active research

(Walker 1999, Symstad 2000, Petchey and Gaston 2002, Mason et al. 2003, 2005, Naeem and

Wright 2003, Botta-Dukat 2005, Mouillot et al. 2005). Several studies (Cardinale et al. 2000,

Petchey and Gaston 2002) have concluded that ecosystem function tend to correlate more

strongly with functional diversity indices than with species diversity indices. Results such as

these have spurred interest in developing new and improved functional diversity indices that

incorporate ecosystem functions (Wright et al. 2006).

To calculate most functional diversity indices, a method is required for quantifying interspecific

differences in functional traits. In cases where there is only one trait of interest, simple

approaches may be appropriate, such as the weighted-trait variation (FDVar; Mason et al. 2003)

or the functional evenness (known as functional regularity; Mouilet et al. 2005). However, the

flexibility to use more than one trait often is required to understand even simple natural systems

and in such cases, the inclusion of trait matrices, distance measures, and sometimes

dendrograms, is required. Unfortunately, the use of these multivariate statistical procedures

introduces complications that require researchers to make several key decisions for data analysis.

Ultimately, these decisions should have minimal effect on patterns of species characteristics as

they relate to ecosystem function. However, recent studies (e.g. Poos et al. 2009) have shown

that these methodological decisions may be more important than thought here to fore. Not

surprisingly there are countless ways to incorporate multivariate functional variation into

measures of diversity.

A popular index, known as FD (Petchey and Gaston 2002), measures functional diversity as the

total branch length of a dendrogram based on functional traits. To produce a dendrogram several

decisions need to be made. First, the number and type of traits important to ecosystem function

need to be identified. Second, a distance measure needs to be chosen that characterizes the

138

relative differences among species based on their traits. Finally, a clustering algorithm is needed

to produce a dendrogram that hierarchically segregates species into functional groups based on

their relative distances (Petchey and Gaston 2002). This method has been criticized recently for

having the additional subjective step of clustering traits onto a dendrogram (Podani and Schmera

2006; Poos et al. 2009). Although standard methods may provide one way to reduce

subjectivity, it is unlikely that a single distance measure or clustering algorithm can be used in all

circumstances (Poos et al. 2009). Therefore, methods should be sought which can be used to

identify sensitivity.

Despite the common use of FD as a measure, claims related to whether FD can be used to derive

ecologically robust conclusions have only been quantitatively evaluated recently (Podani and

Schmera 2006; Poos et al. 2009). For example, Poos et al. (2009) showed that the probability of

two random assemblages showing contrasting levels of functional diversity ranged from 0 to as

high as 97.6%. Recently, consensus methods have been suggested as a means for providing a

standardized approach for dealing with variation in methodological issues (Mouchet et al. 2008;

Poos and Jackson, submitted). However, there are difficulties when employing consensus

methods to methodological choices in functional diversity, such as how many initial methods are

needed to build the optimal consensus (Poos et al. 2009), and to what degree adding poorly fitted

models may undermine a consensus approach (Poos and Jackson, submitted). Therefore the

objective of this study is to determine whether consensus methods can be used to understand

when sensitivity in measuring FD may be an issue. For this purpose, sensitivity is defined in two

ways. First, a sensitive index is defined as one where the same qualitative trends in functional

diversity across communities do not persist despite methodological choices (e.g. distance

measure and clustering algorithm). This definition is meant to compare the broad ecological

consequences of applying FD to ecological communities, with the implicit assumption that a

robust index should provide qualitatively repeatable trends. Second, a sensitive index is defined

as one where the identified dendrogram topologies persist. This definition requires that the

produced patterns of species groupings are maintained. If FD is not robust in this sense, it would

suggest that the implication of using FD may be unclear. Explicit recognition of the effects of

using a dendrogram, and the decisions needed to get there (e.g. choosing a distance measure and

clustering algorithm) need to be better understood, so appropriate guidelines for making these

decisions can be formulated.

139

Methods

In this study, the same five data sets used in previous studies of FD (Petchey and Gaston 2002,

Podani and Schmera 2006; Poos et al. 2009). These datasets represent a variation in the number

and type of species (from 13 to 37), and the number and type of functional traits (from 6 to 27).

For example, the three vertebrate datasets use characteristics ranging from foraging behavior to

the consumption of prey species as their functional traits (Holmes et al. 1979, Jaksic and Medel

1990, Munoz and Ojeda 1997), whereas, the remaining two datasets rely on vegetative

characteristics, such as rooting depth and herbivore palatability, of the plants being studied

(Golluscio and Sala 1993, Chapin et al. 1996).

The measure functional diversity, FD, is based on the total branch length of a dendrogram of

functional traits. To obtain this dendrogram, species traits must be assigned a distance (or

resemblance) measure and clustering algorithm. Distance measures quantify the association

between two entities based on their characteristics (e.g. species based on their functional traits).

There are a large number of distance measures from which to choose depending on the data

(Jackson et al. 1989; Legendre and Legendre 1998; Podani 1999). Two distance measures were

used: Euclidean distance as suggested by Holmes et al. (1979), and cosine distance. Cosine

distance was used because it down-weights the potential over-fit created by covarying traits

(Legendre and Legendre 1998), a problem often encountered when analyzing functional traits of

species (Petchey and Gaston 2006), whereas, Euclidean distance emphasizes larger values, in

particular where positive covariance exists between traits. All trait matrices were standardized so

that all traits have a mean = 0 and variance =1 (i.e. z-scores; Holmes et al. 1979, Gaston and

Petchey 2002).

Variability in ecological data is often associated with just a few entities of which clustering into

key groupings can provide insight, such as the clustering of species based on functional traits

(Legendre and Legendre 1998; Podani 1999). Three clustering algorithms were used in this

analysis, unpaired pair group method with arithmetic mean (UPGMA), single linkage (i.e.

nearest neighbor) and complete linkage (i.e. maximum or farthest neighbor). These algorithms

represent natural endpoints across a methodological continuum of hierarchical clustering

algorithms, where single linkage lies on one end, complete linkage on the other, and UPGMA

lies somewhere in the middle (Gordon 1999, Podani and Schmera 2006; Poos et al. 2009).

140

Using Consensus Methods to Identify Uncertainty when Measuring FD

To determine whether distance measure or clustering algorithm influenced dendrogram

topologies of FD, a routine was developed (in MatLAB version 7.1) to randomize the removal of

n species from the dataset and recalculate FD for each species combination, clustering algorithm

and distance measure. Each level of n species was replicated 1000 times. Functional diversity

(FD) was measured at each species richness interval as the total distance of branches in the

dendrogram. As FD measures the total branch lengths of a functional dendrogram, which relies

on clustering method and distance measure, all dendrograms were rescaled to value between 0-1

using the full species model. The range in FD at the full species level at each different clustering

method and distance measure was summarized.

The initial dendrogram was compared to each variation in distance measure using consensus

trees (Margus and McMorris 1981, Rohlf 1982) in NT-SYS (Rohlf 1997). Dendrograms were

compared using the consensus index CI(C) (Rohlf 1982, 1997). Unlike cophenetic correlation

(e.g. Mouchet et al. 2008), which compares a dendrogram to the un-modeled raw data, the

consensus index compares the similarity of dendrograms based on their cluster membership

(Sokal and Rohlf 1962, Shao and Soskal 1986, Legendre and Legendre 1998). The 50%

majority rules consensus index was used where a value of one indicates all subgroups share at

least 50% membership (i.e. the consensus tree is completely bifurcated indicating similar

topology between the original trees) and a value of zero indicates no subgroups are shared

(Jackson et al. 1989, Lapointe and Legendre 1990). Although a more strict measure of

consensus can be used, the use of a 50% majority rules leads to a more liberal assessment of the

similarity between trees than a strict measure would provide.

141

Results

The Relationship between FD, Distance Measure & Clustering Algorithm

The relationship between FD and community type changed with distance measure and clustering

algorithm. In particular, a change in distance measure caused communities to be ranked

differently with respect to FD (Figure 6.1). Differences in levels of FD between communities

changed with either a change in distance measure or a change in clustering method. For

example, FD for predatory vertebrate communities was more similar to Patagonian forb

communities using UPGMA and Euclidean distance, but more similar to intertidal fish

communities when the clustering algorithm was changed to complete linkage (Figure 6.1).

Similarly, the Patagonian forb communities showed similar levels of FD with three different

communities, depending on the distance measure and clustering method (Figure 6.1). Rescaling

FD did not change these conclusions.

Altering the clustering algorithm had a greater impact on the measured variation of FD than

altering of distance measure. On average, the overall choice of distance measure and clustering

algorithm accounted for a range of 27.4% in the measured amount of functional diversity. At

maximum species richness measured functional diversity ranged between a minimum of 21%

and a maximum of 61% (mean 34.2%) across clustering methods. Similarly, at maximum

species richness the measured amount of functional diversity ranged between a minimum of 12%

and a maximum of 41% (mean 20.5%) based on distance measure (Figure 6.2). Qualitatively,

single linkage showed the smallest amount of functional diversity, whereas, complete linkage

showed the largest.

142

Figure 6.1 - The relationship between distance measure (Euclidean or cosine) and clustering algorithm (SL = single linkage / nearest neighbor, UPGMA = unweighted pair group method with arithmetic mean, CL = complete linkage / farthest neighbor) with FD using five community data sets: A) Arctic vegetation (Chapin et al. 1996); B) Insectivorous birds (Holmes et al. 1979); C) Patagonian forbs (Golluscio and Sala 1993); D) Intertidal fish (Munoz and Ojeda 1997); and, E) Predatory vertebrates (Jaksic and Medel 1990). FD values were re-scaled relative to the Arctic vegetation data, which has the highest FD values. This standardization leads to the appearance of a constant outcome for the Arctic dataset, but this consistency is solely an artifact of using it as the reference point rather than the outcomes not differing depending on the resemblance measure or clustering algorithm.

There was large variation in the measured amount of functional diversity as it relates to the

removal of species (Figure 6.2). Not surprisingly, FD was related to species richness, with

communities with less species showing smaller ranges of FD. In general, there was a greater

similarity between the fitted functional curves for FD and clustering algorithms than the

functional curves for FD and similar distance measures. There were two exceptions to this

generality. First, the predatory vertebrate dataset showed functional relationships more closely

related to similar distance measures (Figure 6.2D), whereas, the curve fitted with UPGMA

clustering algorithm and Euclidean distance of the intertidal fishes dataset was more closely

related to the curve based on cosine distance and single linkage, than a similar clustering

algorithm or distance measure. Overall, conclusions regarding the qualitative relationship of

functional diversity among communities were not robust to methodological choices (i.e. FD was

not consistent across communities).

143

A) B)

C) D)

E) Figure 6.2 - The relationship between distance measure (solid lines = Euclidean distance, dashed lines = cosine distance) and building a dendrogram using a clustering algorithm (1 = complete linkage / farthest neighbor, 2 = unweighted pair group method with arithmetic mean / UPGMA, 3 = single linkage / nearest neighbor) where species are individually removed when calculating FD. Five community data sets are shown: A) Arctic vegetation (Chapin et al. 1996), B) Insectivorous birds (Holmes et al. 1979), C) Patagonian forbs (Golluscio and Sala 1993), D) Intertidal fishes (Munoz and Ojeda 1997), and E) Predatory vertebrates (Jaksic and Medel 1990). Shown inset are 50% majority rule consensus trees demonstrating lack of between group fidelity of species where calculating functional diversity using different distance measures, but the same clustering approach.

144

Identifying Sensitivity in FD Using Consensus Methods

The overall low value of the CI(C) index indicates that the decision of choosing a distance

measure influences the overall dendrogram to such an extent that there was little resemblance

between the dendrogram based on Euclidean distance and the dendrogram based on cosine

distance (Table 6.1; Figure 6.2 inset). A comparison of consensus values indicates that FD

identified different dendrogram groups (only 46-51% similarity) depending on the distance

measure or clustering algorithm (unpublished results) used (Table 6.1).

Table 6.1 – Consensus measures of dendrogram group fidelity across distance measures (Euclidean and cosine) for each clustering algorithm: single linkage, unweighted pair group method with arithmetic means (UPGMA), and complete linkage. Group fidelity was determined by majority rules consensus tress using CI(C) consensus index.

Data set No. species

Complete Linkage UPGMA Single

Linkage Insectivorous birds 22 0.55 0.35 0.35 Intertidal fish 13 0.55 0.45 0.55 Patagonian forbs 24 0.45 0.50 0.41 Predatory vertebrates 11 0.22 0.55 0.33 Arctic vegetation 37 0.69 0.71 0.66 Average --- 0.49 0.51 0.46

The clustering algorithm did not improve the similarity between functional topologies. For

example, single linkage, unweighted pair group methods using arithmetic mean, and complete

linkage all showed similar rates of consensus tree resemblance, regardless of the size of the tree

or the dataset used (Table 6.1). Overall, dendrogram topologies were not robust to the choice of

distance measure or clustering algorithm.

Discussion Functional diversity has become an important, but controversial focus of research at the

boundary between community and ecosystem ecology (Tilman 2000, Mason et al. 2003, Leps et

al. 2006; Poos et al. 2009). This study focused on a popular measure of functional diversity: the

total branch length of a functional dendrogram, known as FD (Petchey & Gaston 2002). The

results demonstrate that consensus methods were able to identify instances where FD was not

robust to the choice of distance measure or clustering algorithm. Specifically, consensus

methods indicated that both definitions of robustness were not supported. First, the qualitative

145

relationships did not persist across communities (Figure 6.1). Second, the dendrogram

topologies of communities measured using FD differed with the decision of clustering method

and distance measure (Figure 6.2). These results were all in agreement with previous

quantitative analysis of FD (Poos et al. 2009).

The ability of consensus methods to determine qualitative differences of measuring functional

diversity across communities have ecological repercussions to many biodiversity-ecosystem

functioning studies. It is commonly assumed that diversity is a relative concept; that is, the

diversity of a community can only be judged vis-à-vis another community (Magurran 2004). In

this case, the conclusions reached by comparing communities based on functional diversity

would depend on the distance measure used to create the dendrogram and, therefore,

relationships among communities would be altered (Figure 6.1). For example, the FD for the

Patagonian forbs community was identical to predatory vertebrate community using single

linkage and Euclidean distance. However, if the distance measure was altered to cosine distance,

the Patagonian forbs community more closely resembled the insectivorous bird communities,

whereas, a change in clustering algorithm would show the Patagonian forbs community more

closely resembled the intertidal fish community. In this study, a difference in clustering

algorithm or distance measure altered community relationships altogether, and these

relationships persisted regardless of the scaling used. These differences have a high likelihood

of altering ecological interpretations of functional diversity across communities that, in turn, will

potentially confound ecosystem-based studies.

Different methods of calculating FD can lead to different dendrograms, and consequently

different measures of functional diversity. For example, data analyzed using different distance

measures and clustering algorithms varied by a range of 27% in the measured amount of

functional diversity at maximum species richness (Figure 6.2). Therefore, studies which use

dendrogram-based methods of measuring functional diversity may drastically under or over

estimate the amount of functional diversity, assuming some “true” value can be determined.

Therefore one may assume two communities are more similar using one method when they may

not be when based on another measure. Finally, decisions are required regarding how the data

are treated - for example, depending on how tied values in a similarity matrix are treated, a

number of different dendrograms can be produced (Jackson et al. 1989). As the clustering

technique will produce dendrograms (with the underlying goal of determining group structure)

146

whether or not true groups exist (Jackson et al. 1989), the cumulative effect of these decisions to

the relevance of the identified groups may be unknowingly large. For example, this study

suggests that the decision of adding distance measure and clustering algorithm to produce a

dendrogram will alter the range in the measure of FD (e.g. 34.2% for clustering algorithm and

20.5% for distance measure) and the ecological conclusions reached (Fig. 6.1).

Identifying areas where sensitivity may be an issue can aid in developing more robust or

representative indices of functional diversity. Recently, studies have attempted to use consensus

methods, which take the average of several different methodological approaches, to reconcile

differences in FD from methodological issues (Mouchet et al. 2008). However, previous

applications of consensus methods may not be sufficient in determining where sensitivity may be

an issue. In this paper, only 46-51% of the topologies were in agreement (Table 6.1) and there

was large variation in the measurement of FD across methodological choices and species (Figure

6.2). Although consensus methods were useful in identifying issues of sensitivity, using

consensus methods as a standardized approach may not be appropriate because averaging three

poorly associated models together may actually produce a worse outcome (Poos and Jackson

submitted). In addition, previous applications of consensus methods provide several more

decisions – such as how many initial models are needed to build the optimal consensus, and

which consensus method is most appropriate (Poos et al. submitted) – which may further impact

FD (Poos et al. 2009). There is considerable debate regarding the most appropriate measure of

functional diversity and the qualities that metric should possess (Loreau et al. 2001, Mason et al.

2003, Ricotta 2005, Leps et al. 2006). Clearly, regardless of the index used, any index of

functional diversity must be robust (e.g. qualitatively similar across methods and dendrograms)

to decisions inherent in its calculation or one must decide upon a common statistical

methodology in order to permit comparisons amongst studies. Quantitative comparisons of how

functional diversity indices differ are rare (e.g. Petchey et al. 2004; Walker et al. 2008), and

evaluations of other functional diversity indices are needed. In calculating FD, the decisions

inherent in its calculation represent two additional difficulties aside from previous criticisms of

which species, what kind of diversity, and which ecosystem function (Bengtsson 1998, Symstad

et al. 1998, Cardinale 2000, Jax 2005) are to be included and therefore should also define the

choice of similarity measure and clustering algorithm. Further criticisms, such as how many

functional traits (Walker et al. 1999, Podani and Schmera 2006), what qualifies as a functional

147

group (Petchey and Gaston 2006), and what type of consensus approach, also apply. Explicit

recognition and justification of each of these decisions is warranted for improving functional

diversity research.

Acknowledgements Funding NSERC & OGS Scholarships to M.S.P and S.C.W., NSERC Discovery Grant to D.A.J.,

Ontario Ministry of Natural Resources, and University of Toronto.

References Bengtsson, J. 1998. Which species? What kind of diversity? Which ecosystem function? Some

problems in studies of relations between biodiversity and ecosystem function. Applied Soil

Ecology 10: 191-199.

Botta-Dukat, Z. 2005. Rao's quadratic entropy as a measure of functional diversity based on

multiple traits. Journal of Vegetation Science 16: 533-540.

Cardinale, B. J., K. Nelson, and M.A. Palmer. 2000. Linking species diversity to the functioning

of ecosystems: On the importance of environmental context. Oikos 91: 175-183.

Chapin, F. S. I., M.S. Bret-Harte, S.E. Hobbie, and Z. Hailan. 1996. Plant functional types as

predictors of transient responses of arctic vegetation to global change. Journal of Vegatation

Science 7: 347-358.

Diaz, S., and M. Cabido. 2001. Vive la difference: plant functional diversity matters to

ecosystem processes. Trends in Ecology & Evolution 16: 646-655.

Golluscio, R. A., and O.E. Sala. 1993. Plant functional types and ecological strategies in

Patagonian forbs. Journal of Vegatation Science 4: 839-846.

Gordon, A. D. 1999. Classification, 2nd ed. Chapman and Hall, London, United Kingdom.

Heemsbergen, D. A., M.P. Berg, M. Loreau, J.R. van Hal, J.H. Faber, and H.A. Verhoef. 2004.

Biodiversity effects on soil processes explained by interspecific functional dissimilarity.

Science 306: 1019.

148

Holmes, R. T., R.E.J. Bonney, and S.W. Pacala. 1979. Guild structure of the Hubbard Brook bird

community: a multivariate approach. Ecology 60: 512-520.

Hooper, D. U., F.S. Chapin, J.J. Ewel, A. Hector, P. Inchausti, S. Lavorel, J.H. Lawton, D.M.

Lodge, M. Loreau, S. Naeem, B. Schmid, H. Setala, A.J. Symstad, J. Vandermeer, and D.A.

Wardle. 2005. Effects of biodiversity on ecosystem functioning: A consensus of current

knowledge. Ecological Monographs 75: 3-35.

Jackson, D. A., K.M. Somers, and H.H. Harvey. 1989. Similarity coefficients: measures of co-


453.

Jaksic, F. M., and R.G. Medel. 1990. Objective recognition of guilds: testing for statistically

significant species clusters. Oecologia 82: 87-92.

Jax, K. 2005. Function and "functioning" in ecology: What does it mean? Oikos 111: 641-648.

Lapointe, F.J., and P. Legendre. 1990. A statistical framework to test the consensus of two nested

classifications. Systematic Zoology 39: 1-13.

Legendre, P., and Legendre, L. 1998. Numerical ecology, 2nd ed. Elsevier B.V.

Leps, J., F., de Bello, S. Lavorel, and S. Berman. 2006. Quantifying and interpreting functional

diversity of natural communities: practical considerations matter. Preslia 78: 481-501.

Loreau, M., S. Naeem, P. Inchausti, J. Bengtsson, J.P. Grime, A. Hector, D.U. Hooper, M.A.

Huston, D. Raffaelli, B. Schmid, D. Tilman, and D.A. Wardle. 2001. Biodiversity and

ecosystem functioning: Current knowledge and future challenges. Science 294: 804-808.

Margurran, A. E. 2004. Measuring Biological Diversity. Blackwell Publishing, Oxford, United

Kingdom.

Margus, T., and F.R. McMorris. 1981. Consensus n-trees. Bulletin of Mathematical Biology 43:

239-244.

149

Mason, N. W. H., K. MacGillivray, J.B. Steel, and J.B. Wilson. 2003. An index of functional

diversity. Journal of Vegetation Science 14: 571-578.

Mason, N. W. H., D. Mouillot, W.G. Lee, and J.B. Wilson. 2005. Functional richness, functional

evenness and functional divergence: the primary components of functional diversity. Oikos

111: 112-118.

Mouillot, D., W.H.N. Mason, O. Dumay, and J.B. Wilson. 2005. Functional regularity: a

neglected aspect of functional diversity. Oecologia 142: 353-359.

Munoz, A. A., and F.P. Ojeda. 1997. Feeding guild structure of a rocky intertidal fish

assemblage in central Chile. Environmental Biology of Fishes 49: 471-479.

Naeem, S., and J.P. Wright. 2003. Disentangling biodiversity effects on ecosystem functioning:

deriving solutions to a seemingly insurmountable problem. Ecology Letters 6: 567-579.

Petchey, O. L., and K.J. Gaston. 2002. Functional diversity (FD), species richness and


Petchey, O. L., and K.J. Gaston. 2006. Functional diversity: back to basics and looking forward.


Petchey, O.L., A. Hector and K.J. Gaston. 2004. How do different measures of functional

diversity perform. Ecology 85: 847-857.

Podani, J. 1999. Extending Gower's general coefficient of similarity to ordinal characters. Taxon

48: 331-340.

Podani, J., and D. Schmera. 2006. On dendrogram-based measures of functional diversity. Oikos

115: 179-185.

Poos, M. S., and D. A. Jackson. Conservation by consensus: reducing uncertainties in modeling

the distribution of an endangered species using habitat-based ensemble models. Ecological

Applications (submitted).

150

Poos, M. S., S. W. Walker, and D. A. Jackson. 2009. Functional-diversity indices can be driven

by methodological choices and species richness. Ecology 90:341-347.

Ricotta, C. 2005. A note on functional diversity measures. Basic and Applied Ecology 6: 479-

486.

Rohlf, F. J. 1982. Consensus indices for comparing classifications. - Mathematical Biosciences

59: 131-144.

Rohlf, F. J. 1997. NTSYSpc Numerical Taxonomy and Multivariate Analysis System. Vol. 2.0.

Exeter Software, Setauket, New York.

Shao, K. and R.R. Sokal. 1986. Significance tests of consensus indices. Systematic Zoology 35:

582-590.

Simpson, E. H. 1949. Measurement of diversity. Nature 163: 688.

Sokal, R. R., and F.J. Rohlf. 1962. The comparisons of dendrograms by objective methods.

Taxon 11:33-40.

Symstad, A. J., D. Tilman, J. Willson, and J.M.H. Knops. 1998. Species loss and ecosystem

functioning: Effects of species identity and community composition. Oikos 81: 389-397.

Symstad, A. J. 2000. A test of the effects of functional group richness and composition on

grassland invasibility. Ecology 81: 99-109.

Tilman, D., J. Knops, D. Wedin, P. Reich, M. Ritchie, and E. Siemann. 1997. The Influence of

functional diversity and composition on ecosystem processes. Science 277: 1300-1303.

Tilman, D. 2000. Causes, consequences and ethics of biodiversity. Nature 405: 208-211.

Walker, B.H., A. Kinzig, and J. Langridge. 1999. Plant attribute diversity, resilience, and

ecosystem function: the nature and significance of dominant and minor species. Ecosystems

2: 95-113.

Walker, S.C, M.S. Poos, and D.A. Jackson. 2008. Functional rarefaction: estimating functional

diversity from field data. Oikos 117: 286-296.

151

Wright, J. P., S. Naeem, A. Hector, C. Lehman, P.B. Reich, B. Schmid, and D. Tilman. 2006.

Conventional functional classification schemes underestimate the relationship with

ecosystem functioning. Ecology Letters 9: 111-120.

152

Appendices

Appendix 6.1 – MatLAB Code for calculating total branch lengths of dendrograms from various species combinations

function branches = branch_lengths(Z) linkages = size(Z,1); species = linkages+1; branches = zeros(linkages, 2); for i = 1:linkages if (Z(i,1) < (1+species)) & (Z(i,2) < (1+species)) branches(i,1) = Z(i,3); branches(i,2) = Z(i,3); elseif (Z(i,1) > species) & (Z(i,2) < (1+species)) branches(i,1) = Z(i,3) - Z((Z(i,1)-species),3); branches(i,2) = Z(i,3); elseif (Z(i,1) < (1+species)) & (Z(i,2) > species) branches(i,1) = Z(i,3); branches(i,2) = Z(i,3) - Z((Z(i,2)-species),3); else branches(i,1) = Z(i,3) - Z((Z(i,1)-species),3); branches(i,2) = Z(i,3) - Z((Z(i,2)-species),3); end end

153

Chapter 7: General Conclusions

This thesis provides novel advancement in determining the influence of methodological choices

in conservation-based models and on how consensus methods may reduce uncertainty in such

models. In particular, this thesis demonstrates that regardless of the scale (local as in Chapter 5

or regional in Chapter 6), of the species, or study system in question, methodological choices

have the ability to dramatically impact resultant analyses and ecological inquiry. For example, in

Chapter 2, methodological choices had the ability to provide levels of sensitivity over 97% in the

measure of functional diversity. Similarly, in Chapter 4, methodological choices changed

estimates of population viability by several orders of magnitude. In Chapter 3, the addition or

removal of rare species did not impact multivariate analyses as much as choice of distance

measure or multivariate method, which had the ability to drastically alter bioassessments. As

these findings are novel, this chapter will highlight some conclusions regarding the importance

of methodological choices and attempt to provide recommendations for minimizing

methodological impacts.

A) Conservation-based Models in General

The importance of understanding the impact of methodological choices is not only timely, but

essential. With complex statistical software readily available, ecologists now have several dozen

approaches to choose from when developing conservation-based models. Perhaps not

surprisingly, this thesis demonstrates, as have previous authors, the importance of model type in

impacting results (e.g. Jackson et al. 1989; Jackson 1993; Guissan and Zimmerman 2000;

Thuiller 2005; Elith et al. 2006; Dormann et al. 2008; Marrimon et al. 2009). For example, in

Chapter 3, multivariate technique impacted analyses more than choice in distance measure or the

removal of rare species. This is in agreement with others who have demonstrated that rare

species may provide useful information (e.g. Cao et al 1998; 2001) and that selection of

multivariate method may influence interpretation strongly (e.g. Jackson et al. 1989; Jackson

1993). However, this thesis also demonstrates that despite laborious efforts in developing

modeling comparisons (Olden and Jackson 2002; Thuiller et al. 2005; Elith et al. 2006; Sharma

and Jackson 2008; Marrimon et al. 2009; this thesis), there has been no evidence of the pre-

eminence of any singular methodological approach. Issues related to methodological impacts

154

will vary depending on the dataset and objectives in question, such as which distance measure,

clustering algorithm, ordination (or other). Therefore, methods that can be used to reduce the

influence of methodological choices, or highlight when methodological choices may be an issue

- such as the use of consensus models - will help recognize and reduce the impact of

methodological choices.

B) Functional Diversity

Incorporating functional traits can be another way of improving modeling approaches for species

with conservation concern. Species’ functional characteristics strongly influence ecosystem

properties (Loreau et al. 2001; Hooper et al. 2005), and the understanding of the relationships

between functional diversity and community structure has been important for identifying

mechanisms of biodiversity effects. To date, the focus on functional traits has been on dominant

species (Grime 1998); although rare (presumably including imperiled) species can have a large

influence on ecosystem processes (Power et al. 1996).

Methodological choices in the measure of functional diversity remain a controversial topic

(Mason et al. 2003, Ricotta 2005, Podani and Schmera 2006). For example, this thesis (Chapters

2 and 6) demonstrates that the measure of functional diversity is greatly complicated by

methodological choices. Therefore, discussion is warranted on the future use of FD as a metric

of functional diversity and some of the qualities this metric poses. Ultimately, the choice of what

to do regarding the impact of methodological choices on measuring functional diversity may rely

on the advantages and disadvantages of the features of each approach. Many of these features

have been discussed previously (Mason et al. 2003, Hooper et al. 2005, Mouillot et al. 2005,

Ricotta 2005, Leps et al. 2006, Petchey and Gaston 2006; Schmera et al. 2009a; 2009b) and here,

properties of metrics of functional diversity are discussed plus are new insights into areas worthy

of future research.

One issue with metrics of functional diversity is whether or not they increase monotonically with

species richness (known as ‘set monotonicity’; Ricotta 2005; Schmera et al. 2009a). For

example, unlike Rao’s quadratic entropy (Rao 1982, Botta-Dukat 2005), FD has the intuitive

property of a monotonic relationship with species richness (Petchey and Gaston 2006). FD

cannot decrease when a species is added to a community, and when a species is removed FD

155

cannot increase (Petchey and Gaston 2002). However, in some cases FD violates this feature

(Podani and Schmera 2006) and Walker et al. (1999, 2008) have noted that functional attribute

diversity (FAD) previously has been misidentified as lacking this feature. Recent advancements

in measuring FAD (Schmera et al. 2009a) have helped to clarify the issue of set monotonicity in

FAD; however, future research into whether set monotonicity is a requirement for robust

estimation of functional diversity remains to be determined. For example, Rao’s quadratic

entropy is gaining currency as a flexible index of functional diversity as it is an intuitive

extension of Simpson’s index of diversity (Simpson 1949, Botta-Dukat 2005, Leps et al. 2006),

yet does not have set monotonicity (Pavoine and Bonsall 2009).

Given that a change in distance measure caused a related change in the qualitative relationships

of functional diversity across communities (in Chapter 6), it bears asking to what degree a

distance measure is required. Although the choice of distance measure involves subjectivity that

may influence the analysis, the inclusion of a distance measure allows for a continuous

segregation of multiple species based on multiple functional traits (Legendre and Legendre

1998). Therefore, the question of whether or not to include a distance measure depends to what

extent ecological diversity is based on the trait dissimilarity among species in a community

(Tilman 1997, Petchey and Gaston 2002) and to what degree a distance measure can distinguish

those traits and/or species. Few studies relate trait dissimilarity among species in a community

(Petchey and Gaston 2002, Garnier et al. 2004, Heemsbergen et al. 2004, Leps et al. 2006), and

unfortunately, even fewer studies determine the degree from which distance measure can

distinguish those trends. One mechanism to lower the subjectivity of the calculation in the

choice of distance measure is to provide standards based on the purpose of the study. Where

traits have a mixture of data types the use of Gower similarity may more applicable (Podani and

Schmera 2006), although quantitative comparisons with other distance measures are still needed.

Furthermore, unlike Euclidean distance, Gower similarity can be used with missing values and

has the advantage of not being influenced by the unit of measure (Gower 1971, Legendre and

Legendre 1998, Podani 1999). Future research should focus on quantitative comparisons of pair-

wise distance measures and ensuring they retain strong linkages to raw data on species traits.

The treatment of functionally redundant species in functional diversity indices remains

controversial. Central to this debate is whether an index of functional diversity should change if

a species is added or lost that is identical to one already present; a feature known as functional

156

redundancy (Walker et al. 2008; Schmera et al. 2009b). Petchey and Gaston (2006) contend that

a good index of functional diversity should not increase if a redundant species is added.

However, this feature may be a product of the number of traits used to segregate species, and the

ecosystem function in question, and not necessarily because the species provide no additional

function (Rosenfield 2002). More recently, improvements to the calculation of functional

diversity indices have allowed for comparison of functional diversity across communities

without sampling bias (Walker et al. 2008). This feature may provide additional benefit to

functional diversity indices as it deemphasizes the current trend in identifying functionally

redundant species and emphasizes the need to compare functional traits across broader

assemblages (Rosenfeld 2002).

There is a growing consensus that functional diversity is likely to be the component of

biodiversity most relevant to ecosystem function (Wright et al 2006). Recently, biodiversity

theory and management perspectives have converged, where each has embraced the need for

incorporating species-specific biology into models (Srivastava and Velland 2005). For example,

large-bodied species that occupy high trophic positions in food webs and occur at low abundance

are thought to be particularly vulnerable to extinction (Lawton and May 1995). The application

of functional traits, such as these, to improve models is becoming more widespread (Cardillo and

Bromham 2001; Olden et al. 2006), and their ability to improve models for species with

conservation concern needs to e assessed. The consideration of both diversity and evenness of

species traits may be an important consideration as evenness measures not only the range of

functional variation, but how much of the functional variation is filled within that range (Mason

et al. 2003 Mouillot et al. 2005). Newer approaches that disassemble functional evenness from

functional richness (e.g. Walker et al. 2008) and allow the researcher to distinguish one from the

other, may allow for better application of functional diversity indices to research on species with

conservation concerns.

157

Recommendations

• Methodological choices should be closely scrutinized when modeling species with

conservation concern. As this thesis (Section I) demonstrates, regardless of scale, study

system or ecological question, methodological choices had the ability greatly to impact

results.

• The use of singular statistical methods should be avoided when dealing with species

where data are deficient (as often is the case of species with conservation concern). This

dependency was true regardless of the scale - community level as in Chapters 2, 3, and 6,

population level as in Chapter 5, or the metapopulation level in Chapter 4. The need for

comparative approaches using multiple statistical methods has been highlighted as one

means to reduce problems related to conservation-based models (Guisan and Zimmerman

2000; Olden and Jackson 2002).

• In the case of modeling species with conservation concerns, singular statistical methods

should not be interpreted in isolation. As Chapter 6 demonstrated, several methods

provided contrasting explanatory relationships. These relationships, if taken in isolation,

may bias future conservation efforts in areas where species are not showing strong habitat

relationships.

• This thesis demonstrates that consensus methods may provide reduced uncertainty in

modeling species with conservation concerns. The advantages of consensus modeling

over singular approaches are numerous :

o It is an intuitive extension of modeling using singular approaches;

o Advancements in computing power have accelerated the use of new statistical

models, which can be added to this approach (e.g. random forest, boosted

regression);

o It reduces biases based on choosing singular statistical approaches, and the

resultant limitations of fitting data to that approach (especially if assumptions are

not met);

158

o It produces a prioritized model output that can be used to address conservation

priorities, identify areas of high conservation value … etc., even if there are

differences across the models and data is noisy;

o Model thresholds can be adjusted readily to prioritize the correct classification of

species presence or absence (e.g. a balance approach was used here in); and,

o Misclassifications can be identified and plotted spatially to identify areas where

error rates were high and data can be collected to refine the models.

References Araújo, M. B., and A. Guisán. 2006. Five (or so) challenges for species distribution modelling.


Araújo, M. B., and M. New. 2007. Ensemble forecasting of species distributions. Trends in


Araújo, M. B., R. J. Whittaker, R. J. Ladle, and M. Erhard. 2005. Reducing uncertainty in


14:529-538.

Botta-Dukat, Z. 2005. Rao's quadratic entropy as a measure of functional diversity based on

multiple traits. Journal of Vegetation Science 16: 533-540.

Cao Y., D.D. Williams, and N.E. Williams. 1998. How important are rare species in community

ecology and bioassessment. Limnology and Oceanography 43: 1403–1409.

Cao Y., D.P. Larsen, and R.S. Thorne. 2001. Rare species in multivariate analysis for

bioassessment: some consideration. Journal of the North American Benthological Society 20:

144–153.

Cardillo, M., and L. Bromham. 2001. Body size and risk of extinction in Australian mammals.

Conservation Biology 15: 1435-1440.

Cunningham, R.B., and D.B. Lindenmayer. 2005. Modeling count data of rare species: some

statistical issues. Ecology 86: 1135-1142.

159


Components of uncertainty in species distribution analysis: a case study of the great grey








Ellison, A.M., and A.A. Agrawal. 2005. The statistics of rarity. Ecology 86: 1079-1080.

Grime, J.P. 1998. Benefits of plant diversity to ecosystems: immediate, filter and founder effects.

Journal of Ecology 86: 902-910.

Garnier, E., J. Cortez, G. Billes, M. L. Navas, C. Roumet, M. Debussche, G. Laurent, A.

Blanchard, D. Aubry, A. Bellmann, C. Neill, and J. P. Toussaint. 2004. Plant functional

markers capture ecosystem properties during secondary succession. Ecology 85:2630-

2637.

Gower, J.C. 1971. A general coefficient of similarity and some of its properties. Biometrics 27:

857_874.

Guisan, A., and N. E. Zimmermann. 2000 Predictive habitat distribution models in ecology.


Green, R.H. and R.C. Young. 1993. Sampling to detect rare species. Ecological Application 3:

351-366.

Heemsbergen, D. A., M.P. Berg, M. Loreau, J.R. van Hal, J.H. Faber, and H.A. Verhoef. 2004.

Biodiversity effects on soil processes explained by interspecific functional dissimilarity.

Science 306: 1019.

Hooper, D. U., F. S. Chapin, J. J. Ewel, A. Hector, P. Inchausti, S. Lavorel, J. H. Lawton, D. M.

Lodge, M. Loreau, S. Naeem, B. Schmid, H. Setala, A. J. Symstad, J. Vandermeer, and

160

D. A. Wardle. 2005. Effects of biodiversity on ecosystem functioning: A consensus of

current knowledge. Ecological Monographs 75: 3-35.

Jackson, D.A. 1993. Multivariate analysis of benthic invertebrate communities: the implication


methods. Canadian Journal of Fisheries and Aquatic Sciences 50: 2641-2651.



453.


Lawton, J.H. and R.M. May (eds.). 1995. Extinction rates. Oxford University Press, London,

United Kingdom.

Legendre, P. and E.D. Gallagher. 2001. Ecologically meaningful transformations for ordination


Legendre, P., and L. Legendre. 1998. Numerical Ecology, 2nd ed. - Elsevier B.V.

Leps, J., F. de Bello, S. Lavorel, and S. Berman. 2006. Quantifying and interpreting functional

diversity of natural communities: practical considerations matter. Preslia 78: 481-501.

Loreau, M., S. Naeem, P. Inchausti, J. Bengtsson, J. P. Grime, A. Hector, D. U. Hooper, M. A.


ecosystem functioning: current knowledge and future challenges Science 294: 804-808.


consensus methods in predictive species distribution modelling. Diversity and Distributions

15: 59-69.

Mason, N. W. H., K. MacGillivray, J.B. Steel, and J.B. Wilson. 2003. An index of functional

diversity. Journal of Vegetation Science 14: 571-578.

161

Mouillot, D., W.H.N. Mason, O. Dumay, and J.B. Wilson. 2005. Functional regularity: A

neglected aspect of functional diversity. Oecologia 142: 353-359.

Olden, J. D., and D. A. Jackson. 2002. A comparison of statistical approaches for modelling fish

species distributions. Freshwater Biology 47:1976-1995.

Olden, J.D., N.L. Poff, and B.P. Bledsoe. 2006. Incorporating ecological knowledge into

ecoinformatics: An example of modeling hierarchically structured aquatic communities

with neural networks. Ecological Informatics 1: 33-42

Pavoine, S., and M. B. Bonsall. 2009. Biological diversity: distinct distributions can lead to the

maximization of Rao’s quadratic entropy Theoretical Population Biology 75:153-163.

Petchey, O. L. and K.J. Gaston. 2002. Functional diversity (FD), species richness and


Petchey, O. L. and K.J. Gaston. 2006. Functional diversity: back to basics and looking forward.


Podani, J. 1999. Extending Gower's general coefficient of similarity to ordinal characters. Taxon

48:331-340.


115: 179-185.

Power, M.E., D. Tilman, J.A. Estes, B.A. Menge, W.J. Bond, L.S. Mills, G. Daily, J.C. Castilla,

J. Lubchenco, and R.T. Paine. 1996. Challenges in the quest for keystones. BioScience 46:

609-620.

Rao, C.R. 1982. Diversity and dissimilarity coefficients—a unified approach. Theoretical

Population Biology 21: 24-43.

Ricotta, C. 2005. A note on functional diversity measures. Basic and Applied Ecology 6: 479-

486.

162

Rodríguez, J. P., L. Brotons, J. Bustamante, and J. Seoane. 2007. The application of predictive

modelling of species distribution to biodiversity conservation. Diversity and Distributions

13:243–251.

Rosenfeld, J. S. 2002. Functional redundancy in ecology and conservation. Oikos 98:156-162.

Schmera, D., T. Eros, and J. Podani. 2009a. A measure for assessing functional diversity in

ecological communities. Aquatic Ecology 43:157-167.

Schmera, D., J. Podani, and T. Eros. 2009b. Measuring the contribution of community members

to functional diversity. Oikos 118:961-971




Simpson, E. H. 1949. Measurement of diversity. Nature 163:688.

Srivastava, D. S., and M. Vellend. 2005. Biodiversity-ecosystem function research: Is it relevant

to conservation? Annual Review of Ecology, Evolution, and Systematics 36: 267-294.


Global Change Biology 10: 2020-2027.



102:8245-8250.

Tilman, D., J. Knops, D. Wedin, P. Reich, M. Ritchie, and E. Siemann. 1997. The Influence of

Functional Diversity and Composition on Ecosystem Processes. Science 277: 1300-1303.

Walker, B.H., A. Kinzig, and J. Langridge. 1999. Plant attribute diversity, resilience, and

ecosystem function: the nature and significance of dominant and minor species. Ecosystems

2: 95-113.

163

Walker, S. W., M. S. Poos, and D. A. Jackson. 2008. Functional rarefaction: estimating

functional diversity from field data. Oikos 117:286-296.

Wright, J. P., S. Naeem, A. Hector, C. Lehman, P. B. Reich, B. Schmid, and D. Tilman. 2006.

Conventional functional classification schemes underestimate the relationship with

ecosystem functioning. Ecology Letters 9:111-120.

conservation by consensus: reducing uncertainty from ... · conservation by consensus: reducing...

Documents