geographic profiling as a novel spatial tool for targeting...

12
704 Geographic profiling as a novel spatial tool for targeting the control of invasive species Mark D. Stevenson, D. Kim Rossmo, Robert J. Knell and Steven C. Le Comber M. D. Stevenson ([email protected]), R. J. Knell and S. C. Le Comber, School of Biological and Chemical Sciences, Queen Mary, Univ. of London, London E1 4NS, UK. – D. K. Rossmo, Center for Geospatial Intelligence and Investigation, Dept of Criminal Justice, Texas State Univ., 601 Univ. Drive, San Marcos, TX 78666, USA. Geographic profiling (GP) was originally developed as a statistical tool in criminology, where it uses the spatial locations of linked crimes (for example murder, rape or arson) to identify areas that are most likely to include the oender’s residence. e technique has been successful in this field, and is now widely used by police forces and investigative agencies around the world. Here, we show that this novel technique can also be used to identify source populations of invasive species, using their current locations as input, as a prelude to targeted control measures. Our study has two main parts. In the first, we use computer simulations to compare GP to other simple measures of spatial central tendency (centre of minimum dis- tance, spatial mean, spatial median), as well as to a more sophisticated single parameter kernel density model. GP performs significantly better than any of these other approaches. In the second part of the study, we analyse historical data from the Biological Records Centre (BRC) for 53 invasive species in Great Britain, ranging from marine invertebrates to woody trees, and from a wide variety of habitats (including littoral habitats, woodland and man-made habitats). For 52 of these 53 data sets, GP outperforms spatial mean, spatial median and centre of minimum distance as a search strategy, particularly as the number of sources (or potential sources) increases. We analyse one of these data sets, for Heracleum mantegazzianum, in more detail, and show that GP also outperforms the kernel density model. Finally, we compare fitted parameter values between dierent species, groups and habitat types, with a view to identifying general values that might be used for novel invasions where data are lacking. We suggest that geographic profiling could potentially form a useful component of inte- grated control strategies relating to a wide variety of invasive species. Invasive species are now viewed as the second most important driver of world biodiversity loss behind habitat destruction and have been identified as a significant compo- nent of global change (Vitousek et al. 1996, Wilcove et al. 1998). e cost of invasive species can run from millions to billions of dollars per occurrence (Mooney and Drake 1986, Pimentel et al. 2000). Invasive species have been shown to aect native species through predation and com- petition, by modifying ecosystem functions, by altering the abiotic environment and by spreading pathogens (Strayer et al. 2006, Ricciardi and Cohen 2007). For these reasons, prevention and control of invasive species has been identified as a priority for conservation organisations and government wildlife and agriculture ministries globally (Mooney and Drake 1986, Hulme 2006). Geographic profiling (GP) is a spatial modelling tool originally developed in criminology to help prioritise large lists of suspects in cases of serial crime. Typically, such cases involve too many, rather than too few, suspects; for example, the investigation of the Yorkshire Ripper murders in the UK between 1975 and 1980 generated 268 000 names (Doney 1990). In criminology, GP uses spatial data concerning the locations of connected crime sites to create a probability surface that is overlaid on the study area to produce a geoprofile. Geoprofiles do not provide an exact location for the criminal’s home, but rather allow the police to pri- oritise investigations by systematically checking suspects associated with locations in descending order of the height of these locations on the geoprofile, facilitating an opti- mal search process based on decreasing probability density (Rossmo 1993, 2000). In essence, the model has two components. First, there is a distance decay function, such that the probability of a crime drops with increasing distance from the criminal’s anchor point (usually a home or workplace). Second, there is an area, the buer zone, surrounding the criminal’s home in which crimes are less likely (note that the buer zone radius can be set to zero). e distance-decay function arises because travel – for either human criminals or inva- sive species – involves costs. e buer zone, in crimino- logy, arises partly because criminals may avoid locations too near their home, because of increased risk of being iden- tified, and partly because plane geometry dictates that, if potential crime sites are randomly distributed, the number Ecography 35: 704–715, 2012 doi: 10.1111/j.1600-0587.2011.07292.x © 2012 e Authors. Ecography © 2012 Nordic Society Oikos Subject Editor: Pedro Peres-Neto. Accepted 9 December 2011

Upload: ledat

Post on 19-Jan-2019

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

704

Geographic profiling as a novel spatial tool for targeting the control of invasive species

Mark D. Stevenson, D. Kim Rossmo, Robert J. Knell and Steven C. Le Comber

M. D. Stevenson ([email protected]), R. J. Knell and S. C. Le Comber, School of Biological and Chemical Sciences, Queen Mary, Univ. of London, London E1 4NS, UK. – D. K. Rossmo, Center for Geospatial Intelligence and Investigation, Dept of Criminal Justice, Texas State Univ., 601 Univ. Drive, San Marcos, TX 78666, USA.

Geographic profiling (GP) was originally developed as a statistical tool in criminology, where it uses the spatial locations of linked crimes (for example murder, rape or arson) to identify areas that are most likely to include the o!ender’s residence. "e technique has been successful in this field, and is now widely used by police forces and investigative agencies around the world. Here, we show that this novel technique can also be used to identify source populations of invasive species, using their current locations as input, as a prelude to targeted control measures. Our study has two main parts. In the first, we use computer simulations to compare GP to other simple measures of spatial central tendency (centre of minimum dis-tance, spatial mean, spatial median), as well as to a more sophisticated single parameter kernel density model. GP performs significantly better than any of these other approaches. In the second part of the study, we analyse historical data from the Biological Records Centre (BRC) for 53 invasive species in Great Britain, ranging from marine invertebrates to woody trees, and from a wide variety of habitats (including littoral habitats, woodland and man-made habitats). For 52 of these 53 data sets, GP outperforms spatial mean, spatial median and centre of minimum distance as a search strategy, particularly as the number of sources (or potential sources) increases. We analyse one of these data sets, for Heracleum mantegazzianum, in more detail, and show that GP also outperforms the kernel density model. Finally, we compare fitted parameter values between di!erent species, groups and habitat types, with a view to identifying general values that might be used for novel invasions where data are lacking. We suggest that geographic profiling could potentially form a useful component of inte-grated control strategies relating to a wide variety of invasive species.

Invasive species are now viewed as the second most important driver of world biodiversity loss behind habitat destruction and have been identified as a significant compo-nent of global change (Vitousek et al. 1996, Wilcove et al. 1998). "e cost of invasive species can run from millions to billions of dollars per occurrence (Mooney and Drake 1986, Pimentel et al. 2000). Invasive species have been shown to a!ect native species through predation and com-petition, by modifying ecosystem functions, by altering the abiotic environment and by spreading pathogens (Strayer et al. 2006, Ricciardi and Cohen 2007). For these reasons, prevention and control of invasive species has been identified as a priority for conservation organisations and government wildlife and agriculture ministries globally (Mooney and Drake 1986, Hulme 2006).

Geographic profiling (GP) is a spatial modelling tool originally developed in criminology to help prioritise large lists of suspects in cases of serial crime. Typically, such cases involve too many, rather than too few, suspects; for example, the investigation of the Yorkshire Ripper murders in the UK between 1975 and 1980 generated 268 000 names (Doney 1990). In criminology, GP uses spatial data concerning

the locations of connected crime sites to create a probability surface that is overlaid on the study area to produce a geoprofile. Geoprofiles do not provide an exact location for the criminal’s home, but rather allow the police to pri-oritise investigations by systematically checking suspects associated with locations in descending order of the height of these locations on the geoprofile, facilitating an opti-mal search process based on decreasing probability density (Rossmo 1993, 2000).

In essence, the model has two components. First, there is a distance decay function, such that the probability of a crime drops with increasing distance from the criminal’s anchor point (usually a home or workplace). Second, there is an area, the bu!er zone, surrounding the criminal’s home in which crimes are less likely (note that the bu!er zone radius can be set to zero). "e distance-decay function arises because travel – for either human criminals or inva-sive species – involves costs. "e bu!er zone, in crimino-logy, arises partly because criminals may avoid locations too near their home, because of increased risk of being iden-tified, and partly because plane geometry dictates that, if potential crime sites are randomly distributed, the number

Ecography 35: 704–715, 2012 doi: 10.1111/j.1600-0587.2011.07292.x

© 2012 "e Authors. Ecography © 2012 Nordic Society Oikos Subject Editor: Pedro Peres-Neto. Accepted 9 December 2011

Page 2: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

705

of potential crime sites increases with the distance from the home. In biology, there is also ecological evidence for the existence of a bu!er zone, for example in trees and in bee species (Dramstad 1996, Saville et al. 1997, Singh et al. 2001).

Geographic profiling has been successfully applied in criminology, and it is now widely used by police forces around the world. Recently, it has been applied to biological data (Le Comber et al. 2006, 2011, Martin et al. 2009, Raine et al. 2009). Here, we extend this work to consider invasive species. In this analysis, sites colonised by the species in ques-tion are analogous to crime sites, while the source or sources of the invasion are analogous to the criminal’s home.

If geographic profiling is to provide an e!ective method for locating source populations of invasive species, it first of all needs to be demonstrated that it can do better than other methods of prioritizing searches. In addition, since it is likely that many or even most invasions will concern species where data are, at least initially, lacking, it would clearly be helpful if general model values could be used for di!erent groups of taxa (for example, what values of the bu!er zone radius are most appropriate for woody trees, or marine invertebrates?). In this study, we address both of these issues. Specifically, we ask 1) can geographic profil-ing be used to locate sources of invasive species? 2) If so, is GP more e#cient than other simple approaches such as spatial mean, spatial median or centre of minimum dis-tance, or a more complex single parameter kernel density model? 3) Is it possible to use general parameter values for di!erent species or groups or for species occupying di!erent types of habitats in cases where data may be lacking? 4) Do model variables alter over time, specifically in the earlier or later stages of invasions?

Methods

General approach

Our general approach was to fit the GP model parameters using species locations at time t in such a way that they optimally predict species locations at time t–1, and then validate the model by using the same fitted parameters to test whether the locations at time t–1 predict the locations at time t–2. In this way, model fitting and model testing are independent. In this case, the time steps chosen were decades (e.g. 1960–1969, 1970–1979). For each of the spe-cies analysed, three decades were chosen (times t, t–1 and t–2). "e decades selected were not the same for each species, since decades where there were large data sets were preferen-tially chosen; early datasets, where data might be expected to be unreliable and susceptible to di!erences in sampling e!ort, were also avoided.

Geographic profiling model

A full description of the model can be found in Rossmo (2000). Here, we use a slight variant, introduced by Le Comber et al. (2006), which uses Euclidean rather than Manhattan distances (this approach was chosen as there is no reason that invasive species should move in the restricted pattern defined by urban (particularly North American) street layouts). "e geographic profiling func-tion generates a prioritised surface that describes the opti-mal search pattern for the sources of invasive species. For each point (i, j) within the target area, the score function (p) is calculated:

Sums probability across all ‘crimesites’ (invasion locations)

Turns first term off inside the bufferzone radius, and on outside

Uses parameter f to specify distancedecay moving outwards from the bufferzone radius. This reflects the fact that

dispersal probability declines withdistance

pij = k

Uses f and g to specify the increase in dispersalprobability moving away from the source,

reaching a maximum value at a distance equal tothe radius of the buffer zone. This reflects the

reduced probability of dispersal within thebuffer zone

Turns first term on inside thebuffer zone radius, and off outside

c

n=1 (xi – xn)2 + (yj – yn)2f

(1 – ) (Bg–f )+

(xi – xn)2 + (yj – yn)2g2B –

Page 3: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

706

where

(xi – xn)2 + (yj – yn)2 B = 1

Sets phi to 1 if ‘crime site’ is outside the radius of thebuffer zone

and

(xi – xn)2 + (yj – yn)2 B = 0

Sets phi to 0 if ‘crime site’ is inside the radius of thebuffer zone

such that phi functions as a switch that is set to 0 for sites within the bu!er zone, and 1 for sites outside the bu!er zone. k is an empirically determined constant, B is the radius of the bu!er zone, C is the number of sightings of the invasive organism f and g are parameters that con-trol the shape of the decay function, (xi, yj) are the coordi-nates of point (i, j) and (xn, yn) are the coordinates of the nth site. "us, pij describes the likelihood that the anchor point occurs at point (i, j), given the crime site locations (Rossmo 2000). "e equation describes a two-part curve, which when plotted in three dimensions resembles the caldera of a volcano. When summed these ‘volcano’ shaped decay functions produce a surface that describes an e#cient search pattern for the location of species invasions. "e model was implemented using the R sta-tistical package ‘gp’ (Stevenson 2011, R development core team 2008).

Model fitting

To validate the model, we took advantage of the temporal resolution of data at the Biological Records Centre (Spatial data, below) to split species data into decades (e.g. 1960–1969, 1970–1979, 1980–1989). We then fitted the model by selecting the value of B that best predicted the locations of the species in question in (for example) 1970–1979, using as input the locations from 1980–1989. "is was done using a maximum likelihood approach with quasi-newton optimisation (Nocedal and Wright 1999) to fit B, and leaving f and g fixed at 1.2. "ese are the values typi-cally used in criminology, and previous studies have shown that the model is much more sensitive to changes in B than to changes in f and g (Le Comber et al. 2006, Raine et al. 2009). "is also helps to avoid the problem of overfitting. Overfitting (discussed by O’Leary (2009) in the context of geographic profiling) can result in a model that essen-tially reduces to a series of point estimates that perfectly predict the data used to fit the model, but that have little or no predictive power when applied to other data sets. A further advantage, in the context of this study, is that this allows the model to be constrained to a single parameter

allowing direct comparison with a single parameter kernel density model.

"e model’s degree of fit can be calculated using the hit score percentage (HS%), the proportion of the area covering the crimes (in this case, the locations of invasive species) in which the o!ender’s base (or the source of the invasive species) is located; in criminology, this is usually the area bounding the crimes, plus a ‘guard rail’ of 10% surrounding this. "e HS% is calculated by dividing the ranked score (pij) by the total search area and multiplying this result by 100. "e smaller the HS%, the more accu-rate the geoprofile; a hit score of 50% is what would be expected from a nonprioritized (i.e. random or uniform) search (Rossmo 2000). In our analysis, unlike criminology, there are multiple sources, so we calculated the mean hit score percentage across all locations. "is ‘learning hit score’ is reported in Table 1.

Model testing

To test the model’s performance, we used the fitted para-meters to test the model’s predictions on an earlier time step; in the example above, we would feed the locations in 1970–1979 back into the model, and calculate the hit score percentages of the locations in the previous time step (1960–1969) (the test hit score in Table 1), thus ensuring that the learning and test hit score percentages were independent.

Other measures of spatial central tendency

GP was compared to other simple measures of spatial ten-dency commonly used in both biology and criminology. "e two most basic approaches are to calculate the spatial mean (or centroid) and the spatial median. "e spatial mean is the point where the coordinates are the mean of the x and y coordinates. "e equation is shown below:

x n x y n yii

n

ii

n1 11 1

,

where xi and yi are the co-ordinates of the invasive species presence and n is the total number of locations where the species is present. "e spatial median is the median point of the x and y co-ordinates. Using the same notation as above it is calculated as follows:

x y y y12

x x 112 1n

2,n

2n2

n2

A slightly more complex approach is to use the centre of minimum distance (CMD). "e CMD is the location at which the sum of the distances to all other points is mini-mized and is described by the following equation:

W x y x y x yi

n

i i( , ) (( , ), , ))dist (1

W is the distance from each location of an invasive species (xi, yi) to the chosen point (x–, y–). W is then minimised using an iterative algorithm. "e function of W is a convex hull

Page 4: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

707

Tabl

e 1.

Ani

mal

and

pla

nt s

peci

es u

sed

in th

is a

naly

sis,

with

Eng

lish

Nat

ure

cate

gory

list

ings

and

EU

NIS

hab

itat d

ata.

EU

NIS

hab

itats

are

as

follo

ws:

A1

litto

ral r

ock

and

othe

r har

d su

bstra

te; C

3 lit

tora

l zo

ne o

f inl

and

surfa

ce w

ater

way

s; G

1 br

oadl

eave

d de

cidu

ous

woo

dlan

d; G

4 m

ixed

dec

iduo

us a

nd c

onife

rous

woo

dlan

d; I1

ara

ble

land

and

mar

ket g

arde

ns; I

2 cu

ltiva

ted

area

s of

gar

dens

and

par

ks;

J2 lo

w d

ensi

ty b

uild

ings

; J4

trans

port

netw

orks

and

oth

er c

onst

ruct

ed h

ard-

surfa

ced

area

s; N

A n

ot a

pplic

able

. ‘Ti

me

step

’ sho

ws

the

data

use

d to

fit t

he m

odel

, prio

r to

test

ing.

B is

the

fitte

d va

lue

of th

e bu

ffer

zone

rad

ius.

The

lear

ning

hit

scor

e is

sho

wn,

alo

ng w

ith th

e te

st a

nd, f

or c

ompa

rison

, hit

scor

es fo

r th

e ce

ntre

of m

inim

um d

ista

nce

(CM

D),

spat

ial m

edia

n (S

SM) a

nd s

patia

l mea

n (S

MM

). Fo

r ea

ch s

peci

es a

naly

sed,

the

mos

t effe

ctiv

e of

thes

e fo

ur s

earc

h st

rate

gies

is s

how

n as

a s

hade

d ce

ll; in

52

of 5

3 (9

8%) c

ases

, GP

out-p

erfo

rmed

all

thre

e ot

her s

trate

gies

. Dat

a fro

m E

nglis

h N

atur

e au

dit

of n

on-n

ativ

e sp

ecie

s (H

ill e

t al.

2005

).

Spec

ies

C

omm

on n

ame

Engl

ish

Nat

ure

listin

g ca

tego

ry

M

ajor

ca

tego

ry

M

inor

cat

egor

y

D

omin

ant

habi

tat

Ti

me

step

B

GP

lear

ning

hi

t sco

re

G

P te

st

hit

scor

e

C

MD

te

st h

it sc

ore

SS

M

test

hit

scor

e

SM

M

test

hit

scor

e

Ach

eta

dom

estic

usH

ouse

cric

ket

55an

imal

sin

sect

sN

A19

71–1

980;

19

81–1

990

0.38

7 00

E-69

0.25

0.52

50.

581

0.57

9

Agl

ossa

pin

guin

alis

Larg

e ta

bby

53an

imal

sin

sect

sI2

1981

–199

0;

1991

–200

00.

425

29E-

060.

230.

471

0.65

40.

557

Chr

ysal

ina

amer

ican

aRo

sem

ary

beet

le52

anim

als

inse

cts

NA

1991

–200

0;

2001

–201

00.

353

72E-

240.

140.

423

0.83

00.

431

Dic

rano

palp

us ra

mos

usH

arve

stm

an44

anim

als

inve

rtebr

ate

non-

inse

ctI2

1981

–199

0;

1991

–200

00.

443

14E-

940.

120.

335

0.46

90.

373

Phol

cus

phal

angi

oide

sD

addy

-long

-legs

spi

der

or C

ella

r spi

der

44an

imal

sin

verte

brat

e no

n-in

sect

I219

91–2

000;

20

01–2

010

0.58

6 60

E-08

0.22

0.48

50.

563

0.51

4

Tege

naria

agr

estis

Hob

o sp

ider

44an

imal

sin

verte

brat

e no

n-in

sect

I219

91–2

000;

20

01–2

010

0.39

1 35

E-71

0.26

0.51

90.

600

0.50

8

Cor

ophi

um s

exto

nae

–14

mar

ine

mar

ine

and

es

tuar

ine

inve

rtebr

ates

A1

1981

–199

0;

1991

–200

00.

361

77E-

270.

240.

466

0.58

80.

578

Cra

ssos

trea

gig

asPa

cific

oys

ter o

r Ja

pane

se o

yste

r13

mar

ine

mar

ine

and

estu

arin

e in

verte

brat

esA

119

81–1

990;

19

91–2

000

0.6

4 88

E-07

0.16

0.39

40.

843

0.47

9

Cre

pidu

la fo

rnic

ata

Com

mon

slip

per s

hell

13m

arin

em

arin

e an

d es

tuar

ine

inve

rtebr

ates

A1

1971

–198

0;

1981

–199

00.

193

96E-

630.

180.

467

0.48

00.

470

Elm

iniu

s m

odes

tus

Barn

acle

14m

arin

em

arin

e an

d es

tuar

ine

inve

rtebr

ates

A1

1971

–198

0;

1981

–199

00.

557

20E-

410.

140.

373

0.44

80.

391

Petr

icol

a ph

olad

iform

isFa

lse

ange

l win

g

Am

eric

an P

iddo

ck13

mar

ine

mar

ine

and

estu

arin

e in

verte

brat

esA

119

81–1

990;

19

91–2

000

0.56

1 70

E-14

0.21

0.44

60.

562

0.52

3

Stye

la c

lava

Stal

ked

or le

athe

ry s

ea

squi

rt18

mar

ine

mar

ine

and

es

tuar

ine

inve

rtebr

ates

A1

1981

–199

0;

1991

–200

00.

131

32E-

220.

210.

449

0.53

70.

481

Und

aria

pin

natifi

daJa

pane

se k

elp

33m

arin

em

arin

e an

d es

tuar

ine

plan

tsA

119

81–1

990;

19

91–2

000

1.57

7 00

E-69

0.18

0.45

40.

545

0.52

2

Ace

r pla

tano

ides

Nor

way

map

le77

plan

tsva

scul

ar p

lant

sG

119

51–1

960;

19

61–1

970

0.27

1 35

E-48

0.25

0.45

50.

573

0.56

8

Aes

culu

s hi

ppoc

asta

num

Hor

se c

hest

nut

77pl

ants

vasc

ular

pla

nts

G1

1931

–194

0;

1941

–195

00.

714

98E-

430.

250.

461

0.55

70.

551

Ani

sant

ha s

teril

isBa

rren

bro

me

77pl

ants

vasc

ular

pla

nts

I119

71–1

980;

19

81–1

990

0.27

5 34

E-29

0.27

0.52

90.

668

0.59

2

Cam

panu

la p

osch

arsk

yana

Trai

ling

bellfl

ower

77pl

ants

vasc

ular

pla

nts

J219

71–1

980;

19

81–1

990

0.59

1 14

E-20

0.21

0.41

40.

930

0.52

5

Cer

atoc

hloa

mar

gina

taW

este

rn b

rom

e77

plan

tsva

scul

ar p

lant

sC

319

71–1

980;

19

81–1

990

0.63

1 36

E-20

0.32

0.56

90.

716

0.61

3

(Con

tinue

d)

Page 5: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

708

Tabl

e 1.

(C

ontin

ued)

.

Spec

ies

C

omm

on n

ame

Engl

ish

Nat

ure

listin

g ca

tego

ry

M

ajor

ca

tego

ry

M

inor

cat

egor

y

D

omin

ant

habi

tat

Ti

me

step

B

GP

lear

ning

hi

t sco

re

G

P te

st

hit

scor

e

C

MD

te

st h

it sc

ore

SS

M

test

hit

scor

e

SM

M

test

hit

scor

e

Cry

ptom

eria

japo

nica

Japa

nese

red

ceda

r76

plan

tsva

scul

ar p

lant

sG

419

51–1

960;

19

61–1

970

0.29

4 71

E-21

0.11

0.74

30.

920

0.85

3

Erin

us a

lpin

usFa

iry fo

xglo

ve77

plan

tsva

scul

ar p

lant

sJ2

1971

–198

0;

1981

–199

00.

648

68E-

130.

140.

419

0.48

30.

398

Fallo

pia

japo

nica

Japa

nese

kno

twee

d77

plan

tsva

scul

ar p

lant

sC

319

31–1

940;

19

41–1

950

0.38

5 29

E-09

0.19

0.41

80.

576

0.49

1

Fuch

sia

mag

ella

nica

Fuch

sia

77pl

ants

vasc

ular

pla

nts

I119

31–1

940;

19

41–1

950

0.42

9 98

E-45

0.14

0.35

40.

755

0.41

1

Fum

aria

reut

eri

Mar

tin’s

ram

ping

- fu

mito

ry77

plan

tsva

scul

ar p

lant

sI1

1981

–199

0;

1991

–200

00.

541

34E-

580.

120.

330

0.46

50.

458

Gal

anth

us n

ival

is77

plan

tsva

scul

ar p

lant

sG

119

31–1

940;

19

41–1

950

0.26

4 14

E-17

0.19

0.44

10.

549

0.48

3

Ger

aniu

m d

isse

ctum

Cut

-leav

ed c

rane

’s-

bill

77pl

ants

vasc

ular

pla

nts

I119

31–1

940;

19

41–1

950

0.71

7 68

E-25

0.19

0.40

10.

510

0.50

3

Her

acle

um

man

tega

zzia

num

Gia

nt h

ogw

eed

77pl

ants

vasc

ular

pla

nts

J419

31–1

940;

19

41–1

950

0.77

00.

130.

401

0.42

70.

411

Jugl

ans

regi

aC

omm

on w

alnu

t, Pe

rsia

n w

alnu

t or

Engl

ish

wal

nut

77pl

ants

vasc

ular

pla

nts

G1

1971

–198

0;

1981

–199

00.

236

65E-

710.

130.

401

0.52

00.

423

Kick

xia

spur

iaRo

undl

eaf c

ance

rwor

t77

plan

tsva

scul

ar p

lant

sI1

1951

–196

0;

1961

–197

00.

37

83E-

420.

110.

385

0.48

40.

377

Lact

uca

serr

iola

Pric

kly

lettu

ce77

plan

tsva

scul

ar p

lant

sJ4

1931

–194

0;

1941

–195

00.

353

45E-

110.

160.

123

0.21

10.

234

Larix

dec

idua

Euro

pean

larc

h76

plan

tsva

scul

ar p

lant

sG

419

31–1

940;

19

41–1

950

0.3

00.

230.

516

0.60

70.

573

Larix

kae

mpf

eri

Japa

nese

larc

h76

plan

tsva

scul

ar p

lant

sG

419

71–1

980;

19

81–1

990

0.28

2 04

E-47

0.34

0.61

50.

707

0.58

5

Lepi

dium

cam

pest

reFi

eld

pepp

erw

eed/

pepp

erw

ort

77pl

ants

vasc

ular

pla

nts

J219

31–1

940;

19

41–1

950

0.58

2 16

E-14

0.13

0.36

70.

497

0.46

3

Mal

va n

egle

cta

Butto

nwee

d ch

eese

-pl

ant,

chee

sew

eed,

dw

arf m

allo

w o

r ro

undl

eaf m

allo

w

77pl

ants

vasc

ular

pla

nts

J419

31–1

940;

19

41–1

950

0.63

3 19

E-08

0.23

0.44

10.

603

0.53

5

Myo

sotis

arv

ensi

sFi

eld

forg

et-m

e-no

t77

plan

tsva

scul

ar p

lant

sJ2

1971

–198

0;

1981

–199

00.

576

39E-

140.

210.

434

0.61

10.

527

Papa

ver d

ubiu

mLo

ng-h

eade

d Po

ppy

77pl

ants

vasc

ular

pla

nts

J419

71–1

980;

19

81–1

990

0.62

3 45

E-33

0.21

0.49

60.

555

0.54

6

Pice

a ab

ies

Nor

way

spr

uce

76pl

ants

vasc

ular

pla

nts

G4

1951

–196

0;

1961

–197

00.

276

43E-

540.

130.

372

0.50

30.

418

Pice

a gl

auca

Whi

te s

pruc

e76

plan

tsva

scul

ar p

lant

sG

419

71–1

980;

19

81–1

990

0.23

8 12

E-64

0.19

0.39

20.

555

0.44

5

Pice

a si

tche

nsis

Sitk

a sp

ruce

76pl

ants

vasc

ular

pla

nts

G4

1971

–198

0;

1981

–199

00.

33

56E-

900.

160.

361

0.45

50.

463

Pilo

sella

aur

antia

caO

rang

e ha

wkw

eed

77pl

ants

vasc

ular

pla

nts

J219

71–1

980;

19

81–1

990

0.51

1 47

E-41

0.13

0.37

80.

434

0.37

6

(Con

tinue

d)

Page 6: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

709

Tabl

e 1.

(C

ontin

ued)

.

Spec

ies

C

omm

on n

ame

Engl

ish

Nat

ure

listin

g ca

tego

ry

M

ajor

ca

tego

ry

M

inor

cat

egor

y

D

omin

ant

habi

tat

Ti

me

step

B

GP

lear

ning

hi

t sco

re

G

P te

st

hit

scor

e

C

MD

te

st h

it sc

ore

SS

M

test

hit

scor

e

SM

M

test

hit

scor

e

Pilo

sella

aur

antia

caFo

x-an

d-cu

bs77

plan

tsva

scul

ar p

lant

sJ2

1971

–198

0;

1981

–199

00.

433

93E-

640.

160.

421

0.52

80.

501

Pseu

dots

uga

men

zies

iiD

ougl

as fi

r76

plan

tsva

scul

ar p

lant

sG

419

51–1

960;

19

61–1

970

0.25

2 74

E-39

0.15

0.44

30.

740

0.44

6

Que

rcus

cer

risTu

rkey

oak

77pl

ants

vasc

ular

pla

nts

G1

1931

–194

0;

1941

–195

00.

636

72E-

060.

120.

385

0.48

20.

458

Que

rcus

ilex

Hol

m o

ak o

r hol

ly o

ak77

plan

tsva

scul

ar p

lant

sG

119

31–1

940;

19

41–1

950

0.36

00.

410.

623

0.80

30.

685

Que

rcus

rubr

aN

orth

ern

red

oak

or

cham

pion

oak

77pl

ants

vasc

ular

pla

nts

G1

1971

–198

0;

1981

–199

00.

257

44E-

170.

160.

421

0.83

00.

478

Rhus

typh

ina

Stag

horn

sum

ac77

plan

tsva

scul

ar p

lant

sG

119

71–1

980;

19

81–1

990

0.23

3 76

E-11

0.17

0.45

30.

495

0.48

2

Salix

alb

aW

hite

will

ow77

plan

tsva

scul

ar p

lant

sC

319

31–1

940;

19

41–1

950

0.67

7 08

E-22

0.12

0.38

60.

497

0.44

8

Salix

tria

ndra

Alm

ond

will

ow o

r al

mon

d-le

aved

w

illow

77pl

ants

vasc

ular

pla

nts

C3

1931

–194

0;

1941

–195

00.

246

75E-

100.

140.

362

0.52

70.

479

Salix

vim

inal

isC

omm

on o

sier

or o

sier

77pl

ants

vasc

ular

pla

nts

C3

1931

–194

0;

1941

–195

00.

311

14E-

170.

170.

410

0.49

30.

412

Sam

bucu

s ra

cem

osa

Red

elde

rber

ry77

plan

tsva

scul

ar p

lant

sG

119

71–1

980;

19

81–1

990

0.35

1 94

E-56

0.13

0.41

60.

474

0.46

4

Sina

pis

arve

nsis

Wild

mus

tard

or

char

lock

77pl

ants

vasc

ular

pla

nts

J419

31–1

940;

19

41–1

950

0.72

1 09

E-87

0.14

0.35

40.

770

0.45

4

Thuj

a pl

icat

aW

este

rn re

d ce

dar

76pl

ants

vasc

ular

pla

nts

G4

1971

–198

0;

1981

–199

00.

31

76E-

230.

150.

435

0.51

70.

400

Tsug

a he

tero

phyl

laW

este

rn h

emlo

ck76

plan

tsva

scul

ar p

lant

sG

419

71–1

980;

19

81–1

990

0.3

1 46

E-21

0.2

0.46

60.

557

0.44

8

Tsug

a he

tero

phyl

laW

este

rn h

emlo

ck-

spru

ce76

plan

tsva

scul

ar p

lant

sG

419

31–1

940;

19

41–1

950

0.27

1 45

E-33

0.17

0.39

50.

528

0.41

8

Page 7: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

710

and as such has a unique minimum. We used the algorithm developed by Weiszfeld (1936):

where the initial values of x and y may be taken as any reason-able value (the centroid for example). "e method has been shown to always converge on the CMD, yet the length of time taken for this to occur is an unknown function depen-dant on the size and distribution of the data (Kuhn and Kuenne 1962). We ran the method for k 100 to ensure its convergence.

"ese methods were then compared to GP by searching outwards from the central point in concentric bands. Using this search strategy, a figure comparable to the hit score could be calculated based on a directed search originating out from these measures of spatial central tendency.

Kernel density method

In addition to the simple methods described above, we also compared GP to a kernel density method of the type that underlies gravity models. In this study, we adapted the kernel density method for estimating home range size described by Worton (1989) which underlies the backcast-ing application in MacIsaac et al. (2004). "is kernel den-sity approach uses a fixed kernel based on the summation of unimodal bivariate normal probability density functions. Whilst similar in some ways to the geographic profiling model, this approach uses a single parameter normal distribu-tion in place of the geographic profiling function. "e single parameter bivariate normal kernel is given by the following equation (note that the notation used in this equation, but not the form, have been modified to make the comparison with the GP model more explicit):

Pnh

x x y yhij

i n

i

ni n1 1

2 22 21

. ( ) ( )exp

where pij is the probability of each point being a source. (xi, yi) and (xn, yn) are the same as in the geographic profiling model. h is the only free parameter in the model and can be used to smooth out or concentrate the surface; it is referred to as the smoothing parameter. "is parameter may be fitted in much the same way as the geographic profiling model’s parameter B as discussed above.

Simulations

Simulated data were also created to test and compare the GP model alongside real species invasions. "e simulations were created using the statistical modelling environment R (R development core team), and consisted of a 100 by 100 grid in which sources were uniformly distributed in the central 36 36 region (the constraint was necessary to avoid edge e!ects). From each source a normal distribution of spread

( , )(( , ),( , ))((

( ) ( )( ) ( )

x yx x y x y

xk k i i i

k kin

1 1 1

1/dist/dist ii i

k kin

i i ik k

in

y x yy x y x y

, ),( , )),

(( , ),( , ))( ) ( )

( ) ( )

1

1 /dist111 /dist(( , ),( , ))( ) ( )x y x yi i

k kin

points was simulated with a standard deviation of 5. "e simulations tested every possible combination of 1, 2 and

5 sources and 10, 20 and 30 spread points. Each variation was replicated 1000 times and the hit score for each model calculated to test how well they identified the source loca-tions. Where there were multiple sources, the mean hit score of all of the individual sources was used.

Two sets of simulations were conducted. In the first, GP was tested against the spatial mean, spatial median and centre of minimum distance. In this set of simulations (as in most real-world examples), the dispersal parameters are unknown, and GP uses half the mean nearest-neighbour distance between points to estimate B. In the second set of simula-tions, GP was compared to the kernel density method. "e kernel density method uses as an input h, which corresponds to the standard deviation of the normal distribution used to simulate dispersal, and in this set of simulations both h and B were set to the true standard deviation.

Spatial data

Fifty three British invasive species were chosen that had extensive invasion histories as recorded in the R662 audit of non-native species of England, produced by Natural England (Hill et al. 2005). "e R662 audit was a desk-based survey that collated all of the standard British resources and checked these against up-to-date studies. Data on the locations and date of presence (taken as the centroid within 10 km2 grid) were obtained from the Biological Records Centre (BRC) database downloaded from NBN gateway ( www.nbn.org.uk ). "e BRC database contains datasets from hundreds of contributors across the U.K., ranging from government bodies, NGOs and scientific surveys. "ere is significant variation in the quality of the data collected, but all species chosen had multiple records collected from di!erent organi-sations. A full list of the species chosen, years analysed and results obtained for each species can be found in Table 1.

Taxonomic differences

Di!erent species have very di!erent reproductive habits and dispersal traits, and this is likely to be reflected in dif-ferences in the radius of the bu!er zone (parameter B) in our model. To test for di!erences in B between species belonging to di!erent taxa, we examined the distribution of B values in major and minor taxonomic functional groups as defined by English Nature’s Audit of non-native species (Hill et al. 2005).

Habitat differences

Di!erent species have preferred habitat types and the abundance of suitable habitats, and their distribution, may a!ect the rates at which species are able to spread. In

Page 8: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

711

sources number of spread points: F1,17992 0.1, p 0.32; model type number of sources number of spread points: F1,17992 0.4, p 0.55). Tukey post-hoc tests showed that GP performed significantly better than all other methods except when the number of sources was 1, when the kernel density model performed as well as GP (Table 2b).

Model performance

Across each of the 53 species examined, GP’s mean hit score percentage was lower than the 50% that characterises a random search (mean SD: 18% 18.4%; log trans-formation: t 37.338, DF 52, p 2.2e-16). GP also out-performed the spatial mean, spatial median and centre of minimum distance in 52 out of 53 datasets. GP had a mean hit score percentage of 18.4% across all datasets (SD 6%) compared to 58.1% (SD 14%) for the spatial mean, 48.7% (SD 9%) for the spatial median and 43.4% (SD 9%) for the CMD. ANOVA reveals significant di!erences between these (F3,50 100.01, p 2.2e-16) (Table 1).

We also selected one dataset, Heracleum mantegazzianum, for which there are good data, for more detailed analysis, examining the distribution of hit scores for individual sites, as well as considering the average across all sites (Fig. 2). For 35 of 51 1941–1950 sources the hit score percentage was below 10%. "e mean hit score percentage for all the sources was 13% and never exceeded 24% of the searchable area. We also ran a kernel density model on the Heracleum mantegazzianum data set, to compare its performance to that of GP. For 18 of 1941–1950 sources the hit score percentage was below 10%. "e mean hit score was 31% and reached a maximum of 65% of the search area. "is result was sig-nificantly worse than that of GP (Wilcoxon rank sum test: W 382, n 51, p 8.052e-10).

Taxonomic differences

"ere were no significant di!erences in fitted B values across the English Nature major groups animals, marine, plants (mean SD: animals: 0.43 0.08; marine 0.56 0.48; plants 0.42 0.18; log transformation: F4,49 0.60, p 0.44), although we noted that the variance for the cat-egory ‘animals’ was lower than in the other two categories. However, when we looked within the category plants to compare categories 76 (woody stemmed conifers) and 77 (deciduous flowering plants), we did find significantly di!er-ent values of B (median values: category 76: 0.17; category 77: 0.16; Wilcoxon rank sum test: W 238, n 11, 30, p 0.005) (Fig. 3).

Habitat differences

When we fitted the model to di!erent EUNIS habitat listings, we found significant di!erences in fitted values of B (log transformation: ANOVA F4,46 3.26, p 0.02), although this was driven largely by di!erences between cat-egories G (woodland or forest) and J (industrial, constructed or artificial habitats) (Fig. 4).

its current formulation, GP makes no attempt to include habitat information, but it is still possible to check whether there is an association between the values of B produced by species living in di!erent habitats. We examined "e European Nature Information System (EUNIS) habitat types for all 53 of the species analysed to test for patterns in B values associated with habitat di!erences. "e EUNIS habitat classification is a pan-European system, which was developed between 1996 and 2001 by the European Environment Agency (EEA) in collaboration with experts from throughout Europe, and covers all types of natural and artificial habitats, both aquatic and terrestrial (Davies et al. 2004).

Temporal differences

Invasions can span decades or even centuries. In this area the biologist has an advantage over the criminologist, since data can be divided into di!erent temporal units, allow-ing examination of changes in model parameters as inva-sions progress. Invasions can change over time, and the notion of an invasion delay, followed by an explosion of expansion, is well established (Crooks and Soulé 1999). We selected three species with extensive invasion histories and repeated our analysis over multiple decades. In this way we could determine if the fit of our model alters over the course of an invasion history and could also compare species to determine if they had consistently di!erent B values across time.

Results

Simulations

"e mean hit scores for each combination of conditions used in the simulation are presented in Table 2 and Fig. 1. "e results were analysed using a full factorial ANOVA with model type, number of sources and number of spread points as factors, and all interactions included. "e first test compared GP to spatial mean, spatial median and centre of minimum distance. "e results showed significant di!erences between model type, but not between other factors or interactions (model type: F3,35984 75386.2, p 2e-16; number of sources: F1,35984 3.6, p 0.056; number of spread points: F1,35984 1.4, p 0.23; model type number of sources: F3,35984 0.5, p 0.66; model type number of spread points: F3,35984 0.1, p 0.94; number of sources number of spread points: F1,35984 0.0, p 0.89; model type number of sources number of spread points: F3,35984 2.1, p 0.1). Tukey post-hoc tests showed that GP performed significantly better than all other methods in each case (Table 2a). "e second test compared GP to the kernel density method. Again, the results showed significant di!erences between the mod-els (model type: F1,17992 81216.9, p 2e-16; number of sources: F1,17992 0.1, p 0.75; number of spread points: F1,17992 2.3, p 0.13; model type number of sources: F1,17992 0.2, p 0.67; model type number of spread points: F1,17992 2.7, p 0.19; number of

Page 9: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

712

Figure 1. Boxplot of simulation results, with GP compared to (a) centre of minimum distance (CMD), spatial mean (SMM) and spatial median (SSM), and (b) a kernel density model. Mean hit score % across all 9000 runs of each method (1000 replicates three levels of source points three levels of spread points) are shown on the y axis. GP performs well across all tests and never took 7% of the search area to find all of the sources. All other methods show much greater range in success. "e kernel density method performs well when number of sources equals 1 but then rapidly begins to decline in e#ciency as the number of sources increase.

not significantly di!erent from 0 in all cases). However, the method detected significant di!erences in fitted values of B between individual species (F2,20 8.5248, p 0.0005474) (Fig. 5).

Discussion

Our study shows that geographic profiling can correctly pre-dict the sources of invasive species, using as input their cur-rent locations. Crucially, it can also do so in the early stages of invasions with data that is possible to acquire quickly, and when control e!orts are most likely to be e!ective. Geographic profiling outperformed other widely used spa-tial statistics such as the centre of minimum distance, spatial mean and spatial median in locating invasion sources. In addition, a simple kernel density approach was found to be less e!ective than GP, with GP’s advantage increasing as the number of sources of invasion increased. Our study also sug-gests that it may be possible to use general, taxon- or habitat-specific values for the model parameters in cases where data on individual species are lacking, since di!erent fitted values of B were obtained in flowering plants and conifers.

"ere are marked similarities between the problems faced by police investigators and invasion managers. Both can face situations with large volumes of data and di#-culty in determining which data are informative and which uninformative. In both cases, resources and time are lim-ited (Williamson and Fitter 1996a). Methods for optimis-ing searches for source locations of invasive organisms could therefore provide a valuable tool in the rapid assessment and control in the critical early stage of invasion history (Leung et al. 2002), just as they have done in criminology (Rossmo 2000).

Temporal differences

For the three species for which we analysed multiple time periods, fitted values of B did not di!er within species across di!erent time steps (linear regressions of slopes were

Table 2. Results of computer simulations comparing GP to (a) centre of minimum distance (CMD), spatial mean (SMM) and spatial median (SSM), and (b) a kernel density model. The results shown are the mean and SD hit score percentage. Asterisks mark cases in which GP sig-nificantly outperformed the other methods. GP performs well across the entire study, never exceeding a mean search efficiency of 4.44% of the simulated area. GP’s advantage over other methods increases as the number of sources increases. Other methods fail to replicate this and become increasingly inaccurate as the number of sources increases.

Mean hit score % (SD)

Number of sources Number of spread points CMD SSM SMM GP (fitted B)

(a) GP versus SSM, SMM and CMD1 10 9.5 (0.06) 14.6 (0.08) 19.2 (0.08) 3.9 (0.05)1 20 9.4 (0.06) 14.4 (0.06) 19.3 (0.08) 4.3 (0.05)1 30 9.2 (0.06) 14.2 (0.07) 19.0 (0.09) 3.3 (0.05)2 10 19.4 (4.13) 24.4 (4.44) 29.2 (4.42) 3.4 (0.03)2 20 22.6 (4.26) 27.7 (4.15) 32.5 (4.35) 3.6 (0.03)2 30 27.1 (4.13) 32.1 (4.15) 36.9 (4.41) 3.5 (0.04)5 10 28.0 (4.26) 38.1 (4.49) 42.9 (4.55) 2.7 (0.02)5 20 35.9 (4.10) 32.6 (4.35) 47.4 (4.4) 3.1 (0.02)5 30 40.3 (4.20) 50.9 (4.20) 52.4 (4.52) 2.3 (0.02)

(b) GP versus kernel density model Kernel (fixed h) GP (fixed B)1 10 4.9 (0.06) 4.4 (0.05)1 20 4.9 (0.05) 4.3 (0.05)1 30 4.7 (0.07) 4.2 (0.05)2 10 5.2 (3.06) 3.5 (0.05)2 20 5.5 (3.07) 3.3 (0.05)2 30 6.8 (3.10) 3.3 (0.05)5 10 8.7 (3.06) 2.7 (0.05)5 20 7.7 (3.07) 2.7 (0.05)5 30 13.5 (3.12) 2.6 (0.05)

Page 10: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

713

Many invasive organisms show distinctive time lag between initial invasion, establishment and subsequent expansion (Crooks and Soulé 1999). Often repeated inva-sions will occur by the same pathway and only after a period of time will one of these invasions begin to expand beyond the initial source of invasion (Drake and Lodge 2004). GP operates at this critical early stage of the invasion pro-cess, locating the source locations of rapidly expanding populations allowing e!ective allocation of limited resources available for control measures (Puth and Post 2005, Keller et al. 2008).

Following the work of numerous scientific workers such as weed scientists, resource managers, conservation biolo-gists, restoration biologists, field ecologists and economists,

Figure 3. Boxplot showing fitted values of B for English Nature category listings 76 (conifers and gingko) and 77 (flowering plants).

Figure 2. Search strategies for Heracleum mantegazzianum, based on (a) geographic profiling; (b) centre of minimum distance, and (c) kernel density model. Black circles show the locations in 1941–1950, and white circles the locations in 1931–1940. Contours are shown in bands of 5%. Geoprofiling successfully locates 35 of 51 source populations after searching just 10% of the target area, with a mean hit score for all sources of 13%. Other approaches do not perform as well. "e centre of minimum distance (CMD), spatial mean (S mean) and spatial median (S median) are also shown in blue, black and red respectively.

a clear understanding has emerged of the stages that make up invasions and the relevant modelling and available man-agement options to prevent and manage invasive species (Williamson and Fitter 1996b, Sakai et al. 2001). Sakai et al. (2001) present an example invasion framework (modified from Lodge (1993)) that highlights the general steps in the invasion process and their relationship to management steps that can be taken. "e movement, establishment and subse-quent spread of invasive species are best characterized by a series of discrete steps, each of which poses di!erent problems to both manager and modeller (Vitousek 1997). Some of these stages are more relevant to prevention; others are more relevant for issues of control and restoration. "ere has been increasing understanding that feedback may occur between many of these steps (Sakai et al. 2001). Within each of the phases, di!erent types of predictive and analytical models can be used to gather information and make assessments of the risks presented by invasive species. "e results reported in this study suggest that geographic profiling could form part of an integrated strategy and allow more precise target-ing of source populations and thus more e!ective allocation of resources. "is will be most useful in the establishment and lag phase, but may still be useful in populations that have started to expand rapidly, since the model may be used to di!erentiate between existing populations that are acting as sources and those that are not, again improving the e#-ciency of intervention.

GP models present a new approach based on two simple spatial concepts, distance decay and the bu!er zone. Distance decay has obvious relevance to the spread of invasive species, since movement involves costs, either in terms of energy expenditure or exposure to risk of predation, both of which limit dispersal distance. "e bu!er zone applies even with-out active avoidance of nearby sites since, if suitable habitats are randomly distributed, the number of sites increases geo-metrically with distance from the source. In fact, there may

Page 11: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

714

Figure 4. Boxplot showing (a) fitted values of B for EUNIS habitat categories A (marine habitats), C (inland surface waters), G (woodland, forest and other wooded land), I (regularly or recently cultivated agricultural, horticultural and domestic habitats) and J (constructed, industrial and other artificial habitats); (b) Tukey 95% post-hoc comparisons for each of these categories, showing that only categories J and G are significantly di!erent.

Second, the mathematics of GP is under continual development and di!erent functions have been explored (Canter et al. 2000, Levine 2009, O’Leary 2009). O’Leary (2009) proposes a movement into a Bayesian framework, and future work will compare di!erent modelling approaches in terms of model fit, ease of use and practical application, as well as continuing to assess its use in biological systems.

We suggest that invasive species managers and conser-vation biologists should strongly consider the use of GP,

be cases where bu!er zones arise from aspects of a species’ biology. In some species, o!spring avoid occupying the same area as their parents due to the competition for similar niche space that will inevitable result (sometimes only in the adult phase); for example, allelopathy in British trees (Singh et al. 2001). "ere is also evidence for bu!er zones in bee species (Dramstad 1996, Saville et al. 1997).

GP models operate in a similar fashion to gravity and kernel density methods, but have the distinct advantage of performing well with multiple sources of invasion, as well as being possible to run quickly and easily on a small number of data points; crucially, GP models run using only presence/absence data, without the need to estimate parameters for mechanisms such as outflows and inflows between sources and destinations, risk ranks etc. Rossmo (2000) has shown using Monte Carlo simulations that GP is capable of pro-ducing reliable profiles with as few as five data points.

"e results presented here demonstrate the feasibility of the method, but there is also significant development potential. In two key areas there is immediate improvement that can be made in the model. First, GP can be easily placed within existing modelling frameworks in invasion biology. GP models can work alongside population growth models and be informed by trait-based risk analysis (Leung et al. 2002). One exciting area for future research would be to integrate GP with niche-based modeling (Peterson 2003, "uiller et al. 2005). GP models are based on data that describe where species are now and how they have histori-cally spread, a spatial relationship that does not include any information on preferable habitat types. Because niche-based models do not include any spatial data but are based upon habitat preferences (Leung et al. 2004, "uiller et al. 2005), these two modeling types could be combined to produce a risk map that incorporates where species are, how they spread and how likely they are to settle in certain areas.

Figure 5. Time series plots for Heracleum mantegazzianum, Larix decidua and Acheta domesticus. In none of these three cases did fitted values of B di!er within species across di!erent decades (linear regressions were non-significant in all cases). However, the method detected significant di!erences in fitted values of B between individual species (F2,20 8.5248, p 0.0005474).

Page 12: Geographic profiling as a novel spatial tool for targeting ...webspace.qmul.ac.uk/rknell/pdfs/Ecography 2012 Stevenson.pdf · Geographic profiling as a novel spatial tool for targeting

715

Leung, B. et al. 2004. Predicting invasions: propagule pressure and the gravity of Allee e!ects. – Ecology 85: 1651–1660.

Levine, N. 2009. CrimeStat: a spatial statistics program for the analysis of crime incident locations. – Ned Levine and Associates, Annandale, VA and the National Inst. of Justice, Washington, DC, www.icpsr.umich.edu/crimestat .

Lodge, D. M. 1993. Biological invasions: lessons for ecology. – Trends Ecol. Evol. 8: 133–137.

MacIsaac, H. J. et al. 2004. Backcasting and forecasting biological invasions of inland lakes. – Ecol. Appl. 14: 773–783.

Martin, R. A. et al. 2009. Hunting patterns and geographic profiling of white shark predation. – J. Zool. 279: 111–118.

Mooney, H. A. and Drake, J. A. (eds) 1986. Ecology of biological invasions of North America and Hawaii. – Springer.

Nocedal, J. and Wright, S. J. 1999. Numerical optimization. – Springer.

O’Leary, M. 2009. "e mathematics of geographic profiling. – J. Investig. Psych. O!ender Profil. 6: 253–265.

Peterson, A. T. 2003. Predicting the geography of species invasions via ecological niche modelling. – Q. Rev. Biol. 78: 419–433.

Pimentel, D. S. et al. 2000. Economic and environmental threats of alien plant, animal, and microbe invasions. – Agric. Ecosyst. Environ. 84: 1–20.

Puth, L. M. and Post, D. M. 2005. Studying invasion: have we missed the boat? – Ecol. Lett. 8: 715–721.

Raine, N. E. et al. 2009. Geographic profiling applied to testing models of bumble-bee foraging. – J. R. Soc. Interface 6: 307–319.

Ricciardi, A. and Cohen, J. 2007. "e invasiveness of an intro-duced species does not predict its impact. – Biol. Invasions 9: 309–315.

Rossmo, D. K. 1993. A methodological model. – Am. J. Crimin. Just. 172: 1–21.

Rossmo, D. K. 2000. Geographic profiling. – CRC Press.Sakai, A. K. et al. 2001. "e population biology of invasive species.

– Annu. Rev. Ecol. Syst. 32: 305–332.Saville, N. M. et al. 1997. Bumblebee movement in a frag-

mented agricultural landscape. – Agric. Ecosyst. Environ. 61: 145–154.

Singh, H. P. et al. 2001. Allelopathy in agroecosystems. – J. Crop Prod. 4: 1–41.

Stevenson, M. D. 2011. gp: geographic profiling. – R package ver. 1.0.

Strayer, D. L. et al. 2006. Understanding the long-term e!ects of species invasions. – Trends Ecol. Evol. 21: 645–651.

"uiller, W. et al. 2005. Niche-based modelling as a tool for predicting the risk of alien plant invasions at a global scale. – Global Change Biol. 11: 2234–2250.

Vitousek, P. M. et al. 1996. Biological invasions as global environ-mental change. – Am. Sci. 84: 468–478.

Vitousek, P. M. et al. 1997. Human domination of earth’s ecosys-tems. – Science 277: 494.

Weiszfeld, E. 1936. Sur le point pour lequel la somme des distances de n points donnes est minimum. – Tôhoku Math. J. 43: 355–386.

Wilcove, D. S. et al. 1998. Quantifying threats to imperilled species in the United States. – Bioscience 48: 607–615.

Williamson, M. and Fitter, A. 1996a. "e varying success of invaders. – Ecology 77: 1661–1666.

Williamson, M. and Fitter, A. 1996b. "e characters of successful invaders. – Biol. Conserv. 78: 163–170.

Worton, B. J. 1989. Kernel methods for estimating the utilization distribution in home range studies. – Ecology 70: 164: 168.

especially when it is likely that there are multiple source populations of the species in question. Although GP and kernel models perform equally well when there is only a single source, our computer simulations show that, as the number of sources increases, simple kernel density models rapidly become less e!ective (for example, with five sources and 30 data points (a more than reasonable biological case), kernel models searched 13% of the area before finding all the sources, while GP searched on average 2.6% of the area before finding all sources). Managers are cautioned that using kernel models in the case of multiple sources could lead to searching/targeting significantly more of the target area than is necessary, involving a corresponding increase in e!ort. In real-world examples, where resources are likely to be limiting, geographic profiling o!ers an increase in search e#ciency over other methods – such as spatial mean, spa-tial median, centre of minimum distance and simple kernel density models – that is likely to lead to improved targeting of interventions, and more e#cient use of scarce resources.

Acknowledgements – We thank Annabel Dennis for help in assembling data for analysis.

References

Canter, D. et al. 2000. Predicting serial killers’ home base using a decision support system. – J. Quant. Criminol. 16: 457–478.

Crooks, J. and Soulé, M. E. 1999. Lag times in population explosions of invasive species: causes and implications. – In: Sandlund, O. T. et al. (eds), Invasive species and biodiversity management. Kluwer, pp. 103–125.

Davies, C. E. et al. 2004. EUNIS habitat classification revised. – European Environment Agency report, http://eunis.eea.europa.eu/upload/EUNIS_2004_report.pdf .

Doney, R. H. 1990. "e aftermath of the Yorkshire Ripper: the response of the United Kingdom police service. – In: Egger, S. (ed.), Serial murder: an elusive phenomenon. Praeger Publishing, pp. 95–112.

Drake, J. M. and Lodge, D. M. 2004. E!ects of environmental variation on extinction and establishment. – Ecol. Lett. 7: 26–30.

Dramstad, W. 1996. Do bumblebees (Hymenoptera: Apidae) really forage close to their nests? – J. Insect Behav. 9: 163–182.

Hill, M. et al. 2005. Audit of non-native species in England. – English Nature Report, www.english-nature.org.uk .

Hulme, P. E. 2006. Beyond control: wider implications for the management of biological invasions. – J. Appl. Ecol. 43: 835–847.

Keller, R. P. et al. 2008. Preventing the spread of invasive species: economic benefits of intervention guided by ecological predic-tions. – Conserv. Biol. 22: 80–88.

Kuhn, H. W. and Kuenne, R. E. 1962. An e#cient algorithm for the numerical solution of the generalized Weber problem in spatial economics. – J. Regional Sci. 4: 21–33.

Le Comber, S. C. et al. 2006. Geographic profiling and animal foraging. – J. "eor. Biol. 240: 233–240.

Le Comber, S. C. et al. 2011. Geographic profiling as a novel spatial tool for targeting infectious disease control. – Int. J. Health Geogr. 10: 35.

Leung, B. et al. 2002. An ounce of prevention or a pound of cure: bioeconomic risk analysis of invasive species. – Proc. R. Soc. B 269: 2407–2413.