spatial data modeling with gis and sas - long school of...
TRANSCRIPT
![Page 1: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/1.jpg)
Spatial Data Modeling with GIS and SAS
M. Norman Oliver, M.D., M.A.Associate Professor,
Departments of Family Medicine,Public Health Sciences, and Anthropology;
University of Virginia Health System
![Page 2: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/2.jpg)
Acknowledgements
• This project was supported by grant K07 CA099983 from the National Cancer Institute, HRSA CFDA No. 93.984, Academic Units in Primary Care-Family Medicine, and a grant from the Paul Mellon Prostate Cancer Research Institute.
![Page 3: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/3.jpg)
Kevin A. MatthewsUniversity of Virginia Department
of Family Medicine;
Mir Siadaty, M.S., M.D.,University of Virginia Department
of Health Evaluation Sciences
Fern R. Hauck, M.D.,University of Virginia Department
of Family Medicine
Linda W. Pickle, Ph.D.,Surveillance Research Program,
Division of Cancer Control and Population Sciences, NCI
Special thanks to Dr. Pickle: All the good ideas (and some of the slides) in this presentation come from courses she has taught and discussions we’ve had. All bad ideas and mistakes are my own invention.
![Page 4: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/4.jpg)
![Page 5: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/5.jpg)
![Page 6: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/6.jpg)
Why map health data? Place matters
• Disease risk and exposure vary by location • Public wants to know whether their
community has a problem• Researchers want to know where to conduct
further studies (high or low rate areas)• Public health officials want to know where
to start intervention or prevention programs and how to allocate scarce resources
![Page 7: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/7.jpg)
Goals of mapping analysis
• Identify geographical patterns• Cluster detection • Hypothesis generation (causes of patterns?)• Comparison of patterns (e.g., health disparities)• Changing patterns over time (surveillance, cancer
control)
Different goals will lead to different analytic methods and map designs
![Page 8: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/8.jpg)
Oral Cancer in White Females
Source: Mason et al. Atlas of Cancer Mortality for U.S. Counties, 1950-1969. NIH 1975
![Page 9: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/9.jpg)
Lung Cancer in White Males
Source: Mason et al. Atlas of Cancer Mortality for U.S. Counties, 1950-1969. NIH 1975
![Page 10: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/10.jpg)
Cervical Cancer in White Females, 1950-941950-54 1955-59 1960-64
1965-69 1970-74 1975-79
1980-84 1985-89 1990-94
![Page 11: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/11.jpg)
GIS & Disparities in Prostate Cancer•Second leading cause of cancer death among men
•Unequal burden: African-Americans have both prostate cancer incidence and mortality rates twice that of whites; however, geographic patterns for both
•Reasons for disparities unclear. Diet? Environmental exposures? Access to health care? Genetic susceptibility?
![Page 12: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/12.jpg)
![Page 13: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/13.jpg)
![Page 14: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/14.jpg)
The Whirling Vortex of GIA
The question you want to
answer
The data youneed to answerthat question
The questionyou can
answer withthose data
The data youcan get
GIS
Source: Waller & Gotway, 2004Original source: John Richardson, Toxicologist EPA Region IV
![Page 15: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/15.jpg)
What Statistic to Map?
• Mapping counts ignores population differences, so mapping rates preferred
• Proportions useful for showing case characteristics, e.g., % with late stage diagnosis
• Relative risks (or relative rates) are easily interpretable, put all maps on same scale
• Observed, estimated or smoothed statistic?• Map residuals as part of modeling process
![Page 16: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/16.jpg)
Data description
• VCR: Incident prostate cancer cases 1990-9 (37,000 cases)• Geocoded 74% (26,000 cases)• Cases aggregated to census tract and county levels for
comparison to sociodemographic variables in the 1990 U.S. Census
• Census variables: low education (<12yrs), high education (≥16yrs), median household income, percent female head of household, %rural – all stratified by racial category
• MD ratio (ARF), %males screened with PSA (BRFSS)
![Page 17: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/17.jpg)
Rate standardization• What rates to map?
– Crude rates ignore population difference effects on rates– Age-specific rates usually too variable due to small #s– Age-adjusted rates
• Only meaningful in comparison to similarly-adjusted rates• Direct method: Σ(age-specific rate)ij(std pop)j/(total std pop)
where sum is over all ages j, for place i• Indirect method:
SMRi = Σ(age-specific rate in std pop)j(pop)ij/(std rate)advantage: can be calculated when # cases smalldisadvantage: SMRs not comparable place to place except under certain conditions
(Pickle and White, Stat in Med, 1995)
![Page 18: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/18.jpg)
Exploratory Spatial Data Analysis
• First look at the data• Quality control check -- unexpected
patterns?• Hypothesis generation – associations
between predictors and outcomes?• Quick & dirty method for answering spatial
questions• A way to check model
![Page 19: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/19.jpg)
![Page 20: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/20.jpg)
Smoothing Concepts
• Smoothing is a method of removing some variability in a quantitative map, highlighting the underlying (true?) pattern
• Maps of cancer rates for small areas can be difficult to interpret because of background “noise
• Different types of smoothers: linear or non-linear, based on distance or nearness of neighbors, but all are variations of 2D moving averages
• Comparison of various 2D smoothers: Kafadar1994 (Comp Stat & Data Analysis 18:419-439)
![Page 21: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/21.jpg)
Smoothing Health Data
• The most important feature of a 2D smoother for health data is its inclusion of weights
• The variance of both rates & counts is proportional to 1/population, so rates are more or less reliable depending on population size
• Most smoothers in the past did not allow weights, several new ones do– very stable rates are smoothed less– more unstable rates (due to small populations) are
smoothed more
![Page 22: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/22.jpg)
Headbanging
• Proposed by Tukey & Tukey, 1981, implemented by Hansen Simonson in 1991
• Weights added by Mungiole, Pickle, Simonson in 1999
• Algorithm is median-based moving average of a specified # of nearest neighbors for each point
• Available free on http://srab.cancer.gov/headbangas standalone or callable C++ function
![Page 23: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/23.jpg)
HIV Mortality Rates, 1988-92Original data:
Rate/100,0007.824 - 172.2345.002 - 7.8243.57 - 5.0021.991 - 3.570 - 1.991
Rate/100,0006.4 - 48.94.6 - 6.43.7 - 4.62.9 - 3.70.8 - 2.9
Smoothed, unweighted
Rate/100,0008.2 - 97.15.7 - 8.24.7 - 5.73.4 - 4.70.6 - 3.4
Smoothed, weighted
Source: Pickle, et al. Atlas of United States Mortality. NCHS, 1996
![Page 24: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/24.jpg)
![Page 25: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/25.jpg)
![Page 26: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/26.jpg)
![Page 27: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/27.jpg)
![Page 28: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/28.jpg)
![Page 29: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/29.jpg)
![Page 30: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/30.jpg)
![Page 31: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/31.jpg)
![Page 32: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/32.jpg)
Exploratory Data Analysis0
2040
6080
![Page 33: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/33.jpg)
![Page 34: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/34.jpg)
Spatial Data Modeling
• ESDA maps may show patterns; whether they are real or random noise is unknown
• Only inferential statistics can answer that question
![Page 35: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/35.jpg)
Spatial Data Modeling StepsData collection& preparation
Model identification
Modelassessment
Parameter Estimation
& model fitting
OK?Interpretation
of results& fit
DoneYes
No
![Page 36: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/36.jpg)
Unit of Analysis• What is your fundamental unit of analysis? Tumor or individual case
or aggregated case counts?• How to choose
– What is appropriate for hypothesis to be tested?– Availability of data – sometimes no choice, e.g., SEER records
have little information on each patient, so we need to use sociodemographic information for aggregated units
• Aggregated analysis = “ecologic”– Advantage: often the only available data for analysis, especially for
data considered confidential– Disadvantage: “ecologic fallacy” = bias of results because of
heterogeneity within the groups, i.e., associations at group level may not hold for individuals
![Page 37: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/37.jpg)
Spatial Point Processes• Statistical models useful for spatial cancer data
– Poisson for case counts, e.g., incidence or mortality– Bernoulli or logistic binomial for individual cases,
proportions (e.g., % late stage), prevalence• Count data are typically overdispersed
“pure” Poisson:di ~ Pois(niλ) where di = # cases, ni = population
at risk in place, λ = intensityE(di) = Var(di) = niλ (i.e., mean = variance)
However, we usually observe Var(di) = φniλ, φ>1so we need to account for this in the model
![Page 38: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/38.jpg)
Accounting for Overdispersion• Overdispersion can arise from clustering of the population, e.g., in
families, neighborhoods • Estimating overdispersion
– Directly by modeling variance as φ times usual variance– Add random effects to the model (extra variance components)
• Possible random effects for a simple model: yi = α + βXi + εi
• For each random effect, need to specify its distribution, e.g., βi ~ N(β*, γ)
VarianceComponents:
Randomcoefficients
Errors incovariates
Usual residualvariance
![Page 39: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/39.jpg)
Random Effects, Tract Within County
X
YCounty mean*
* **
**
*
Fixed effects only
X
YCounty mean
Tract 1
Tract 2
Tract 3
Random intercepts only
X
YCounty mean
Tract 1Tract 2
Tract 3
Random slope onlyX
YCounty mean
Tract 1Tract 2
Tract 3
Random slope & intercept
![Page 40: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/40.jpg)
Spatial Statistical Models Differ
• Spatial data are usually spatially correlated, not independent (as assumed for simple regression)
• Tobler’s First Law of Geography: “Everything is related to everything else, but near things are more related than distant things.” (Tobler 1969)
![Page 41: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/41.jpg)
Modeling Spatial Correlation• Consider simple fixed effects regression model:
yi = α + βXi + εi , εi ~N(0,σ2)• This assumes that errors εi are independent, all with
common variance σ2
• If errors are spatial correlated, then εi ~N(0,Σ) where Σ is a variance-covariance matrix that describes their spatial dependencies
• Σ can be defined in terms of distances or neighbors• One common form of Σ = σ2 σ2[exp(-distance/ρ)]
same σ2
where ρ is measure of correlation based on distance.
![Page 42: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/42.jpg)
More Assumptions . . .
• Statistical tests are based on assumptions.• Here, we assume a Poisson distribution of our
data. We also assume this spatial process shows• Stationarity: the spatial dependence structure
(covariance) is the same everywhere, i.e., the covariance depends only on distance or adjacency of any 2 places
• Isotropy: the spatial autocorrelation does not depend on direction
![Page 43: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/43.jpg)
Variograms and Semivariograms
• The variogram is the variance of the difference of two data values separated by a distance h in a particular direction: var(Z(s+h)-Z(s)).
• The semivariogram is ½ the variogram.
• Both are functions of the separation distance, h, also called the spatial lag.
• The shape of these functions reflects the spatial similarity observed in the data.
Source: Waller and Gotway, 2004
![Page 44: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/44.jpg)
How Do Semivariograms Help?
If A and B are positively correlated, Var(A-B) will be smaller than if they were independent.
If this variance (and correlation) changes with distance between the points, then there is spatial autocorrelation
![Page 45: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/45.jpg)
Semiovariogram
distance
sillγ
nugget
rangeObs. within this distance of each other are spatially autocorrelated
(Variance/2 = γ)
Source: Waller & Gotway, 2004
![Page 46: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/46.jpg)
Computation
• Software available for random & spatial errors– SAS PROC GLIMMIX– S+/R non-linear mixed effects (nlme) function– WinBUGS, MLwiN – full Bayesian MCMC
modeling.. • Iterative estimation can require many hours to run,
sometimes won’t converge at all • Complex models (with spatial covariance, random
effects, interactions,...) require larger sample sizes than fixed effects regression models
![Page 47: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/47.jpg)
VCR Study Results 1
Table 1. Results of statistical modeling of V irginia prostate cancer incidence at tract level.
A frican A m ericans W hites
V ariables Effect p-value Effect p-value
Age (years)
<50 -6.7734 <0.0001 -6.5974 <0.0001
50-74 -0.5407 0.0086 -0.6459 <0.0001
>75a - - - -
aReference category. bBetween 100% and 200% poverty.cAt least four years of college. dLess than a high-school education.
![Page 48: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/48.jpg)
VCR Study Results 2Table 1 contd. Results of statistical modeling of Virginia prostate cancer incidence at tract level.
African Americans Whites
Variables Effect p-value Effect p-value
aReference category. bBetween 100% and 200% poverty.cAt least four years of college. dLess than a high-school education.
Poverty
<10% -0.1495 0.6984 0.4414 0.0114
10-19% -0.3818 0.1837 0.3963 0.0122
>20%a - - - -
% Near poorb 0.0082 0.2583 -0.0083 0.0218
Median household income 0.4987 0.0187 0.1850 0.0027
Education ( persons aged >25 years)
% Highc -0.0398 0.0926 -0.0024 0.7360
% Lowd -0.0012 0.8283 -0.0077 0.0028
![Page 49: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/49.jpg)
VCR Study Results 3Table 1 contd. Results of statistical modeling of Virginia prostate cancer incidence at tract level.
African Americans Whites
Variables Effect p-value Effect p-value
Rural
<10% 1.3537 0.0012 1.6311 <0.0001
10-19% 0.0430 0.9733 -0.6474 0.2379
>20%a - - - -
% Female head of household
0.0109 0.0653 0.0217 <0.0001
% PSA screened -0.0008 0.7935 0.0006 0.5600
MD ratio 0.00001 0.9518 0.0002 0.0010
aReference category. bBetween 100% and 200% poverty.cAt least four years of college. dLess than a high-school education.
![Page 50: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/50.jpg)
VCR Study Results 4Table 2. Results of statistical modeling of Virginia prostate cancer incidence at county level.
African Americans
Whites
Variables Effect p-value
Effect p-value
Age (years)
<50 -6.6549 <0.0001 -6.6260 <0.0001
50-74 -0.4954 0.0017 -0.7335 <0.0001
>75a - - - -
![Page 51: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/51.jpg)
VCR Study Results 5Table 2 contd. Results of statistical modeling of Virginia prostate cancer incidence at county level.
African Americans
Whites
Variables Effect p-value
Effect p-value
Poverty
<10% -0.1815 0.6058 -0.1267 0.527010-19% -0.2231 0.2026 -0.0066 0.9618>20%a - - - -% Near poorb -0.0145 0.2124 -0.0024 0.7769
Median household income -0.1775 0.4188 -0.1714 0.2268Education ( persons
aged >25 years)
% Highc 0.0062 0.7340 0.0159 0.1788% Lowd -0.0067 0.4122 -0.0025 0.6457
![Page 52: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/52.jpg)
VCR Study Results 6Table 2 contd. Results of statistical modeling of Virginia prostate cancer incidence at county level.
African Americans
Whites
Variables Effect p-value
Effect p-value
Rural
<10% -0.8064 0.1184 -0.2178 0.4675
10-19% -0.3974 0.4125 -0.3433 0.1965
>20%a - - - -
% Female head of household
0.0029 0.8586 -0.0045 0.6396
% PSA screened 0.0023 0.1571 0.0011 0.2205
MD ratio 0.0000 0.9533 0.0000 0.0709
aReference category. bBetween 100% and 200% poverty.cAt least four years of college. dLess than a high-school education.
![Page 53: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/53.jpg)
Conclusions• Socioeconomic position affects incidence of prostate
cancer• Differential effect among African Americans points to
possible racial discrimination – lack of access, biased delivery of clinical services
• Increased incidence among African Americans not due to socioeconomic measures evaluated in this study
• Oliver MN, Smith E, Siadaty M, Hauck FR, Pickle LW. A spatial analysis of prostate cancer incidence and race in Virginia, 1990-1999. Am J Prev Med. January 2006.
![Page 54: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/54.jpg)
Limitations
• VCR not a gold-standard cancer registry; problems with data collection
• Only used 74% of VCR data, with large fall out of rural dataOliver MN, Matthew KA, Siadaty M, Hauck FR, Pickle LW. Geographic bias related to geocoding in epidemiologic studies. Intl J Health Geogr. 2005;4:29.
• Incidence rates calculated using average 1990 and 2000 populations
• Spatial and temporal lag: a general challenge for GIScience in public health
![Page 55: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/55.jpg)
Spatial Data Modeling Steps -- LessonsData collection& preparation
Model identification
Modelassessment
Parameter Estimation
& model fitting
OK?Interpretation
of results& fit
DoneYes
No
![Page 56: Spatial Data Modeling with GIS and SAS - Long School of ...familymed.uthscsa.edu/research08/pcrmsc/20th_2005/presentations/16... · Spatial Data Modeling with GIS and SAS M. Norman](https://reader034.vdocuments.net/reader034/viewer/2022052607/5a72c3787f8b9aac538ddc78/html5/thumbnails/56.jpg)
Spatial Data Modeling with GIS and SAS
Thank you, and good luck!