umi - university of toronto t-space · 2020. 4. 7. · acknowledgements the ultimate completion of...

143
NOTE TO USERS This reproduction is the best copy available. UMI

Upload: others

Post on 05-Feb-2021

5 views

Category:

Documents


0 download

TRANSCRIPT

  • NOTE TO USERS

    This reproduction is the best copy available.

    UMI

  • BAYESIAN HIERARCHICAL MODELS FOR MAPPING LUNG CANCER MORTALITY IN ONTARIO

    Marc-Erick Thériault

    A thesis submitted in conformity with the requirements for the degree of Master of Science

    Graduate Department of Comrnunity Healt h University of Toronto

    Copyright @ 2000 by Marc-Erick Thériault

  • National Library I * m of Canada Bibliothèque nationale du Canada Acquisitions and Acquisitions et Bibliographie Services services bibliographiques

    395 Wellington Street 395. rue Wellington Ottawa ON K1A O N 4 Ottawa ON K I A ON4 Canada Canada

    The author has granted a non- L'auteur a accordé une licence non exclusive licence allowing the exclusive permettant à la National Libraxy of Canada to Bibliothèque nationale du Canada de reproduce, loan, distribute or sel1 reproduire, prêter, distribuer ou copies of this thesis in microform, vendre des copies de cette thèse sous paper or electronic formats. la forme de microfichelfilm, de

    reproduction sur papier ou sur fonnat électronique.

    The author retains ownership of the L'auteur conserve la propriété du copyright in this thesis. Neither the droit d'auteur qui protège cette thèse. thesis nor substantial extracts fkom it Ni la thèse ni des extraits substantiels may be printed or otherwise de celle-ci ne doivent être imprimés reproduced without the author's ou autrement reproduits sans son permission. autorisation.

  • Abstract

    B-WESIAX HIERARCHIC,AL MODELS FOR ILIAPPIXG LUNG C-WCER

    MORTALITY IN ONT-4FUO

    Marc-Erick Thériault

    Master of Science

    Graduate Department of Community Health

    University of Toronto

    3000

    Mapping rare disease incidence or mortality using maximum Likelihood estimates may

    yield misleading results when regions have a low number of events or small sample size.

    Sloreover. problems in mapping may arise wlie~i the number of events in an area is large

    when frequentist pvalues are mapped. This thesis look at how Bayesian randoni effccts

    estimates and postèrior probabilities might be used co offset the above problems tvliich

    tend to visually dominate a map. Within the Bayesian framework. a spatially structured

    mode1 is considered which allows prior information of spatially similar risks to be used.

    The techniques are illustrated using lung cancer mortality data for Ontario. A simulation

    study. based on the actual data, ?vas used to examine the effects of both population

    size and fised relative risks on mean square error (including b i s ) and the ability to

    distinguish high and low risk areas. Results show that mean square error is reduced in

    Bayesian estimates at the cost of some bias. Maps of Bayesian posterior estimates and

    Bayesian probabilities offer more consistency than their traditional counterpart.

  • Dedication

    In memory of my Grandparents

  • Acknowledgements

    The ultimate completion of this thesis depended on many different people. Even though

    onIy a few are mentioned here, others were integral.

    I'd first like to thank my supervisor Dr. Michael Escobar for his guidance, experience

    and insight. I'm most thankful for his enthusiasm in taking on a topic that 1 really

    wanted to pursue.

    Many thanks go to a great group of statisticians that made up the cornmittee: Dr.

    Peter Austin, Dr. George Tomlinson and Dr. James Stafford. They provided thorough

    reviews and much input throughout the entire defense process.

    Without the support of Dr. Paul Corey, who initiated the part-time program in

    1995, this thesis would not have even gotten off the ground. 'Cheers' to a great Prograni

    Direct or!

    I'd also like to thank Dr. Toni Basinski. who got nie interested in the topic. Dr. Jack

    Williams and Dr. David Naylor. for their support and use of the data. These Scientists

    were essential to in? entry and continued success in the program.

    Thanks to BH. LJ? DR, RF. TA, CS. RS, PO. HH. TKD and CT for al1 their help

    and support. during the ups and dotvns. ovec the years.

    Finallu, I'd like to thank my parents for their constant encouragement and uncondi-

    tionsl support. Thanks to m. brothers and sister rvlio were always there for me. This

    would not have been possible ttithout rny family.

  • Contents

    1 Introduction

    2 Background. Notation and Models 3

    2.1 Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

    . . . . . . . . . . 2.1.1 Frequentist Ma~i rnu rn Likelihood Mode1 (;LILE) 4

    . . . . . . . . . . . . . . . . . . . . . . . . 2.2 Hierarchical Bayesian models d a

    '2.2.1 Gibbs Sampling . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

    2.3 Random Effects Models . . . . . . . . . . . . . . . . . . . . . . . . . . . S

    . . . . . . . . . . . . . . . . . 2.3.1 Spatially Unstructured Mode1 (RE) 9

    2.3.2 Spatially Structured Xlodel (CAR) . . . . . . . . . . . . . . . . . 10

    . . . . . . . . . . . . . . . . . . . . . . . . . 2.4 Estirnates and Probabilities 12

    3 Preliminary Analysis 14

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1 1lLE anaiysis 14

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 RE analysis 20

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3 CAR analysis 23

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.4 Final cornparison 32

    4 Methods 36

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-1 SimuiationStructure 36

    . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Cornparison of Estimates 39

  • . . . . . . . . . . . . . . . . . . . 4.3 Cornparison of Map Probability values 40

    5 Simulation Results 41

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1 Estimates 41

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1.1 M E 42

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1.2 Bias 46

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1.3 Iterationmaps 50

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2 Map Probability values 62

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2.1 ROC curves 62

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2.2 Iteration maps 66

    6 Discussion 76

    7 Conclusions 80

    Appendices

    A Reference map

    B ANCOVA output 88

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.l >ISE output 85

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.2 Bias output 91

    . . . . . . . . . . . . . . . . . . . . . . . . . . . B.3 Area under ROC output 95

    C Results for Estimates Analysis 98

    . . . . . . . . . . . . . . . . . . . C.1 Tabular Results for Estimates Analysis 98

    . . . . . . . . . . . . . . . . . . C.2 Graphical Results for Estimates Analysis 104

    D Results for MP-value Analysis 110

    . . . . . . . . . . . . . . . . . D.1 Graphiral Results for ROC curve Analysis 110

  • E Cornputer programs 116

    E.1 BUGS prograrns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

    E.L.1 REmodel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

    E.1.2 CAR mode1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . I l i

    E.2 SAS Programs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

    . . . . . . E.2.1 Simulate re1atii.e ïisk aad nin BUGS for Gibbs szmpler 117

    . . . . . . . E.2.2 Calculate relative risk estirnates from Gibbs samples 125

  • List of Tables

    Lung Cancer Mortality, Ontario 1996, Maximum likelihood estirnates of

    relative risk (MLE) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

    Surnrnary statistics for relative risk of Lung Cancer Mortality, Ontario

    1996, Bayesian spatially unstructured model (RE) . . . . . . . . . . . . .

    Sumrnary statistics for relative risk of Lung Cancer Mortality, Ontario

    1996, Bayesian spatially structured model (CAR) . . . . . . . . . . . . .

    SIean and variance estimates for models used in prciirninary nnalysis . .

    ..KCOVA results for USE . . . . . . . . . . . . . . . . . . . . . . . . . .

    ANCOV.4 results for Bias . . . . . . . . . . . . . . . . . . . . . . . . . .

    Frequency (and percentage) of model risk estimates assigned to 'High'

    areas (ttl=49) and correct assignment to preassigned 'High' areas (ttl=9)

    Ai'COVA results for area under the curvc . . . . . . . . . . . . . . . . .

    Frequency (and percentage) of model MP-values assigned to 'High- areas

    (ttl=49) and correct assignment to preassigned 'High' areas (tt l=9) . . .

    Percentiles for model estimates . . - . . . . . . . - . . . . . . . . . . . .

  • List of Figures

    Lung Cancer Mortality. Ontario 1996. Maximum likelihood estimates of

    relative risk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

    Lung Cancer Mortality. Ontario 1996. P-values based on two-tailed test

    under Poisson assumption do. . = 1 . . . . . . . . . . . . . . . . . . . . . . 17 Boxplots and Lineplots comparing MLE vs . Bayesian " fixed" (Ieft) and

    frequentist random effects vs . Bayesian RE (right) . . . . . . . . . . . . . 21

    Histogram of l 0 ~ ( 0 ; \ ' ~ ~ ) . . . . . . . . . . . . . . . . . . . . . 22

    Bayesian relative risk estimates of Lung Cancer llortalit y. Ontario 1996.

    Spatially unstructured mode1 (RE) . . . . . . . . . . . . . . . . . . . . . 24

    Bayesian 3IP-values. Spatially unstructured mode1 (RE) . . . . . . . . . 25

    Bayesian relative risk est irnates of Lung Cancer Slortality. Ontario 1996 . Spatially structured model (CAR) . . . . . . . . . . . . . . . . . . . . . . 26

    . Bayesian SIP.values Spatially st ructured mode1 (CAR) . . . . . . . . . . 2 1 Bosplots and Lineplots of relative risk estimates comparing \[LE vs . Bayesian

    RE (left). Bayesian RE vs . Bayesian CAR (middle). Bayesian CAR vs .

    . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . MLE (right) 33

    4.1 Simulation rnap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

    5.1 Bosplots For 0 wlierc eo = i . 1. 3 (roms) and O0 = 1.5: 2.5.4.0 (columns) . 43 5.2 SISE vs Ba . eo IF analysis met hod . . . . . . . . . . . . . . . . . . . . . . 47

  • 5.3 MSE vs Bo. eo by area assignment . . . . . . . . . . . . . . . . . . . . . . 48

    5.4 Bias vs Bo. eo by analysis method . . . . . . . . . . . . . . . . . . . . . . 51

    5.5 Bias vs Bo. eo by area assignrnent . . . . . . . . . . . . . . . . . . . . . . A

    5.6 MLE maps for 0 where eo = and Bo = 1.5 . . . . . . . . . . . . . . . . .

    5.7 RE maps for 8 where eo = ! and Bo = 1.5 . . . . . . . . . . . . . . . . . . A

    5.8 CAR maps for B where eo = f anci 4 = 1.3 . . . . . . . . . . . . . . . . .

    5.9 MLE rnaps for where eo = 1 and Bo = 1.5 . . . . . . . . . . . . . . . . .

    5.10 RE maps for where e~ = 1 and Bo = 1.5 . . . . . . . . . . . . . . . . . .

    5.11 CAR maps for 0 where eo = 1 and = 1.5 . . . . . . . . . . . .

    5.12 ROC curves for e where eo = f , 1 ' 3 (rows) and Bo = l .5 ,2 .5 .4 .0 (columns) 5.13 Area under curve vs Bo, eo by analysis method . . . . . . . . . . . . . . .

    5.14 MLE maps for MP-value where eo = f and $ = 1.5 . . . . . . . . . . . .

    5.15 RE maps for MP-value where eo = $ and Bo = 1.5 . . . . . . . . . . . . .

    5.16 CAR maps for NP-value wliere eo = f and $ = 1.5 . . . . . . . . . . . .

    . 5 .K MLE niaps for MP-value where eo = 1 and Bo = 1.5 . . . . . . . . . . . .

    . . . . . . . . . . . . . 5-18 RE maps for NP-value where eo = 1 and $ = 1.5

    . . . . . . . . . . . . 5.19 CAR maps for MP-value where eo = 1 and do = 1.5

    h.1 Ontario reference map of census divisions . . . . . . . . . . . . . . . . . .

    C.1 Bosplots for eo = 4 = 1.5.2.5.4.0 . . . . . . . . . . . . . . . . . . . . .

    C.2 Boxplots for eo = i. Bo = 1.5.2.5.4.0 . . . . . . . . . . . . . . . . . . . . . C.3 Boxp lo t s fo re~=1~Oo=1 .5~2 .~ .4 .0 . . . . . . . . . . . . . . . . . . . . .

    C.4 Boxplots for eo = 2. 4 = 1.5.2.5.4.0 . . . . . . . . . . . . . . . . . . . . .

    C.5 Boxplots for eo = 3. 60 = 1.572.57 4.0 . . . . . . . . . . . . . . . . . . . . .

    D.1 ROC curves for eo = $.Bo = 1.5.2.5.4.0 . . . . . . . . . . . . . . . . . . .

    D.2 ROC curvcs for eo = $.Bo = 1.3.2.5.4.0 . . . . . . . . . . . . . . . . . . . - D.3 ROC cirn-ris for eu = 1.19~ = 1.3.2.5.4.0 . . . . . . . . . . . . . . . . . . .

  • D.4 ROC curves foreo = 2 , & = 1.5,2.5,4.0 . . . . . . . . . . . . . . . . . . . 114

    D.5 ROCcurvesfore~ =3:$=l .5 .2 .5 ,4 .0 . . . . . . . . . . . . . . . . . . . 115

  • Chapter 1

    Introduction

    blapping standardized rates and p-values are common approaclies to analyze grograpti-

    ical variation in disease rnortality and incidence. Generally. tliese rnetliods attempt to

    idcntify arcas which are truly differeiit from eacli otlier. This practice is used to providc

    useful information sudi as generating etiology Iiypotheses. Differcnces iii rates tiave beeri

    historically tested using frequentist methods. However. some of these methods are prone

    to problems. Triie differences may be difficult to determine when areas with small nurn-

    bers of subjects at risk have a greater chance to have a large estimated effect size due

    to chance alone. Moreover, areas may appear significantly different since regions with

    larger sample sizes give more power to detect 'deviant* rates. These areas tend to visiially

    dominate a map. Therefore. smoothing or stabilizing rnethods are used to minimize in-

    stability of estimates and probabilities. yet mnintaining geographic resolution. Generally

    boundaries are drawn according to established definitions. Disease trends may or may

    not follow these arbitrary boundaries. The prior belief of disease etiology or transmission

    ma!? be incorporated in Bayesian rnetliods.

  • solved. Spatial statistics is an area mhich has benefited from the use o f tliese techniques.

    The purpose of this study is to demonstrate to the reader tliat Bayesian random effect

    estimates of relative risk and posterior probabilities are less prone to the problems en-

    countered by traditional frequentist est imates and pvalues. hlso. random effects models

    extend nicely to incorporate a spatial component which is readily feasible with current

    Bayesian techniques and software. This study compares both tiequentist and Bayesian

    methods along with both fixed and random effects (emphasis being on the random e f

    fects model). The first Bayesian method assumes independent random effects that are

    spatially unstructured. The second Bqesian method assumes conditional random efFects

    based on regional adjacency This rnodel is called the conditional autorcgressivc model.

    h simulation study is used to esamine mean square error (including bias) and a

    rnethod's ability to distinguish between high and Inw risk areas wliile controlling risk

    estimates and data size. Visiial iiispection of maps of estiniates and map probnbility

    values ( NP-rilues) will also be iised to examine the diffcrenccs be twen t tic nict liods.

    Kork by Bernardelli et al. (1995) [5] used a simulation stiidy to esiiniinc relat i~e risk

    estimate mapping. The results indicate that the fully Bayesian approach is superior to

    the conventional approach of mapping maximum likelihood estimates and p-values. This

    thesis espands work of Bernardelli et al. [3]: by incorporating more extensive simulations

    and addressing issues in mean square error and bias with both graphical and numerical

    techniques to examine differences between methods.

    This thesis is divided into seven chapters. The current chapter addresses the rationale

    and aim of the thesis. The next chapter provides background. notation and introduces

    the model information relevant for the remainder of the thesis. Chapter three provides

    a case study outlining an initial assessrnent of the data and the clifferent rnodels. The

    rnethocls of the simulation study are ciescribcd in cliapter four. The foUowing chapter

    prcsciits tlic resiilts of t lic! siniiilat ion arid t liosc resiilts arc sytitliosizd iri t hc cliaptcr sis.

    Coric.liisioris arc. c1r:iwti iii t liv f i ~ i i i l c h p r PI-.

  • Chapter 2

    Background, Notation and Models

    This chapter provides an introduction to the relevant background for this thesis. Tlie

    first section introduces basic definitions and the frequentist estimate for relative risk.

    The next section introduces Bayesian theory and current techniques in obtaining solu-

    tions to Bayesian probleins. Random effects niodels. whose estimates ma? be consiciered

    shnnhge estimators. are introduced in the following secticin. The raiidoni effects motlcl

    is the basic framework for both the spatially structured and unstructured rnodels. The

    next section introduces the spatially structured conditionally autoregressive model. That

    section discusses how Bayesian methodology incorporates the spatial concept into a ran-

    dom effects model. The final section section discusses how the Bayesian estimates and

    probabilities are calculated in the thesis. Background litereture review. forma1 notation

    and definition of the models rvill be made throughout the sections in this chapter.

    2.1 Definitions

    The Statistics Canada county definition was used for the maps in this tkicsis. It cic-

    fines forty-nine contiguous arcas in the province of Ontario. Let ol~scrrcrl cotirits O =

    (O1. . . . . O,,). n = 49 bc niut iiaily intlepcncl ~ l ) ~ ( . t < ' < [ cr)ilt ltS it l l

  • CHAPTER 2. BACKGROUND, NOTATION A N D MODELS 4

    unknown area specific relative risks of niortality. In this thesis. the expected counts

    are standardized to the overall age and geiider distribution of the data. Assuming the

    average rate of the event of interest over i areas for j age groups and k genders is

    where nijk is the total number a t risk. The expected number of events for an area is then

    defined by

    Therefore, assurning a non-contagious and rare outcorne. the likelihood of the relative

    risk Bi is defined by

    p(Oi 1 B i ) = e-'lE1 ( m ) O 1 O,!

    In this thcsis. the overall rate f,k is obtained from the obsened data and not a standard

    reference table. Event though the joint distribution of Oijh is multinomial~ Waller et al.

    (1997) (referring to Xgresti (1990) pp.455-456) state that likelihood inference is unaffected

    by conditioning on xi O i j k Therefore? it is common practice to retain the product Poisson likelihood. .Ml methods in this thesis rely on the Poissa(Bi Ei) likelihood. The

    first method is frequentist and the final two are Bayesian.

    2.1.1 Frequentist Maximum Likelihood Model (MLE)

    The estimate that maxirnizes the likelihood (2.1) is

    This niasimum likelihood est imate (LILE) is also known as the st antlardizcd mort ality

    ratio (S.\ IR) (Bmslou- anci Da\- 1857 [il) for tlic it" area. This rstiniatr of risk is spatialIy

    tiiistriictiircxl aiid is tlir orily frr~qiicrit~ist iiititho(1 ri.;iliiatcd i r i tlio siriiitliitioii stiiclc

  • r\ssurning that 0, are random Poissm(E,B,) variables. the map probability ualve

    (MP-value) is defined as

    This value is a cumulative probability and not a typical p - value. The definition above

    is used to allow cornparisons with the Bayesian rnethods.

    2.2 Hierarchical Bayesian models

    Let O = (OL, . . . '0,) be random variables with joint density function po(O 1 8) where

    8 = (O1, Oz, . . . ,O,) is a real-valued parameter lying in 8 = (8, x e2 x . + . x O,). Né assume 8 to be a set of open sets. Let E = ( E l . . . . . En) be constant. Let pe (0 ( u ) be a pnor

    distribution function defined on 8 given real-valued hyperparameters & = ( I ! : ~ . . . . . c,).

    The uncertainty in the value of the parameter 0 prior to ohserving O is clescribed by

    a. Similarly, let p , (ci..) = (p,, (&,). . . . . pu,, ( l~r,)) be hyperprior distributions describing one's uncertainty in @, where ri, is a real-valued parameter lying in open set = (QI x

    x - - - O - x ap). Using Bayes' Theorem. Ive define a hierarchical posterior dist7-ib~tion of B given O as

    The calculation of the posterior distribution relies heavily on the calculation of the above

    integrals.

    Moreover. the posterior eqectation

  • or moments require more integral calculations.

    Until recently. the calculation of the posterior distribution and posterior espectation

    have been a major lirniting factor in the practical use of Bayesian techniques. One pas-

    sible solution to calculating the posterior density is to use asymptotic methods. Normal

    approximations such as the Bayes ian Central L i m i t Theorem method fail to pick up skew-

    ness in the posterior. This may then iead to problematic konfiàence' intervais. Wide

    the more complicated Laplace method (Tierney and Kadane 1986 [29]) yields more ac-

    curate results, the posterior distribution must be unimodal. Both methods require a

    large nurnber of data points. While both of tliese methods may be overcome. they can

    be problematic in high dimensions. The expectation-mazimization (EM) algorithm of

    Dempster et al. 1977 [ll] is a practical method to obtain the posterior mode. but not

    the entire posterior distribution. The Gibbs sampler [12] allows one to sample values

    from the full joint posterior distribution using a SIarkov Chain. With the auailability

    and practicality of Gibbs sampling software. full' Bayesian analysis has I>ecomt. niore

    popular. Therefore. only the fully Bayesian analysis will be assessed in this tliesis.

    2.2.1 Gibbs Sarnpling

    Consider the n+p dimensional problem where the n+p parameters are (81, . . . . &,W1?. . . , d p ) . Gisen the a~ailable data. the primary interest is the marginal posterior distribution of

    each B i . Area specific surnrnary statistics such as the posterior mean or median (ie. pos-

    terior espectation defined above) c m then be mapped. Thereforc. $yen the full joint

    posterior distribution, the marginal densities can be readily computed as required. The

    Gibbs sampler is described by the following iterative algorithm:

  • CHXPTER 2. BACKGROUND. NOTATION A N D ~ I O D E L S

    (11 ( 1 ) ( 1 ) ( 1 ) (0) (01 ) Draw 8, p ( 0 , ( O. 0, . O? . . . . . On+ kvl . . . . . up . ( 1 ) ( 0 ) Draw 8'1') - p(ul 1 0.01?6)4". . . . .On . r2 . . . . . CF)).

    This describes one iteration in the Gibbs sampier. After an initial burn-in of B

    iterations, where the sampier reacbes convergence, arid a irl fur tiier i trra

    for each of the n + p parameters are obtained. Therefore, using the a above,

    (O(,? O:'\'). . . . . op, y * . , &y 1) -b ri (el. 02,. . . .en' krl . . . . . C;J

    lgori t hm defined

    - p . . . . 0 . . . . vP 1 O ) as J I -+ cc.

    are Monte Car10 estimates (Pnr (B i ( O). 1 O)) of marginal posterior distributions

    The above algorithm requires the calculation of full conditional distributions in which

    each of the n + p parameters are drawn from. The full joint posterior distribution can be

    sampled more easily when one specifies conjiigate distributions for po(O 1 8) . po(O ( dr)

    and p&) [15. 161. The fi111 conditional distribution are not reqiiired by B W S software

    [28] to sample the full posterior distribution. Generally, one needs only to set up the like-

    lihood and the prier distribution in the BUGS prograrn. Given samples frorn the full joint

    posterior distribution. estimates of the marginal posterior distributions nere esarnined.

    Tliere arc niaiiy tecliniqiics iised to cictermiric convergence of the G i h sanipling cliaiii.

    111 t h tliesis. only visiial inspcctiori of trace plots [vas prrforrncd.

  • CHAPTER 3 - BACKGROUND. NOTATION AND MODELS 8

    The Bayesian met hods discussed above assume observations to be condit ionally inde-

    pendent given parameters that incorporate spatial dependence or independence via a

    prior distribution. This approach borrows strength from al1 the data to allow unsta-

    ble risk estimates to be shrunk toward overall or local means. Tlie Bayesian methods

    incorporate the use of random effects models.

    2.3 Random Effects Models

    Often, with a combinat ion of rare disease and areas witli siiiall population. the observed

    number of events is greater then espected under Poisson inference 131. The extra-Poisson

    variation is the variation in escess of that captured by thc likeliliood aime. This ariscs

    from an area's heterogeneity in the rate of disease risk. It is reasonable to think tliat each

    area has its own mortality rate which depends on other non-rneasurable factors. Random

    effects models can captures both Poisson and estra-Poisson nriation. thereby obtaining

    estimates which better reflect the true heterogeneity of the relative risk [3.4. 5.91. Tliese

    estimates of random effects models can be referred as shrinkage estimators. The extra-

    Poisson variation is rnadelled t hrough a prier distribut ion wi t h dispersion hyperpararneter

    O*. This parameter represents the variation in true relative risks. Both Bayesian models

    discussed in t liis t hesis allow the component of estra-Poisson variabili ty to be modelled.

    The major differencc between fully Bayesian and empirical Bayes analysis is the way this

    parameter is estimated. The fully Bayesian rnethod allows the iincertainty abolit o2 to

    be carried over to the estimates of the relative risks [SI. Even with this difference, the

    relative risk estimates O btained by bot h ernpirical Bayes and full y Bayesian techniques

    have been demonstrated to yield similar results [4. 721.

    hltlioiigh cstablishecl for some time in analysis of variance. rnncloni effects niodels have

    siirgrcl in the fields of niiiltilel-el inocielling and lorigitiiclinal cL;itii atialysis. SAS pro~icles

    PROC MIXED [ZG] ii~id Splits providcs lme [?O] for riiodrliing raticlor~i ~ ~ f i w s h o a r iiiodrls.

  • As part of tlie preliniinary nnalysis. 1 provide a brief esa~iiination of tlie frequentist risk

    estimates for ranclom effects iising the S.AS macro G L I M M I X (iised for geiieralized linear

    models). but do not apply it to the simulation study.

    Consider the following model

    where p is the overall mean and ci is the iLh area specific random effect. Typicall- the

    assumption of random versus fixed effects is based on whether the analyst believes the

    effects are sampled from some population. Although the counties in a province may

    not be thought of as a sarnple from a population of counties, the use of random effects

    models. in tliis thesis. is used to obtain smoothed estimates which rnay othenvise be

    considered 'unstable or 'noisy' when modelling through a fixed inodel. In fi~ct. the

    m ~ ~ i m u r n likelihood estirnate 2.2 is the solution to the fixed effects model.

    Within the Bayesian framework. the random effects can be modelled througli a lcvel

    of hierarclv via a prior distribution. In ttiis thesis. the ci term is inodcl!ed with a

    Normal distribution. The Bormal distribution is easily extended to adjust for area level

    covariates by adding fixed effects with a #xi term. Recall the data esamined here has

    been standardized for age and sex. The simulations in this thesis do not apply any

    covariates. The two Bayesian rnodels differ in the way the random effects are modelled.

    The different priors on the random effects will refiect the analyst's prior belief about

    the kind of geographical variation expected in relative risk. The first Bayesian mode1

    assumes a spatially unstruct ured model while tlie second mode1 is spatially st ruc t ured.

    2.3.1 Spatially Unstruct ured Mode1 (RE)

    The spatially unstructured or heterogeneity random effects model (RE) for log relative

    risk (Claytori and Kaldor 1987 [SI) is defincd Iiierarcliiçally as follows:

  • This mode! d l ~ w s the area specific random effect hi to be modelled about a Normal

    (CCh, rh) distribution where ph may be interpreted as the overall mean p in (2.5) and rh

    (= &), called the precision, is the overall variability of random effects. The fully Bayesian uh

    method allow one to investigate the uncertainty about both ph and T~ by adding another

    level to the hierarchy as seen in the last two lines of (2.6) with the use of hyperprior

    distributions. In this thesis. values of hyperparameters &. & 06 and ,8,h are chosen such that ph and rh are modelled with diffuse or non-inlonative prior distributions. The

    use of hyperprior distributions highlights the major advantage over frequentist random

    effects models. The ability to capture uncertainty in ph and rh highlights the advantage

    over empirical Bayes methods. Both models exploit the use of random effects models.

    The random effects are 'shrunk' towards the overall mean with the amount of shrinkage

    depending on both prior (and hyperprior) distributions and the amount of data observed

    j through the likelihood).

    The full conditional distributions can be found in Mollie 1996 (221. For the random

    effect hi, given all other random effects h-i, the conditional distribution for the spatially

    unstructured model is defined by

    (hi - 4)' P(hi ( h - i , ~ , ~ E ) a exp Oihi- Eiexp{hi) - 2'7;

    2.3.2 Spatially Structured Model (CAR)

    Frequestist methods have been established for some time in the area of spatial statistics

    (Ripley 1981 [24]: Riple~- 1988 [25] and Cressie 1993 [101). An autoregrcssi~e xnoclcl to

    acc:ount for regional iltljacc11c:y. iutroducecl b>- Bcsag 1974 ['I]. is tliscussc~rl in i d l of t lw

  • CHAPTER 3 . BACKGROUND, NOTATION A N D ~ ~ O D E L S 11

    aboce rcferences. k i n g empirical Bayes methods. Clayton aiid Kaidor 1987 [9] apply the

    model in mapping log relative risks. Their model assumed tliat the conditional variance

    of an area effect, given its neighbouring areas, to be constant. This may not be strictly

    appropriate for maps with varying number of neighbours. Besag et al. 1991 [3] proposed

    a model whereby the conditional variance for an area i given al1 other area log relative -7 risk is inversely proportional the number of areas adjacent to it. 1 nis moàei continues

    to be extended and applied, in the Bayesian context [[4? 5, 22, 23, 301.

    If i is an area of interest, we define w, = 1 if area i is adjacent to area j and

    wi, = O othenvise. The conditionally autoregressive model (CAR) requireç that the joint

    distribution of 10g(0~ ) , given Zog(0,) and hyperparameter oz. is Normally dist ributed wit h

    mhere j belongs to the set of neighbours of i. c, is tlie i lh area specific rnndom effect.

    1 u!ii = x:=, w,, and Et = , xy~, w,, c, The inverse variance-covariance mat rix is de fined by

    Equivalent 1- in terms of precision rC (where T = 5 ) the inverse variance-covariance matris is defined by

    The Bayesian CAR mode1 is thercfore defined hierarchically as

  • CHAPTER 2 . BACKGROUND. NOTATION AND ~ ~ O D E L S

    where ni is the number of adjacent counties to county i, j belongs to the set of neighbours

    of i and ui+ = xy,, wij- The random effects are conditional on the average of the adjacent area random effects. Therefore, the effects are 'shrunk' towards local means

    where, again. the amount of shrinkage depends on priors and the amoiint of data. Note.

    the area specific variation (r:) is the proportion of overall variation of conditional random

    effects as required by (2.7).

    For the random effect ci, given al1 other random effects c+ the conditional distribu-

    tion for the spatially structured model is defined by

    U& (c* - ci)- P(ci 1 c-,;O.O:) N esp Oit* - E,esp {ci) - 3 'L

    - o c

    2.4 Estimates and Probabilities

    When convergence has been attained from the Gibbs sampler, area specific relative risk

    can be estirnated by the sarnple median (8:-4'ES) or sample mean of the log relative risks

    defined by

    where bl is the number of samples for the ith area. Meng 1957 [21] proposed the use of

    posterior.probabilities from Bayesian randorn effects models as a itirthod to combat the

    miiltipic coiriparisoris prol~lcm. Bernarcielli et al. 1992. 1995 [A. 51 iipplied this concept

  • CHAPTER 2. BACKGROUND. NOTATION AND ~ I O D E L S

    If the sample global rnean is defined as

    the Bayesian MP-value is defined as

    This may be defined as the posterior probability of an area specific relative nsk being

    greater than the global mean [4, 5, 211. The MP-value can then be estimated by

    where indicator function I ( . ) = 1 if espression is tme. I ( . ) = O otherwise.

    Similarly, in terms of the median (ë), the MP-value may be defined by

    whicli is then estimated by

    After completing M Gibbs cycles on n area specific relative risk estirnates. the global

    median (or mean of log relative risk) is obtained from the n x M Gibbs samples. Each

    sarnple value is tested against the global median. The estimated MP-value represents

    the proportion of times the area specific relative risk is greater than the overall median

    relative risk. Therefore. a relative risk much larger than the global rnedian tends to yield

    a large proportion. The use of the global mean or median is arbitrary and up to the

    analyst to decide which constant to use for the calculation of the posterior probabilities.

    Although another method is to compare each area specific effect against the mean (or

    median) within each Gibbs cycle. Bernarclclli et al. 1995 [5] used the global rnean. For

    simplicity and robustness. both estimates and SIP-valiics werc bascd on nicdiaiis in tliis

  • Chapter 3

    Preliminary Analysis

    This chaptcr introduces the preliminary analysis used to examine cancer mortality data

    for Ontario. Maps of estimates and probabilities will also be presented throughout this

    chapter: for the models of specific interest. The first section looks at the traditional

    maximum likelihood analysis. This section will provide a comparison 114th an analogous

    Ba1:esian model. The second section will look a t the unstructured random effccts model.

    Although the Bayesian model is of primary interest, this section provide a comparison

    with a frequentist random effects model. The next section incorporates the spatially

    structured autoregressive model to the data. The final section provides surnmary of

    estimates for both overall mean and variance. A visual comparison of the estimates

    from one model from each of the previous sections. These three models will be further

    esaniined in the simulation studv of the following chapters.

    3.1 MLE analysis

    Lung cancer mortality data was estracted from the Ontario Cancer Registry database.

    h coliort aas identified as tliose persons. age 50 years and older. witli a diagnosis of

    lung cancer in 1996 and dyirig by the end of the sanie year. The ohscrvcd and cspected

    niirithcr of clmtlis. 11asrd on '2.1. for cadi coiirity apprar in Tal~le 3.1. Tlic traclitional

  • inaximum likelihood estimates (MLE defined by 2.2) and p-values based on a trvo-tailed

    test against uni@

    also appeax in Table 3.1. -4 reference map is provided in Appendix A as Figure A.1. This

    maps labels the geographic region with the county identifier (CD) in Table 3.1. There is

    an average of 4.4 (median of 4) neighbous for each county. Both Essex County (37) and

    Sudbury Regional Municipality (53) have only one adjacent neighbour and Wellington

    County (23) has nine neighbours.

    The largest relative risk estirnate is 2.21 in Prince Edward County (13) with p-value

    c 0.001. The smallest relative risk estimate of 0.66 belongs to Perth County (31) with

    p-value = 0.07. The number of observed deaths Vary from a high of 789 in the Toronto

    area (20) to a low of 5 in Manitoulin District (5 1). These two counties have risk estimates

    that are very similar, but only Toronto has a statistically significant result ( p d u e <

    0.001). A rnap of the relative risk estimates appear as Figure 3.1. Estimates larger tlian

    1.30 appear as red areas. Lighter shades of red reflect areas of gradually lower risk.

    Similarly, areas whose estimates were smaller than 0.70 appear as blue. Lighter shade

    of blue reflect the gradua1 increase in relative risk. The white areas indicate regions

    where the relative risk estimate falls between 0.90 and 1.10. The legend contains the

    distribution of counties. Recall there are 49 counties. There are 8 counties, scattered

    throughout Ontario. with risk estimates larger then 1.30. There appears to be 5 distinct

    clusters of 'high' risk counties. One cluster with 3 counties. another with 2 and the rest

    consist of only 1 area. Note counties 57, 56, 54, 48 and 49 are relatively high risk counties

    that are adjacent to Sudbury District (52) which is relatively low risk. Sudbury District

    (52) district encornpasses Sudbury Regional Municipality (53) which is relatively higb

    ri&. There does not seem to be an\- apparent continuity of the relative risk estimates in

    the map.

    Thcrr arc 11 çourities ni th p-wlticis lcss thari 0.05. Eight of tliosr coiintics han! risk

  • Figure 3.1: Lung Cancer Mortaüty, Ontario 1996, Mzuimum likelihood estimates of

    relative risk

  • Figure 3.2: Lung Cancer Mortaüty, Ontario 1996, P-values based on two-tded test under

    Poisson assumption Bor = 1

  • Table 3.1 : Lung Cancer Mortalit. Ontario 1996. Maxi-

    mum likelihood estimates of relative risk (MLE)

    CD County name Population O bserved Expected SMR P-value

    STOEtMONT

    PRESCOTT AND RUSSELL UNITED COUNTIES

    C)TTPPWP--CA-PXLFI"I'Y REGION.4L MUNICWAC'ITY

    LEEDS AND GRENVILLE UNITED COUNTIES

    LANARK COUNTY

    FRONTENAC COUNTY

    LENNOX AND XDDWGTON COUNTY

    HASTLIGS C O W T Y

    PRINCE EDWARD COUNTY

    NORTHUMBERLXXD COUIJTY

    PETERBOROUGH COUKTY

    \TCTOFU COUSTY

    DURHAM REGIOSXL >IUIiICIPALITY

    YORK REGIOXAL MUXICIPALITY

    TORONTO M ETROPOLITAN i\llU?iiCIPALITY

    PEEL REGIONAL MUNICIPALITY

    DUFFERIB COUKTY

    WELLINGTON COUNTY

    HALTON REGIOXAL MUNICIPALITY

    H,AhlILTOS-iVEST1VORTH REGIONAL MU&.

    NIAGARA REGIOSAL blUNICIPALITY

    HALDNAND-NORFOLK REGIONAL MUN-

    BR4XT C O i D T l -

    W.4TERLOO REGION AL MUNICPALITY

    PERTH COUXTY

    OXFORD COC'STY

    ELGIX COUSTl-

    KEST COC'XTY

    ESSES COCSTl*

    LA5iBTOS COVXTY

  • -

    CD County name Population Observed Expected SMR P-value

    MIDDLESEX COUXTY

    HCRON COUXTI*

    BRUCE COUNTY

    GREY COUXTY

    SIMCOE C O U N N

    >iIUSKOKA DlSTRlCT MU'IICIPALITY

    HALIBURTON COUNTY

    RENFREW COUNTY

    NIPISSING DlSTRICT

    PARRY SOUND DISTRICT

    kIXNITOULIX DISTRICT

    SUDBURY DISTRICT

    SUDBURY REGIONXL MUNICIPALITY

    TIblISKXhIING DISTRICT

    COCHRANE DISTMCT

    ALGOMA DISTRICT

    THUNDER B.\Y DISTRICT

    RAINY RIVER DISTRICT

    KENOR4 DISTRICT

    TOTAL 2959380 3921 3921.0

  • estimates greater than 1 and the remaining 3 have risk estimates less than one. A map

    of pvalues appears as Figure 3.2. The legend attempts to combine both statistically

    significant results (via pvalues) with risk estimate information (where 'High' has MLE

    > 1 and 'Low' has MLE < 1). The significantly high counties tend to correspond to the

    map based on estimates. One county in south-western Ontario, Essex county, has pvaliie

    = 0.025, but a relative risk cstimatc of 1.20. This couoty, therefore, did not appear as

    a high rate county on the estimates map. Conversely, the county of Muskoka appears

    'high' in the estirnates map with relative risk of 1.36 and pvalue = 0.067. Therefore,

    this county did not appear on the p-wlue map. Discontinuities appear in this map. as

    did the estimates map.

    For comparison only, a Bayesian model was fit to this model. This Bayesian 'fised'

    efFect model used separate diffuse Normal distributions on each effect. No MP-values were

    calculated. The 11 counties with p-values < 0.05 in the MLE analysis have a 2.5-97.3

    percentile interval that does not capture 1.0.

    Referring to the ieft-most plot Figure 3.3 (the riglit-most will be discussed in the nest

    section), the MLE and Bayesian relative risk estimates are shown for comparison. The

    boxplots (not traditional bosplots) indicate the 2.5, 25, 50, 75 and 97.5 percentiles. The

    Iineplots join estimates between the two methods for the same counties. A reference line

    at 1.0 was added to the plot representing no increase or decreased risk of rnortality.

    Except for a couple of counties, below the the reference line, which appear to diverge

    away from 1, the estimates are very similar.

    3.2 RE analysis

    Recall the assumption that the county specific observed number of cleaths follorvs a Pois-

    son distribution. The saniplc variance to sample niean ratio of courity specific deaths is

    quite liirgf! ( l72.2). S inw t lir k t è i ii[>[>c;ir to c x h i l ~ i t est ra-Poissoii iiiriat iori (owrdispcr-

  • I

    O'Z

  • Figure 3.4: Histogram of log(8yLE)

    sion), i t is reasonable to mode1 using random effects models. A Iiistogram of log relative

    risks appears as Figure 3.1. The log maximum likelihood estimates appear Normally

    distributed with an overall mean of 0.0730 (p=0.019). median of 0.0796 and standard

    deviation of 0.2272. The county specific relative risks were then estimated with the

    Bayesian random effects mode1 (RE) from (2.6). The Gibbs sampler cvas run with a

    burn-in of 200 and a further 1000 samples in the chain were kept. Al1 the chains appear

    to converge within 50 iterations. The summary statistics of the Gibbs samples for each

    county appear in Table 3.2. T h e county specific median appears to be slightly less or

    equal to the mean. The MP-value (2.12) was also calculated. The global median SMR

    from al1 the county specific Gibbs sarnples was 1.053. There are 4 counties with a 2.5

    percentile which are greater than 1.0. Those counties each have an MP-value greater

    than 92%. There are 3 counties with a 97.5 percentile lower than 1.0. Each of those

    counties have an MP-value close to 0%. These T counties \vit h a range falling outside of

    1.0 are al1 captured in the SILE analysis with a p-value < 0.05. In the LILE analysis.

    the 4 remaining counties with ii pvaliic less than 0.05 with 1.0 falling bettveen ttic 2.5

    and 9ï.-j percriitilc. in tlic RE ariitlysis (coiititirs 12. 54. .X and 59). tiad hlP-\-aliies froni

  • 0.783 to 0.908. -4s indicatetl by tlie IIP-value. these counties have relative risk esrimates

    larger t han 1 .O.

    The estimates from the random effects model are mapped in Figure 3.5 using the

    same range defined in the map of maximum likelihood estimates in Figure 3.1. There

    appears to be much shrinkage to the overall mean. There is only one county (13) with a

    relative risk estimate greater than 1.3. and ün1y 3 counties with an cstimntc less than 0.9

    (19,20,21). With 30 of 49 counties in the 0.90-1.09 range, it is very difficult to make any

    comments about spatial continuity in risk estimates. The map of MP-values, in Figure

    3.6, appears spatially discontinuous and similar to the MLE pvalue map. Figure 3.2.

    Generally, the lowest risk counties are in the southern part of the province.

    A frequentist model mas fit to this model for comparisons with the Bayes estimates

    only. It will not be examined beyond this section. The mode1 rvas fit with the GLIMMIX

    macro in SAS. The model revealed one significantly high and two significantly loir. randorn

    effects for counties 01. 19 and 20 respectively. The second plot of Figure 3 .3 displays

    both the frequentist and Bayesian relative risk estimates for cornparison. The relative

    estimates appear very similar with slightly more shrinkage occurring in the frequentist

    estimates.

    3.3 CAR analysis

    The Bayesian spatially structured random effects model (CAR) from (2.8) was also mod-

    elled. The sumrnary statistics of the Gibbs run for each county appear in Table 3.3. The

    MP-values in the table is based on a global median of 1.077. There are 7 counties with

    a 2.3 percentile is greater than 1.0. The 4 counties identified. in the RE analysis with

    the sanie critcria. are captured here. Those counties each have ari NP-ralue greater tlian

    89%. TTlicre are 3 countics mitli a 97.5 perceritilc loncr ttian 1.0. Tlicsci coiirities are

    the saine as tliosc iclcntificd i ~ i tlicl RE ;itiiilysis. Tlir cstiniattls arc riiiippctl iri Figiirc

  • Figure 3.5: Bayesian relative risk estimates of Lung Cancer Mortaüty, Ontario 1996,

    Spatially nnstructured mode1 (RE)

  • Figure 3.6: Bayesian MP-dues, Spatially unstnictured mode1 (RE)

  • Figure 3.7: Bayesian relative risk estimates of Lung Cancer Mortality, Ontario 1996,

    Spatially stnictured mode1 (CAR)

  • Figure 3.8: Bayesian MP-dues, Spatially structurecl mode1 (CAR)

  • Table 3.2: Surnmary statistics for relative risk of Lung

    Cancer L l ~ r t a l i t y ~ Ontario 1996. Bayesian spatially un-

    structured mode1 (RE)

    CD Mean STD 2.5% 97.5% Median M P - d u e

  • CD Mean STD 2.5% 97.5% Median MP-value

  • Table 3.3: Summary statistics for relative risk of Lung

    Cancer blortality, Ontario 1996, Bayesian spatially stmc-

    tured mode1 (CAR)

    CD Mean STD 2.5% 97.5% Median MP-value

  • CD Mean STD 2.5% 97.5% Median MP-value

  • 3.7 with the range consistent with the previous two maps (Figures 3.1 and 3.5). There

    appeae to be more variation as cornpared to the RE model. The amount of shrinkage

    appears to be les : but there also appears to be local smoothing with more continuity

    between adjacent counties. Recall the counties around Sudbury District. The relatively

    low county (52), in the MLE analysis, has been locally smoothed to relatively high. There

    are 21 coünties in the 0.30-1.09 range cornparcd t o 13 and 30 for estimztec of the MLE

    and RE models, respectively. The MP-value value map for this model appears in Figure

    3.8. There appears to be sorne spatial continuity, but not as much as the estimates map.

    The counties in the south have very low probabilities. Some discontinuity appears in the

    south-east part of the province. The Sudbury District county relative risk estirnate is

    smoothed slightly upwards.

    3.4 Final cornparison

    Estimates of overall mean and variance appear in Table 3.4. The values are on the

    log scale. There is mked evidence that the overall mean is different from zero. The

    intervals for the Frequentist empirical and random effects do not capture zero, while the

    interval for the Frequentist fked mode1 and the Bayesian spatially unstructured random

    effects include zero. The frequentist randorn effects models has smaller variance than the

    Bayesian RE model. Since this is a fully Bayesian model. the larger variance rnay be due

    to uncertainty carried over from the hyperprior distributions. As rnentioned earlier. an

    advantage of a fully Bayesian analysis allows one to obtain an interval for the dispersion

    parameter. The variance for the Bayesian CAR model is larger then the Bayesian RE

    rnodel. This is due to the spatial constraints placed on the random effects.

    For tlic rernainder of t his thesis. frequentist SILE. Ba-esian RE (spatially unstruc-

    turccl) and Bayesian CAR (spatially structtired) niodcls are consitlered. Figure 3.9 dis-

    plays t lie I>os/lirie plots ancl provide a coniparat ive \-i(?w o f t lie ii11ioiirit aiid dire

  • Table 3.4: SIean and variance estimates for models used

    in preliminary analysis

    Frequentist Bayesian

    1 ModeÏ 1 Parameter 1 Estirnate %%CI 1 Estirnate 2.5%-97.5% 1

    ! Sample std std ) 0.2272 - 1 - - 1

    shrinkage for both the RE and CAR models. The arnount of spread in the risk estimates

    coincide with the values in Table 3.4. There was much shrinkage in the estirnates from

    the MLE model to both the RE and CAR models. This [vas also observed in the maps.

    The distribution of the RE model estimates appears symmetric while the MLE and CAR

    estimates appear skewed.

    The leftmost graph displays how the BILE estimates were slirunk to an overall mean

    of the the Bayesian RE model. Larger relative risk estimates are 'pulled' downward while

    smaller estimates have been 'pulled' upwards. The lines corresponding to the 2 largest

    MLE risk estimates have been shrunk dramatically by the RE model. These 2 counties.

    13 and 59: have counts of 29 and 16 respectively York Regional Municipality (19). which

    has the second to smallest MLE risk estimate and a count of 120. was not greatly affected

    by the RE model. Perth County (31) ivith a cocint of 19 (the lon?est risk estimate). has

    been shrunk upwards to cross several lines including York's line. ..\fso, note that several

    counties go from having a risk l e s than one to gea te r than one.

    The middle graph shows that the CAR model display more variation than the RE

    model. The added heterogeneity is due to the spatial constraints place on the randorn

    effects. The estimates do not appear to have the sanie ranks. Therefore. tliere is sonie

    stirinkige occurririg ttiat is not toiviisds a single overall tricari. like t h RE riio(k4.

    Fixed effects

    &dom effects

    p

    p

    0.1665 (-0.2422,0.5752)

    0.0582 (0.0004,0.1160)

    - - 0.05605 (-0.0032?0. 1151)

  • The third graph compares estimates between the CAR and hILE niodels. Most of the

    lines have been shriink from the MLE inodel to the CAR model. -4 few ?VILE model risk

    estimates greater than one became larger under the CAR model. A couple of low risk

    counties in the 'VILE model have been 'pulled' upwards by quite a large margin. Recall

    Sudbury District (52) in the MLE and CAR maps (figures 3.1 and 3.7) which was greatly

    3nccted 5' its neighbouring countier.

  • Chapter 4

    Methods

    Based on the preliminary results of the previous chapter, this chapter outline the meth-

    ods used for a simulation study to examine mean squared error, bias and the ability

    to distinguish between high and low risk areas. The first section describes the overall

    structure of the simulatioris including the developmeiit of a simulation map. The nest

    section introduces the definitions of mean sqvared error and bias to allow for coniparisons

    between the different methods. The final section describes the use of receiuer operator

    curves (ROC) for cornparing MP-values for different methods.

    4.1 Simulation Structure

    Recall the two frequentist based maps in Figures 3.1 and 3.2. Using the resiilts from these

    two maps. a simulation map was created based on the the union of the dark red areas

    in these maps. This approach identified 'hot spots' or areas with either large estimates

    (which rnay be unstable) or significantly high risk areas (which rnay be susceptible to

    large number of events). The simulation map appears as Figure 4.1. Red areas represent

    areas which are açsigned as 'higti'. Those areas adjacent to the 'hi&' areas arc coloured

    grey and labeled as 'boundarf areas. .\II otlier areas are assigtied -loiv0 clesignatcd by

    a white coloiir oti tlic! niap. Tlicrc are nirie .higli' areas. sistecti %oiincl;iry' arcas and

  • Figure 4.1: Simulation map

  • tmenty-four 'low' areas. Most of the 'low' areas appear as one large cluster in south-west

    region of Ontario.

    The MLE, RE and CAR models were examined under different scenarios. Each sce-

    nario simulated the observed number of events 0' = (O;, . . . ,O&): from a P&SSO~(E ;B~~)

    where Ei = eo x Ei, i = 1,. . . ,49. The Ei7s are the original data while eo is a constant

    multiplier for the expected counts. The multiplier eiiher iriflatts ur deilatrs tlir aiiiu uiit

    of data for each area. In this thesis, eo = f , h, 1,2? 3. The 8 o i ' ~ are relative risks assigned to an area based on the category of 'high', 'boundary' or 'low'. In this thesis. Bo is as-

    signed the values 1.5. 2.5 or 4 for 'high' areas. Al1 other areas ('boundary' and 'lowl) are

    assigned Bo = 1. Therefore, each scenario or experiment is a function of both one OF three

    assigned relative risks, Bo, and one of five data size multipliers, eo, resulting in a total of

    fifteen experiments. Each scenario was simulated 100 times. Within each simulated iter-

    ation, the Gibbs sampler \vas run with a 'burn-in' of B = 300 and M = 1.000 iterations

    to ensure coiirergence to the posterior distribution of the Bayesian models. After visttal

    inspection of sample area specific traceplots, convergence \vas commonlp attained within

    50 iterations. Convergence and diagnostics will not be addressed further in this thesis.

    The simulations can be simplified with the following pseudocode:

    do over SCENARIOS eo x 80 (eo = i, 4,1,2,3; Bo = 1.5,2.5,4) do ITERATION = 1 to 100 f o r each county t

    Er = eo x Ei

    simulate 01 -- Poisson(OoiE~)

    calculate &y LE = Or/E: do over BAYESIAN methods (RE,CAR)

    do b= 1 t o B (Gibbs 'burn-in')

    discard B ! ~ )

    end

    do m= 1 t o M (Gibbs samples)

    store 01'") end

    calculate ~f-"""" = median(0j;"). . . . .O(" ' ) 1 'If )

  • end BAYESIAN

    end ITERATION

    end SCENARIOS

    where i = 1,. . . ,49.

    The SCENARIO and ITERATION steps were done using SAS (~6.12: Windows95). BUGS

    [28] (vO.6O:DOS) was used to perform the Gibbs sampling for the BAYESIAN çteps.

    4.2 Cornparison of Estimates

    The mean squared error (MSE) of relative risk estirnates. 0 = (0,. . . . y &). pooled in

    each of the 'high7, 'boundary' or 'low' areas was used to esarnine cornparisons between

    methods. If the true value of the parameter is Bo = (Oo,i,. . . . 0oeq9) the XlSE is defined

    to be

    where 6 is equai to some statistic. It can be shown that

    where 13iase(9) = E ~ ( & ) - Bo

    With k e d parameter go, the &ISE is estimated under each scenario' for each 'highr,

    'boundarf and .lo+ nreas. by

    where na is the number of areas in either the 'high7, 'boundary' or 'low' group. Similarly

    the bias of ê is estimated by

    For cpantitim comparisons. anaiysis of covariance (;\'\'CO\--\) was used to csamine

    the effefects of estiniatc sim and data sizc on ILSE aiid bias n-hik coritrolliiig for hot11

  • method of analysis (.VILE. RE and CAR) and area type (high. boundary and low). Since

    shrinkage can occur either 'upwards' or -downwards', the absolute value of the bias is

    taken. Due to normality violations, the response was transformed using a log trans-

    formation, for both MSE and bias. Graphical cornpaxisons disptayed boxplots of the

    distribution of estimates for each of the methods. Maps of estimates h m simulation

    iterations under select scenarios were also used to examine the differences.

    4.3 Cornparison of Map Probability values

    To examine MP-values, receiver operator cames (ROC) were used to determines a method's

    ability to distinguish between 'high' and non-high ('boundary' and 'Iow') areas. The Y

    avis represents the proportion of 'liigh' areas greater than a cut-off value c (true positive

    rate). Similady. the X axis represents the proportion of non-high areas greater than the

    same cut-off point (false positive rate). Therefore. a method mit h high distinguishing

    ability results in a curve which is close to both the Y=l and S=O lines. The ROC pro-

    vide graphical cornparison only. The area under the ROC quantifies the distinguishing

    ability. A value closer to 1 indicates high discriminating ability while values closer to .5

    indicate low discriminating ability. The area under the ROC was calculated using the

    trapezoid rule. r\NCOVA was used to esarnine the differences in methods of analysis on

    the area under the ROC curve. The response ivas transformed using the logit transforma-

    tion defined as l o g ( - ) due to normaiiv violations. Maps of MP-values from selected

    scenarios and simulation iteration ri-ere used to esamine the difference in rncthods.

  • Chapter 5

    Simulation Result s

    This chapter presents the results of the simulation study described in the previous c h a p

    ter. Note that logarithmic transformations were used for BISE and IBias( while the logit

    of the area under the ROC was used to stabilize variance in the analysis of covariance

    (ANCOVA). Interactions (and contrasts )are implemented to examine differences in rates

    of increase or decrease. The complete output of rcsults appear in Appendis C. Each scc-

    tion provides the results numerically and graphically. The first section examines ttie

    relative risk estimates with subsections focusing on MSE, bias and iteration maps. The

    final section examines the MP-values with subsections focusing on ROC and iteration

    maps.

    5.1 Estimates

    The numerical summaries (percentiles, estimated MSE and estimated bias) for the pooled

    relative risk estimates by both assignment area and analysis type for each eo x O0 com-

    bination appear in Appendix C. For gaphical cornparison. boxplots displaying the 2.5,

    25' 50 (median). 75 and 97.5 percentiles by each combination also appear in Appendis

    B. Eacli graph plots the relative risk estimates pooled for eacli of the 'liigh' (on top).

    %oiinclary' (in ttie niiddle) and 'loiv' (on the bottom) arcas by cadi a~iiil>-sis nictliotl

  • MLE (on left). RE (in the middle) and CAR (on right). Figure 5.1 displays the boxplots

    in a matrix for 8 where e~ = f y 1.3 (roms) and $ = 1.5.2-5.4.0 (columns). The leftmost

    cohmn identifies the scenarios where Bo = 1.5, the middle column for 80 = 2.5 and the

    nghtmost column for Bo = 4.0. The top row identifies the scenarios where eo = i, the middle row for eo = 1 and bottom row for eo = 3. Note the boxplots in the 'high' section

    s e not disp!ayed QI? the came $cale over fio. For hrevity: only a subset of scenarios were

    displayed here.

    5.1.1 MSE

    Results of the ANCOVA appear in Table 5.1. The highest level represents the main

    effects including some second order interaction t e m s of interest. Sublevel term provide

    cornparisons made with contrasts. The pvalues are not adjusted for multiple conipar-

    isons. Scatterplots of log(L1SE) venus Bo and eo with least squared lines by method of

    analysis appears in Figure 5.2 and by assignment area in Figure 5.3.

    Figures 5.1. 5.2 and 5.3 indicate a decrease in M E with increased eo and an increase

    in kISE witli increased Bo. These results are confirmed statistically significant with

    ANCOVA results in Table 5.1.

    The average log(h1SE) is lowest for the Bayesian methods. RE (-1.553) and CAR

    (- l.487), than the SILE mode1 ( - 1.302). These differences are significant alt hough there

    is no evidence that the two Bayesian methods are significantly different. The scatterplots

    appear to indicate that the different methods of analysis decrease at a different rate over

    eo (pvalue=0.0023). There does not appear to be differences between the two Bayesian

    methods. The two Bayesian methods do appear to be different from the MLE method.

    The scatterplots show that the Bayesian methods increase at a significantly faster rate

    over Bo (pvalue=O.OOOS), than the MLE method. There is evidence that the two Bayesian

    methods are different tliaci the MLE nietliod and no evidence that there arc tliffcrcrices

    hctween t lie two Baysiari met liods.

  • Figure 5.1: Bosplots for è where eo = i! 1.3 (rows) and Bo = 1.5' 2.3.4.0 (columns)

  • The mean log(MSE) is lowest for the 'low' and highest for the 'high' areas The means

    are -1.805. -1.599, -0.939 for loml boundal and high areas respectively. These means

    are significantly different (pwlue=0.0001). There is no evidence that the 'boundary'

    area is different from the 'loiv' area (pvalue=0.1337). There is maginal evidence (p

    value=0.0378) that the rates of decrease by area assignment over eo are difTerent. The

    rate of decrea~e for the two Bayesian methods are not diflermt. There is no evidence that

    the rates of increase of log(MSE) over Bo by assigned area are different (pvalue=0.1842).

    The bottom left graph of figure 5.1 shows that both RE and CAR methods for Bo = 1.5

    and eo = appear to eshibit less variation than the MLE method. Note the RE method

    in the 'high' geographic regions for Bo = l.5,eo = $, 95% of the coverage does not

    capture Bo. This difference is reduced as both Bo and ea increase. SIoreovero the RE

    method displays less variation than the CAR method for low Bo and low eo.

  • Table 5.1: A'JCOVA results for MSE

    Effect F Value P-value

    80

    eo

    MLE, RE, CAR

    MLE vs. RE, CAR

    RE vs. CAR

    High, Boundary, Low

    High vs Boundary, Low

    Boundary vs Low -- -

    O0 xMLE. RC, CAR

    MLE vs. RE, CAR

    RE vs. CAR

    O0 x High, Boundary, Low

    High vs Boundary. Low

    B o t i n d q vs. Low

    eo x MLE. RE, CAR

    MLE vs. RE, CAR

    RE vs. CAR

    eo x High, Boundary, Low

    High vs Boundary, Low

    Boundary vs. Low

  • 5.1.2 Bias

    Results of the ANCOVA for log(1 biasl) appear in Table 5.2. Scatterplots of log(l biasl)

    versus eo and Bo with least squared lines by method of analysis appears in Figure 5.4 and

    by assignment area in Figure 5.5.

    The log(lbias() does not appear to increase or decrease over increasing O0 (gvalue=0.3183).

    There is evicience that iogij bias j j ciecreases as eo increasts.

    The average log((bias1) is -2.616, -1.554 and -1.362 for the MLE, CAR and RE methods

    respectively. There is evidence that these means are different. In Figure 5.1. the bILE

    method appears to be centered around $ witli very little or no b i s . Botli RE and CAR

    rnethods appear to be biased downward for the 'high' areas and biased upward for the

    'boundary' areas. Figure 5.1 also shows that there is very little bias among the 'loi\-*

    areas across al1 scenarios regardless of method. The rates of decrease over eo by niethod

    do not appcar to be different. There is eridence that the rates over O0 by method arc

    different (p~alue=0.03%7). Tliere is some evidence that the IWO Bayesian rnethods rates

    are different.

    There is evidence that the log(lbias1) differs by area assignment. The average log(lbias1)

    is -2.476, -1.775 and -1.280 for the 'low'. 'boundary' and 'high' areas respectively. How-

    ever, the rates of decrease over eo and Bo by area assignrnent do not appear to be different.

  • Figure 5.2: MSE vs O0,etl by analysis method

  • Figure 5.3: MSE vs Bo, ea by area assigrnent

  • Table 5.2: ANCOVA resuhs for Bias

    E ffec t F Value P-value -

    00

    eo

    MLE, RE, CAR

    MLE vs. RE, CAR

    RE vs. CAR

    High, Boundary, Low

    High vs Boundary, Low

    Boundary vs Low

    x MLE. RE. CAR

    MLE vs. RE, CAR

    RE vs. CAR

    x High. Boundary, Low

    High vs B o u n d q , Low

    Boundary vs. Low

    eo x MLE. RE. CAR

    MLE vs. RE, CAR

    RE vs. CAR

    eo x High, Boundary, Low

    High vs Boundary, Low

    Boundary vs. Low

  • 5.1.3 Iteration maps

    bfaps of estimates from four simulated iterations are provided to allow for coniparisons

    and consistency between the methods. These maps were randomly selected. The maps

    are also examined to determine concordance with the simulation map of Figure 4.1:

    focussing on gradation in risk over areas. This is proposed for a visual comparison o n l .

    Relative risk estimates based on MLE, RE and CAR mocieis for four iterations uricler

    scenario eo = f and O0 = 1.5 are mapped in Figures 5.6-5.8. Iteration maps for the

    scenario that closely resembles the actual data, eo = 1 and Bo = 1.5. appears in Figure

    3.9-5.11. The ranges of stiading categories tend to be different as the distributions of

    the relative risk estimates are quite different under the different models. The nurnber of

    regions in each range can be viewed in the legend of individual maps. .A table is provided

    with counts of assigned 'red' areas (out of 49 counties) and coiints of correct assignment

    of 'high' areas (out of 9). The ranges are fixcd. within scenario. to allow for an 'objective'

    comparison.

    The first set of maps in Figure 5.6 displays some consistency with the simulation map.

    The bright red areas indicates areas with relative risk estimates greater than 1.30. The

    large central regions and the south-western tip appear as 'high' areas. Spurious 'high'

    risk regions appear in the south-west. initially set to predominantly 'low' areas.

    The nest set of maps in Figure 5.7 illustrate modelling the snme simiilatrd data under

    the RE assumption. The bright red areas indicates areas with relative risk estimates

    p a t e r than 1.05. The different scale is iiscd to acconimodate the arnount of slirinkage

    that occurred in the estimates, using the random effects model. This set of maps look

    more variable compared to the MLE rnethod. Uçing the same range reveals a slightly

    different number of counties in each. AI1 maps capture the south-west tip as 'high'. The

    large central areas also tend to bc -Iiigh'. -411 rnaps revwl tlic similar spurioiis resiilts

    seen in tlic prc~ioiis set of tniips. Tlicrc appcars to Iw niorr contiiiiiity I~c!t~vc~ri *higli'.

    -Iio~i~i

  • Figure 5.4: Bias vs &, eo by analysis method

  • 0.0 0.5 1 .O 1.5 2 0 2 5 3.0

    Figure 5.5: Bias vs Bo, eo by area assignment

  • Figure 5.8 displays the CAR maps wliich appear smoother witli more continuity

    between areas than the previous sets of maps. The dark red areas indentifies areas with

    relative risk estimates greater than 1.15. The maps appear similar to the RE maps

    but more spatially smoothed and have higher concordance to the simulation map. The

    spurious results in the south-west r e $ m appearc; tn h~ s m o o t h ~ d out in the first and third

    maps. The north-west area (Rainy River District59). originally set as % i g l ~ ' ~ appears to

    have lower risk compared to other 'high' risk areas.

    The relative risk estiniates based on MLE. RE and CAR models for four itcratiotis

    under scenario Bo = 1.5 and ea = 1 are mapped in Figures Z.9-5.11. The SILE maps

    in Figure 5.9 use the same range as the previous scenario. There appears to be more

    consistency with the simulation map with regards to 'high' areas. There are still sonie

    spurious results in the south-west region but not as manu in the previous scenario.

    The RE maps in Figure 5.10 have somc concordance with the simulation map. The'

    also appear to be much tess variable than the same method under the previous scenario.

    The RE maps are noticeably smoother compared to the 'VILE maps. The north-aest

    county in the rnaps appear less consistent with 'high' areas. -411 maps capture the south-

    west tip as 'high'. The large central areas also tend to be 'high' along with the south-

    central Ontario areas. The spurious results seen in the SILE set of maps. has been

    smoothed out somewhat but still appear in the maps. As in the previous scenario. there

    also appears to be more continuity between 'high'. 'boondarf and 'low' areas. Cnlike

    the previous scenario, there appears to be higher concordance with the simulation rnap.

    Figure 3.11 displays the CAR maps which maintain the same range a s the RE maps

    for cornparison. The maps appear to have high concordance with the simulation map.

    Spatial continiiiiy is esemplified in tliis set of maps. The sptirious resiilts in the south-

    wcst area appcar to be snioothcd out niore iii thc secoiid arid foiirtli rnaps. Ttic riorth-

    iwst ami iipprars to corrcdy cirriioristratc~ 'liigli' risk ivliirli \vas iiot t h ciisc in th

  • Figure 5.6: MLE maps for 8 where eo = f and go = 1.5

  • Figure 5.7: RE maps for 6 where e,, = ; and q = 1.5

  • Figure 5.8: CAR maps for 8 where eo = and Bo = 1.5

  • previous scenario. The sinal1 county of Sudbury Regional Sliitiicipality (53) wliich is

    totally surrounded by Sudbury District ( 5 2 ) appears more coiisistent with the simulation

    map than the RE maps.

  • nu m m - ) gzK;i 1

  • Figure 5.10: RE maps for 6 where eo = 1 and Bo = 1.3

  • Figure 5.11: CAR maps for 4 where Q = 1 and Bo = 1.5

  • Table 5.3 is a table of counts after assigning the same range for colour coding to al1

    estimates over each method. within each of the two above scenarios. The counts are

    tabulated for each of the iteration maps esamined above using the ranges defined in the

    RE model in the previous section. An overall proportion is dso provided summarizing

    al1 100 iterations. Under the first scenario, each of the methods appear to have similar

    proportions. In the second scenario, it appears that more areas in the MLE and CAR

    models, were assigned, than the RE model. The MLE and CAR methods pick up more

    correct 'high' assignments. Comparing the RE and the CAR models, there is a 52%

    (12.5 to 19.0) increase in the proportion of areas assigned 'high' with a 35.1% (60.4 to

    81.6) increase in proportion of correct -high' assignments. Even though. overall. the CAR

    mode1 assigns 28.4% (19.0 to 24.1) mare 'higli' risk estimates than the CAR niodel. tliere

    is only a 10.5% (81.6 to 90.2) gain in picking up the correct 'hight risk areas.

    Table 5.3: Frequency (and percentage) of model risk es-

    timates assigned to 'High' areas (tt1=49) and correct as-

    signment to preassigned %gh' areas (ttl=9)

    MLE RE C .AR eo Iteration High Correct High Correct High Correct

    Overall ( 4 . ) (83.6) (49.8) (74.7) (48.2) (81.3)

    1 1.5 1 3 (26.5) 8 (88.9) 5 (10.2) 5 (55.6) 9 (1 .4 ) 7 (77.5)

  • 5.2 Map Probability values

    5.2.1 ROC curves

    ROC curves were used to determine the analysis method's ability to discern between

    'high' vs 'non-high' ('boundary' plus 'low') risk areas. ROC curves were used for graphical

    comparisons. Only a subset of scenarios were displayed here. AI1 ROC curves appear in

    Appendix LI. Figure 5.12 displays the ROC in a matrix. The rom (eo = f ,1' 3) represent

    increasing eo up on the Y axis, while the columns (do = 1.5.2.5,4.0) represent increasing

    Bo dong the X axis. The solid line represents the MLE method while the dotted and

    daslied lines represent the RE and CAR methods, respectively.

    For nunierical cornparison, the area under the ROC curves w r e calculatcd using

    trapezoid rule and AXCOVA was used on data for al1 sceriarios. The resiilts of the

    ANCOVA appcar in Table 5.4. Scatterplots of logit of the area iincler ROC versils eo and

    $ with least squared lincs by methoci of analysis appears in Figure 3.13.

    The curves in Figure 5.12 do not appear to be very differcnt within eacli scenario.

    The CAR lines appears to fa11 consistently outside the RE lines. ünder scenario 4 = 1.5

    and eo = f , the CAR line starts outside then crosses to the inside of the MLE line. then

    cuts outside again. The RE line appears to run parallel with the CAR line, crossing

    outside the MLE line at about 50% Non-high. 90% High.

    .Al1 lines get further away from the diagonal as both $ and eo increase. There is also

    statistical evidence that the overall discerning ability increases with increasing eo and Oo.

    The mean logit of area under the curves was 1.819, 2.196 and 2.247 for the MLE,

    CAR and RE methods respectively. This difference is not statistically significant (p-

    value=0.7654).

    Tliere is no evidence tliat the rate of increase is over eo is differcnt for cadi of tlic

    met tiorls.

  • Figure 0.12: ROC curves for 0 where eo = f ' 1.3 (rows) and $ = 1.5.2.5.4.0 (colurnns)

  • CHAPTER 5. SIMULATION RESULTS

    the MLE mcthod, there was not enough evidence iising .-\XCOV.l ((p=0.1957).

  • Table 5.4: AXCOV-A results for area under the curve

    Effect F Value P-value

    80 67.93 0.0001

    eo 22.91 0.0001

    MLE, RE, CAR 0.27 0.7654

    MLE vs. RE, CAR 0.52 0.4767

    RE vs. CAR 0.02 0.8839

    e0 x MLE, RE? CAR 1-71 0.1957 MLE vs. RE, CAR 3.33 0.0763

    RE vs. CAR 0.08 0.7753

    eo x MLE, RE. CAR 0.09 0.9103

    MLE vs. RE, CAR 0.19 0.6678

    RE vs. CAR 0.00 0.9739

  • 5.2.2 Iteration maps

    Maps of MP-values €rom four simulated iterations are provided under scenarios Bo = 1.5

    and eo = and Bo = 1.5 and eo = 1.

    The MP-value maps in Figures 5145.19 coincide with the previous maps of estirnates

    Figures 5.6-5.9.

    For scenario Bo = 1.5 and e~ = f . the MLE map has areas scattered and discontinuous

    throughout Ontario. There is little consistency, even at 90% probability? with each of

    the other maps and the simulation rnap. The south-western tip does appear in al1 the

    maps as 'high'.

    The RE maps in Figure 5.15 appear more continuous than the MLE rnaps. More of

    the 'high' areas are captured with probability geater than 60%. Spurious resuits still

    exist? especially, in the south-western region. The maps are very sirnilar to estimates

    maps in Figure 5.7.

    The CAR maps in Figure 5.16 maintain tlie same range as tlie RE niaps and also

    appear very similar to estimates map in Figure 5.8. Alore areas are captured as 'Iiigh'

    than the RE method. Again, there is some concordance with the simulation map but

    some spurious results do exist.

    The MLE MP-value maps of scenario Bo = 1.5 and eo = 1 in Figure 5.17 appear to

    capture most of the 'high' areas. Some other areas are sporadically assigned to 'high'

    but the maps appear to be similar to the simulation map. There also appears to higher

    concordance under this scenario than the same method under the previous scenario.

    The RE maps of MP-values in Figure 5.18 appear quite similar to the estimates

    maps of Figure 5.10. These maps have high concordance with the simulation map. The

    'high' areas are captured by this method and the maps are very similar to their hILE

    counterpart maps. Sonic spurioiis rcsults still appear in the south-west arcas.

    Figure 5.19 displays the maps of 1IP-valiies iindcr the CAR asstiiriptioii. Siriiiliir

    to ttic cstirtiatcs tiiaps. tticse riiaps < l i ~ p l i i sortie coticordiiiicc* witli t h siiiiiiliitiori riiiq).

  • Figure 5.13: Area under curve vs Bo, eo by analysis method

  • Figure 5.14: MLE maps for MP-value where eo = f and Bo = 1.5

  • Figure 5.15: RE maps for MP-value whem eo = a and Bo = 1.5

  • Figure 5.16: CAR maps for MP-value where eo = a and Bo = 1.5

  • They also tend to smooth out some of the spurioiis results identified by the precious two

    methods. Most of the 'high' areas are captured by this method and the last map is the

    only one that captures al1 but one area a t 80% probability The CAR method picks up

    more 'high' and 'low' areas as indicated by the red and white areas indicative of higher

    discerning ability.

    Table 5.5 is a table of counts after assigning the same range to al1 MP-values over

    each method, within each of the two above scenarios. The first scenerio shows a 55.2%

    (23.0 to 35.7) increase in the proportion of 'high' assignments from the RE to the C.4R

    model. There was a 35.1% (56.2 to 73.9) increase in the proportion of correct assignments.

    Comparing the CAR model to the iLI LE model? there was a 47.6% (33.7 to 52.7) increaçe

    in 'high' assignment. with a 12.8% (75.9 to 85.6) increase in correct assignnients. In the

    scenario that resembles the data, eo = 1 and Bo = 1.3. there was a 23.0% (17.8 to 21.9) in

    'high' assignments with 13.3% (73.6 to 83.4) increase in correct assignrnents comparing

    the RE and CAR models. Comparing the CAR to 'VILE models. there was a 58% (21.9

    to 34.9) increase in ;tiigh7 assignments. but only a 10.5% (83.4 to 92.3) increase in correct

    assignments.

    Table 5.5: Frequency (and percentage) of mode1 MP-

    values assigned to 'High' areas (ttl=49) and correct as-

    signment to preassigned 'High' areas (ttl=9)

    - -

    !VI LE RE CAR

    eo Bo Iteration High Correct High Correct High Correct

  • Figure 5.17: MLE maps for MP-value where eo = 1 and Bo = 1.5

  • -*

    Figure 5.18: RE maps for MP-value where q = 1 and Bo = 1.5

  • Figure 5.19: CAR maps for MP-value where eo = 1 and 00 = 1.5

  • Chapter 6

    Discussion

    Using both numerical and graphical analyses, it was found that both 'VISE and bias

    decreased while increasing the amount of data (eo). Moreover. increasing the assigned

    'high' relative ri& estimate ( O 0 ) results in increasing bISE and bias neither increased nor

    decreased. The area under the ROC increased tvith an increase in both data and nt111

    relative risk.

    The MSE for the Bayesian methods increased over null relative riskt but \vas consis-

    tently lower than the MSE for the MLE method which remained relatively unchanged

    (particularly in the ' b o u n d q ' and 'low' areas). Ail three methods appear to converge to

    the same MSE for large assigned relative risk. Similarly, as data increased. the Bayesian

    methods (with lower M E ) approached the same MSE.

    The bias analysis indicated that the bias in the Bayesian methods nrere consistently

    higher than the MLE method. This result was noticed over both increased data and nul1

    relative risk. The rates of decrease over the amount of data were simiIar for the three

    different methods. There tvas evidence that the rates were different over null relative risk.

    Noreover. therc was evidenre that the rates for the Bayesian rnethods w r e different. witli

    the CAR methotl increasing (similar to tlie MLE methocl) and ttir clecreasing for tlie RE

    mctho(1.

  • The area under the ROC captured by the Bayesian methods was consistentl- larger

    over both increasing 'high' relative risk esimate and data examined in this stiid. -41-

    though the scatterplots (with the least squared lines) appeared to indicate steeper rise in

    dope over increased null relative risk for the Bayesian methods, there was no statistical

    evidence for this difference. The rates of increase over increased data multiplier were

    relatively similar between al1 methods.

    By increasing the null relative risk in 'high' areas, the magnitude of the difference in

    assigned relative risk increases between 'high' and 'non-high' regions. This is an artifact

    of the simulation study. The pooled estimates for boundary areas consistently lie between

    the low and high areas. Smoothing pulls estimates of the boundary areas farther away

    frorn the null relative risk assigned as one, to values greater tlian one. Sirnilarly. those

    'high' areas are smoothed downcvards below the null relative risk (Oo = 1.5.2.5 or 4).

    The RE method pulls estimates towards a global mean. whereas the CAR method pulls

    estimates towards local means. The amount of pull or smoothing depends on a relative

    weight determined bu the amount of data.

    Recall the tight distributions for the RE method in scenario O0 = 1.5. eo = f . The

    prior distribution on the random effects was a Normal distribution with relatively large

    variance. Even though diffuse priors were used, there was a considerable eRect on the

    estimates in scenarios with small amounts of data (or low e o ) The amount of data did

    not affect the &ISE from the CAR method as much as it did for the RE method because

    the variance was constrained by adjacent areas.

    Fundamental to Bayesian methods is that with the increase in data, the MSE and

    b i s becorne similar to the MLE method. That was shown here. The prior distribution

    becornes 'less important' to the postenor distribution, since the likelihood dominates the

    posterior as data increases [8, 131.

    It was sliown tii21t the cstimatcs from tlic Bayesian nicthotls IKE bitiscd with srnaII

    M E . The iisc of tlic iteration iiiaps iwrc iiiore ticlphil to detcrniiric corisistcricy with tlic!

  • simulation map. The maps became less variable, within rnethod. as both nul1 relative

    nsk and data increased. The Bayesian methods revealed continuity in the maps which

    were more similar to the simulation map than the MLE maps. The Bayesian random

    effects models, through the use of a prior distribution on the effects, allowed one to

    control the MSE while allowing spatial (or non-spatial) continuity. The maps of the first

    scenario appeared quite variable. Although the prior distribution and hyperparameten

    were fked over al1 scenarïos in the simulation study, the analysis require more thought

    within each scenario in practice. The 'tweaking' under each scenano: may have lead to

    more favourable results.

    The south-western tip, Essex county (37). originally 'high', appeared as 'high' in most

    of the iteration maps. There is a t least a 2.5 fold increase in the espected number of

    deaths for Esses county than the nest largest count among 'high' areas.

    Spurious results appeared in many of the maps. These were identified as regions in the

    simulation where number of events was driven largely due to relatively small population

    (E:) . Gelinan and Pricc (i41 state that maps of posterior means (dong with means and

    p-values) are subject to systematic artifacts related to county populations.

    The ability to distinguish between 'high' versus non-high areas appeared consistent

    between the rnethods. The discerning ability of the Bayesian methods may have been

    darnpened by combining 'boundary' and 'low' in the non-high group. Bernardelli et al.

    (1995) [5] showed a marked advantage in the Bayesian methods with the ROC curves.

    However, it is unclear whether the 'boundary' areas were included in the non-high areas.

    As Bo increases, in the 'higli' regions only, the difference between 'high' and non-high

    areas increase. Therefore, by design, al1 methods capture more area under the ROC

    curve. The area under the ROC curves gets larger as eo increases. Recall as eo increases.

    >ISE decreases. -4s this happens? the variance around the estimates decrease. Therefore.

    the 'higli' region S IP-values incrcascs.

    Tlic iteratioii niaps for tlic LIP-~iliics wcrc gcncrally sirriilar to tlic cstirri;it(! rtiaps.

  • The MLE maps had larger MP-values as eo increased. Similarly. the Bayesian methods

    too' had a greater number of larger MP-values.

    Gelman and Price [NI define marginal z-score as a score that measures the discrep

    ancy of the county estimate with respect to its marginal distribution, averaging over

    the unknown county parameter Br. The Bayesian MP-value behaves similarly depending

    on county and global estimates which, in turn, only depend on the samples from the

    marginal posterior distribution.

    Since the MP-value is a function of the estimates, the iteration maps of MP-values

    yield similar results to the estimates maps since Bayesian methods for estimates and

    probabilities synthesize the raw effect and 'sample' size value. This was noticed in maps

    presented in Beranardelli et al. (1992? 1995) [4, 51.

    It is possible that. by design of the simulation study, the ?VILE method is favoured

    over the spatial model. Even though the 'boundary? areas (those areas adjacent to 'high'

    risk areas) are considered. they are always assigned a relative risk of one. As the assigned

    'high' area risk increases. the 'boundary' areas result in relative risk estimates larger than

    one. This may be unfair to the spatial model.

  • Chapter 7

    Conclusions

    Bayesian met hods exploit the use of random effects model to successfiill y reduce variance.

    even thoiigh the estimates of relative risk are biased. Therc did not appear to be an

    increase