lukáš marek - spatial clustering and multivariate statistics in analysis of infectious diseases

24
This presentation is co-financed by the European Social Fund and the state budget of the Czech Republic Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases Lukáš MAREK

Upload: swenney

Post on 29-Nov-2014

282 views

Category:

Education


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

This presentation is co-financed by theEuropean Social Fund and the statebudget of the Czech Republic

Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

Lukáš MAREK

Page 2: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Why Geographical Information Systems?

Advanced methods for spatial analyses

Spatial statistics

Exploration of spatial pattern

Visualization and presentation for non-geographers

(doctors, specialist)

Page 3: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Spatial Analyses of Health Data - AOI Disease mapping Visual description of spatial variability of the disease incidence Maps of incidence risk, identification of areas with high risk

Geographic correlation studies Analysis of associations among the incidence and

environmental factors

Analyses of spatial pattern Exploration of spatial and spatio-temporal patterns in data Disease clusters, randomness, …

Page 4: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

require specific procedures because of their confidentiality management, presentation and operations

aggregated, anonymized or incomplete data sets

usage of suitable analytical procedures, while the uncertainty and the inaccuracy of data characteristics need to be taken into account during the analysis and interpretation of results

Health and Medical Data

Page 5: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Health and medical data = private, confidential and sensitive data

Keeping all available records but prevent their re-identification

Usefulness of the local scale analysis X privacy protection

Unlikely to explore the relations on the individual level (and not necessary)

Availability, accessibility and restrictions

Data Privacy

Page 6: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Software

Page 7: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Disease occurence as the event Space, time, attributes Spatial (point) pattern

Spatial evaluation of infectious diseases in Olomouc Region, Czech Republic Parotitis, Salmonella, Viral intestinal infections

Spatial evaluation of Campylobacter infection in the Czech Republic

Objectives

Page 8: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Campylobacteriosis Campylobacter bacterium (C. jejuni) Frequent Often foodborne Symptom are similar to salmonella Poultry or fresh milk products

Page 9: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Data Collaboration with Regional Public Health Service in Olomouc

and the National Institute of Public Health EPIDAT database Mandatory records about infectious diseases and patients,

manually fulfilled Age, Sex, Date, Profession, Place of residence, infection, isolation,

… 2008 - 2012 ≈ 100 000 records (weakly) Anonymized

Aggregation Hexagons with the size of average cadastral unit, administrative

units EUROSTAT population grid

Page 10: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 11: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Time

Page 12: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Disease mapping

Page 13: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 14: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 15: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Identification of Spatial Processes Estimation of the nature of phenomenon Plenty of methods Global vs. Local Graphical vs. Numerical

Testing of Complete Spatial Randomness Or other spatial processes

Scale dependency Bayesian modelling

Page 16: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Spatial Pattern

Page 17: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 18: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Local Moran‘s I with EB Rate

Page 19: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Multivariate Clustering Similarity searching in

attribute space

Classification of areas with related properties of occuring diseases and their parameters

Classification of similar cases

Without spatial relations

Page 20: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 21: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Page 22: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Problems / Challenges Geocoding Aggregation Age / Population standardization Neighbourhood estimation Modifiable areal unit problem Probability distribution of the disease occurrence Underlying processes Under / Overestimation of results leading to

misinterpretation

Page 23: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Future work Spatio-temporal analysis (More of) Bayesian modelling and smoothing Multivariate statistics with spatial relations

Page 24: Lukáš Marek - Spatial Clustering and Multivariate Statistics in Analysis of Infectious Diseases

2nd InDOG Doctoral Conference, 14th October - 15th October 2013, Olomouc

Thank you for your attention