Page 1: Open Pseudonymisation Project Julia Hippisley-Cox, 2011

Open Pseudonymisation Project

Julia Hippisley-Cox, 2011

www.openpseudonymisation.org

Page 2

QData Linkage Project

• QResearch database already linked to:
  • deprivation data
  • cause of death data
• Very useful for research:
  • better definition & capture of outcomes
  • health inequality analysis
  • improved performance of QRISK and similar scores
• Decided to link to additional data sources

Page 3

QResearch Linkage Project

Data source: Content
• Hospital Episode Statistics: inpatient, outpatient, A&E, maternity
• Cancer registry: cancer type, grade, stage
• MINAP ‘Myocardial Ischaemia National Audit Project’: heart attack type and treatment

Page 4

New approach pseudonymisation

• Need approach which doesn’t extract identifiable data but still allows linkage
• Legal, ethical and NIGB approvals
• Secure, scalable
• Reliable, affordable
• Generates IDs which are unique to project
• Potential to be used NHS-wide as an open source common specification

Page 5

Pseudonymisation: method

• Scrambles NHS number BEFORE extraction from clinical system
• Takes NHS number + project-specific encrypted ‘salt code’
• One-way hashing algorithm (SHA2-256)
• Applied twice in two separate locations before data leaves EMIS
• Apply identical software to external dataset
• Allows two pseudonymised datasets to be linked
• Can’t be reverse engineered
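
The method above can be illustrated with a minimal Python sketch. It is not the project's software: the function names, the salt value, the sample NHS numbers, the whitespace normalisation and the order in which the number and salt are combined are all assumptions made for illustration. The salt is passed in as plain text (decryption of the ‘salt code’ is not shown), and the hash is applied once, because the slides do not specify how the second application is performed.

```python
import hashlib


def pseudonymise(nhs_number: str, salt: str) -> str:
    """Hash an NHS number with a project-specific salt (illustrative only)."""
    # Assumption: strip spaces so differently formatted numbers hash alike.
    normalised = nhs_number.replace(" ", "")
    # Assumption: the salt is appended to the number before hashing.
    return hashlib.sha256((normalised + salt).encode("utf-8")).hexdigest()


def pseudonymise_dataset(rows: dict, salt: str) -> dict:
    """Re-key a {nhs_number: record} mapping by pseudonymised ID."""
    return {pseudonymise(nhs, salt): record for nhs, record in rows.items()}


# Two made-up datasets held by different organisations.
SALT = "example-project-salt"  # hypothetical value
gp_data = pseudonymise_dataset({"111 111 1111": {"source": "GP"}}, SALT)
hospital_data = pseudonymise_dataset({"1111111111": {"source": "hospital"}}, SALT)

# Linkage uses only the pseudonymised IDs; no NHS number is exchanged.
linked_ids = gp_data.keys() & hospital_data.keys()
```

Because both sides run identical software with the same salt, the same NHS number yields the same ID in each dataset, while a different project salt yields unrelated IDs.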
