building on the tabulation tool - making dwp statistics

18
Mike Payne / Dan Brown: Information, Governance and Security, DWP 3 April 2012 Building on the Tabulation Tool - making DWP statistics more open and accessible

Upload: others

Post on 21-May-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Building on the Tabulation Tool - making DWP statistics

Mike Payne / Dan Brown: Information, Governance and

Security, DWP

3 April 2012

Building on the Tabulation Tool -

making DWP statistics more open

and accessible

Page 2: Building on the Tabulation Tool - making DWP statistics

Our benefit claimant statistics …

Jobseekers Allowance

Income Support

Incapacity Benefit / Employment Support Allowance

Pension Credit

State Pension

Housing / Council Tax Benefit

Cover over 17 million individuals

Publish quarterly

Based on „complete‟ administrative data, back to 1999

Page 3: Building on the Tabulation Tool - making DWP statistics

The DWP Tabulation Tool

Sequence shortened !

•Limit of 1x1 table

• Figures displayed in

thousands

• Disclosure control

applied

• html format

• copy and paste into

Excel, etc.

Page 4: Building on the Tabulation Tool - making DWP statistics

The challenge of #opendata …

• “Data which can be freely used, re-used and redistributed by anyone.”

• “… ensure that privacy is preserved and that personal data is protected.”

(Cabinet Office: Making Open Data Real: A Public Consultation)

Page 5: Building on the Tabulation Tool - making DWP statistics

Dissemination

Production & Publication Privacy Protection

Aggregate & Microdata SDMX Standard

Web-based Analysis

TRUST & EXPLORATION Ad hoc Analysis Data Integration

Web Service Access

Visualization

Publication & Consumption Privacy & Security

Aggregate Data Metadata Rich

Interactive Visualization

EXPERIENCE Guided Analysis User Stimulation Data Confidence

Stat-Xplore Stat-Tab

New Software

• two access routes

• for both casual and

experienced user

Page 6: Building on the Tabulation Tool - making DWP statistics

Stat-Xplore

Visualization

Publication & Consumption Privacy & Security

Aggregate Data Metadata Rich

Interactive Visualization

EXPERIENCE Guided Analysis User Stimulation Data Confidence

Page 7: Building on the Tabulation Tool - making DWP statistics

Dissemination

Production & Publication Privacy Protection

Aggregate & Microdata SDMX Standard

Web-based Analysis

TRUST & EXPLORATION Ad hoc Analysis Data Integration

Web Service Access

Visualization

Publication & Consumption Privacy & Security

Aggregate Data Metadata Rich

Interactive Visualization

EXPERIENCE Guided Analysis User Stimulation Data Confidence

Stat-Tab Stat-Xplore

Page 8: Building on the Tabulation Tool - making DWP statistics

Choose any variable

Build tables interactively,

then download or save

results

Visualize

Stat-Tab

Page 9: Building on the Tabulation Tool - making DWP statistics
Page 10: Building on the Tabulation Tool - making DWP statistics
Page 11: Building on the Tabulation Tool - making DWP statistics

Stat-Tab

Page 12: Building on the Tabulation Tool - making DWP statistics

Stat-Tab

Page 13: Building on the Tabulation Tool - making DWP statistics

SDMX

“Data which can be freely used, re-used and redistributed by anyone.”

• SDMX (Statistical Data and Metadata Exchange) is the

electronic exchange of statistical information.

• uses standard XML format for both data and accompanying

structural metadata.

• changes what has historically been a "push" technology into

a "pull" technology.

• standard used by a variety of international organisations,

including Eurostat.

Page 14: Building on the Tabulation Tool - making DWP statistics

Ensure that official statistics do not reveal the identity of an

individual or organisation, or any private information

relating to them, taking into account other relevant sources

of information.

Code of Practice: Principle 5, Practice 1

“… ensure that privacy is preserved and that personal data is protected.”

Page 15: Building on the Tabulation Tool - making DWP statistics

Protecting Privacy - primary

• (output) Perturbation

– applied dynamically

– to all cells, within all tables (all cells have chance of being perturbed)

• Consistent „record key‟ for each unit record in dataset

• Modulo arithmetic to obtain „cell key‟, used with perturbation look-up table

– each cell receives same perturbation whenever it appears in any table

– guards against repeated requests and differencing

Page 16: Building on the Tabulation Tool - making DWP statistics

Protecting Privacy - secondary

• Additional software features – ability to:

– additional suppression (very low cell counts)

– restrict combinations of variables

– registration of users (with more „rights‟)

– software audit

Page 17: Building on the Tabulation Tool - making DWP statistics

• Phased implementation

• Detailed timetable to be confirmed, but first statistics expected to be

released during second half of 2012

• Looking to form „User Panel‟ – please get in touch if you‟d like to help!

Stat-Xplore / Stat-Tab

Page 18: Building on the Tabulation Tool - making DWP statistics

Mike Payne - 0114 209 8229 ; [email protected]

Dan Brown - 0114 209 8127 ; [email protected]

Building on the Tabulation Tool -

making DWP statistics more open

and accessible