The Use of Administrative Sources for Statistical Purposes Steven Vale United Nations Economic Commission for Europe



What are Administrative Sources?


• Eurostat ‘CODED’ Glossary:

• An administrative source is the organisational unit responsible for implementing an administrative regulation (or group of regulations) for which the corresponding register of units and the transactions are viewed as a source of statistical data.

• Source: OECD and others, "Measuring the Non-Observed Economy: A Handbook"

Narrow Definition

[Diagram labels: Data Sources; Primary (Statistical); Secondary (Non-statistical); Public Sector; Private Sector]

Wider Definition

[Diagram labels: Data Sources; Primary (Statistical); Secondary (Non-statistical); Public Sector; Private Sector]


Administrative sources are sources containing information which is not primarily collected for statistical purposes.


Reasons for this definition

• Increasing privatisation of government functions

• Growth of private sector data and “value-added re-sellers”

• User interest in new types of data

Examples of Administrative Sources


• Tax data

- Personal income tax

- Value Added Tax (VAT)

- Business / profits tax

• Social security data

• Health / education records

• Registration systems for persons / businesses / property / vehicles

• Published business accounts

• Internal accounting data

• Data held by private businesses:

- credit agencies

- business analysts

- utility companies

- telephone directories

- retailers with store cards etc.

The Benefits of Using Administrative Sources

Cost

• Surveys are expensive and a census is even more so, whereas data from administrative sources are often “free”

• Fewer staff are needed to process administrative data, as there is no need for response chasing.

Cost advantage

• UK could save €350m

• Source: Eurostat, Documentation of the 2000 round of population and housing censuses in the EU, EFTA and Candidate Countries; Table 22

Response Burden

• Using administrative sources:

– Reduces the burden on data suppliers

– Allows statistics to be compiled more frequently with no extra burden

• Data suppliers complain if they are asked to provide the same information many times by different government departments

Coverage

• Administrative sources usually offer better coverage of target populations, and can make statistics more accurate:

– No survey errors

– No (or low) non-response

• Better coverage gives:

– Better small-area data

– More detailed information

Timeliness

• Producing statistics from administrative sources can sometimes be quicker than using surveys

• No need for:

– forms design;

– pilot surveys;

– sample design etc.


Public Image

• Making more use of existing data can enhance the prestige of a statistical organisation by making it seem more efficient

• The concept of “Joined-up government” is politically appealing

Quality

• Are data from administrative sources as good as data from surveys?

• Who should judge this?

• How can we measure quality?

• How should we report and communicate quality?

Definition of Quality

International Standard ISO 9000:2005 defines quality as:

‘The degree to which a set of inherent characteristics fulfils requirements.’

What does this mean?

• Whose requirements?

– The user of the goods or services

• A set of inherent characteristics?

– Users judge quality against a set of criteria concerning different characteristics of the goods or services

• Therefore, quality is all about providing goods and services that meet the needs of users (customers)


Quality Measurement

• How can we measure the quality of data from administrative sources?

• There are established methods for measuring the quality of survey data, but these are not always relevant for administrative data

Three Aspects of Quality

• To understand the quality of administrative sources we need to consider:

– Quality of incoming data

– Quality of processing

– Quality of outputs

Metadata

• Knowledge and documentation of the source are vital to help us understand quality:

– How the data are collected

– Why they are collected

– How they are processed

– Concepts and definitions used

– etc…

Legal Frameworks

National Legal Frameworks

• Most statistical institutions have legislation defining their roles and responsibilities, e.g. a ‘Statistical Law’

• Many new statistical laws have been introduced in the last 10-15 years

• Some existing statistical laws have been revised to include provisions for the use of administrative data

National Legal Frameworks

• Some national legal frameworks give more powers than others for access to administrative data

• Historical, political and cultural factors have an impact on national frameworks, so these are not harmonised between countries


International Frameworks

• The European Union is developing a legal framework for official statistics, including references to the use of administrative data

Timing

• Legal frameworks should reflect the desired future position rather than the current reality; otherwise they will quickly become a barrier

• Start planning now for the next revision to the legal framework

Policy Frameworks

Policy Frameworks

• Government policy on data sharing

– How can the national statistical institute influence this?

• Codes of practice

– International

– National

International Codes of Practice

• United Nations Fundamental Principles of Official Statistics

– Principle 5: Data for statistical purposes may be drawn from all types of sources, be they statistical surveys or administrative records. Statistical agencies are to choose the source with regard to quality, timeliness, costs and the burden on respondents


National Codes of Practice

• Can reassure the public that data will only be used for specific and reasonable purposes

• Should be made available to the public, e.g. via the internet site of the national statistical institute.

Organisational Frameworks

Using Administrative Sources in Practice

The use of administrative sources is enabled by a legal framework, in the context of a policy framework

However, these frameworks are not usually detailed enough to cover all the administrative arrangements for access and use

What Else is Needed?

• Some sort of agreement:

- Service Level Agreement (SLA)

- Administrative Protocol

- Contract

- Informal or verbal agreement

- Other type of agreement according to national customs and practices


Agreement Contents 1

• Legal basis

• Names of persons transferring and receiving data

• Detailed description of data covered

• Frequency of data supply

• Quality standards

• Confidentiality rules


Agreement Contents 2

• Technical standards

• Provision of metadata

• Provisions for payment for the supply of data

• Period of agreement

• Contingencies for changes in circumstances

• Procedure for resolving disputes

Technical Frameworks

Technical Frameworks

• Mechanisms for data transfer

– Paper

– Magnetic media

– Secure on-line connection

• File formats

• Data / metadata standards

– XML

– SDMX

– GESMES


Common Problems and Solutions


Public Opinion

• The level of public concern about government departments sharing data varies from country to country

• There is usually some suspicion of the motives for data sharing

• Sometimes public opinion favours data sharing

Solutions

• Adopt and publish a code of practice following international standards

• Clearly stated limits and rules may help reduce concerns

• The principle of the “one-way flow” of sensitive data must be understood by all

Solutions

• Publish cost-benefit analyses of the use of different sources

• It may be possible to claim that data are more secure

– No questionnaires sent by post

– Fewer clerical staff, so fewer people with access to data


Public Profile

• Direct contact with the public via surveys helps raise the profile of the statistical office

• The use of administrative data can reduce contact with the public and awareness of the work of the statistical office

Solutions

• Effective ‘marketing’ of the statistical office and data outputs

• Greater involvement with educational institutions, business groups, and other target customers

Units

• Administrative units may be different to statistical units:

– Job / person

– Tax unit / enterprise

– Dwelling / household

• They may need to be converted to meet statistical requirements

Solutions

• Automatic rules for simple cases

– These must be clear and consistent

• Statistical “adjustments”

– E.g. the statistical unit is persons. The administrative unit is jobs. We know from a survey that working people have, on average, 1.15 jobs. This adjustment factor can therefore be used to estimate persons in employment from jobs

• Profiling
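
The jobs-to-persons adjustment above can be sketched in a few lines of Python. This is only an illustration: the function name and the job count are invented, and only the factor of 1.15 jobs per working person comes from the slide.

```python
def estimate_persons(jobs: float, jobs_per_person: float = 1.15) -> float:
    """Estimate persons in employment from an administrative count of jobs.

    jobs_per_person is the survey-based adjustment factor (1.15 on the slide).
    """
    if jobs_per_person <= 0:
        raise ValueError("jobs_per_person must be positive")
    return jobs / jobs_per_person


# Hypothetical administrative count of jobs, for illustration only.
jobs_in_admin_source = 2_300_000
print(round(estimate_persons(jobs_in_admin_source)))  # about 2 million persons
```

In practice the factor would probably differ by industry, region or period, so a single national factor is the simplest possible case.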


Definitions of Variables

• Administrative data are collected according to administrative concepts and definitions

• Administrative and statistical priorities are often different, so definitions are often different


Unemployment

• Statistical definition (ILO)

– Out of work

– Available for work

– Actively seeking work

• Administrative definitions are often based on those claiming unemployment benefits


Solutions

• Know and document the differences and their impact

• Use other variables to derive or estimate the impact of the difference

• Statistical adjustments during data processing
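
One way to "know and document the differences" is to cross-tabulate the two definitions wherever both can be observed for the same people, for example in a linked survey sample. The sketch below uses unemployment as the variable. The record layout and the four records are invented for illustration, and the administrative definition is assumed to be "claims unemployment benefit".

```python
from dataclasses import dataclass


@dataclass
class Person:
    # Hypothetical record combining survey answers and an administrative flag.
    out_of_work: bool
    available_for_work: bool
    actively_seeking_work: bool
    claims_unemployment_benefit: bool


def unemployed_ilo(p: Person) -> bool:
    """Statistical (ILO) definition: all three conditions must hold."""
    return p.out_of_work and p.available_for_work and p.actively_seeking_work


def unemployed_admin(p: Person) -> bool:
    """Administrative definition assumed here: the person claims benefit."""
    return p.claims_unemployment_benefit


def compare_definitions(people):
    """Count how often the two definitions agree or disagree."""
    counts = {"both": 0, "ilo_only": 0, "admin_only": 0, "neither": 0}
    for p in people:
        ilo, admin = unemployed_ilo(p), unemployed_admin(p)
        if ilo and admin:
            counts["both"] += 1
        elif ilo:
            counts["ilo_only"] += 1
        elif admin:
            counts["admin_only"] += 1
        else:
            counts["neither"] += 1
    return counts


# Invented records, for illustration only.
people = [
    Person(True, True, True, True),      # unemployed under both definitions
    Person(True, True, True, False),     # seeking work but not claiming benefit
    Person(True, False, False, True),    # claiming benefit but not available
    Person(False, False, False, False),  # in work
]
print(compare_definitions(people))
```

The "ilo_only" and "admin_only" counts give a first measure of the impact of the definitional difference, which can then inform any statistical adjustment.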


Classifications

Two scenarios:

1. Same classification system

2. Different classification systems


Same Classification

• Used for different purposes

• May not be a priority variable for the administrative source

• Different classification rules

• Different emphasis, e.g. specific activity rather than main activity


Solutions

• Understand how classification data are collected and what they are used for

• Provide coding expertise, tools and training to administrative data suppliers


Different Classifications

(or different versions of the same classification)

• Not always a 1 to 1 correlation between codes

• Tools are needed to convert codes from one classification to another

Solutions (1)

• Stress the advantages of using a common classification

• Offer expertise to help re-classify administrative sources

• Give early notice of classification changes and help implement them across government

Solutions (2)

• Use text descriptions to re-code administrative data

• Use probabilistic conversion matrices to convert codes

– This results in individual unit classifications not always being correct, but aggregate data should be OK

Example of a conversion matrix

Code 1   Code 2   Weight
0100     01300    100      (1 to 1 correlation)
0101     01210     26
0101     01221     14
0101     01222     29      (1 to many correlation)
0101     25730     11
0101     74332     20
0102     03200    100
0103     02160     36
0103     74332     64

(Approx. 22% probability of correct code!)
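
The matrix above can be applied to aggregate counts, and it also suggests where the 22% figure may come from. The sketch below rests on an assumption: units under old code 0101 truly fall into the new codes in the proportions given by the weights, and each unit is assigned a new code at random with those same probabilities. Under that reading, the chance of a correct individual assignment is the sum of the squared shares. The function names and the input counts are illustrative.

```python
from collections import defaultdict

# Conversion matrix from the slide: (old code, new code, weight in percent).
CONVERSION_MATRIX = [
    ("0100", "01300", 100),
    ("0101", "01210", 26),
    ("0101", "01221", 14),
    ("0101", "01222", 29),
    ("0101", "25730", 11),
    ("0101", "74332", 20),
    ("0102", "03200", 100),
    ("0103", "02160", 36),
    ("0103", "74332", 64),
]


def convert_counts(counts_by_old_code):
    """Distribute aggregate counts over new codes in proportion to the weights."""
    result = defaultdict(float)
    for old, new, weight in CONVERSION_MATRIX:
        result[new] += counts_by_old_code.get(old, 0) * weight / 100
    return dict(result)


def probability_correct(old_code):
    """Chance that a randomly assigned new code is the true one (assumption:
    true codes and assigned codes both follow the matrix weights)."""
    shares = [w / 100 for old, _, w in CONVERSION_MATRIX if old == old_code]
    return sum(s * s for s in shares)


# Hypothetical aggregate counts by old code.
converted = convert_counts({"0101": 1000, "0103": 100})
print(converted["74332"])           # units arriving from both 0101 and 0103
print(probability_correct("0101"))  # roughly 0.22
```

The totals are preserved by the conversion, which is why the aggregate data remain usable even though many individual codes would be wrong.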

Missing Data

• Impute where possible

• Many different imputation methods are used. Two common methods are:

– Deductive Imputation

– Hot-deck Imputation
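
The two methods named above can be sketched as follows. This is a minimal illustration under stated assumptions: the variable names (total, full_time, part_time, size_class) are invented, the deductive rule assumes total = full_time + part_time, and the hot-deck shown is the simple sequential variant that copies the value from the most recent complete record in the same class.

```python
def deductive_impute(record):
    """Deduce one missing value from the identity total = full_time + part_time."""
    r = dict(record)
    keys = ("total", "full_time", "part_time")
    missing = [k for k in keys if r[k] is None]
    if len(missing) != 1:
        return r  # nothing to deduce, or too little information
    if missing[0] == "total":
        r["total"] = r["full_time"] + r["part_time"]
    elif missing[0] == "full_time":
        r["full_time"] = r["total"] - r["part_time"]
    else:
        r["part_time"] = r["total"] - r["full_time"]
    return r


def hot_deck_impute(records, variable, class_key):
    """Sequential hot-deck: fill a missing value with the value of the last
    complete record (the donor) seen in the same imputation class."""
    last_donor = {}
    imputed = []
    for rec in records:
        r = dict(rec)
        cls = r[class_key]
        if r[variable] is None:
            if cls in last_donor:
                r[variable] = last_donor[cls]
            # otherwise no donor is available yet and the value stays missing
        else:
            last_donor[cls] = r[variable]
        imputed.append(r)
    return imputed


# Invented records, for illustration only.
print(deductive_impute({"total": 10, "full_time": 7, "part_time": None}))
print(hot_deck_impute(
    [{"size_class": "small", "employment": 4},
     {"size_class": "small", "employment": None}],
    variable="employment", class_key="size_class"))
```

Other hot-deck variants choose the donor at random or by nearest neighbour within the class; the sequential form is used here only because it is deterministic and short.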


Case Study

• Eurostat have a project to develop enterprise demography

• They want to estimate the impact of enterprise births

• Employment of new enterprises is used, but this variable is often missing or unreliable for new units

Solutions

• Calculate turnover per head ratios to impute missing variables

• Ratios based on “similar” units by classification and size

• Outliers cause problems, so trimming is used, e.g. trimming x% of values or taking the mean of the inter-quartile range
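
A turnover-per-head imputation with inter-quartile trimming might look like the sketch below. The donor figures are invented, the function names are illustrative, and the choice of the "inclusive" quantile method from Python's statistics module is an assumption rather than the project's actual rule.

```python
import statistics


def interquartile_mean(values):
    """Mean of the values lying between the first and third quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    middle = [v for v in values if q1 <= v <= q3]
    return statistics.mean(middle)


def impute_employment(turnover, donors):
    """Impute employment for a new unit from its turnover.

    donors: (turnover, employment) pairs for "similar" units, i.e. units in
    the same classification and size group.
    """
    ratios = [t / e for t, e in donors if e > 0]
    typical_turnover_per_head = interquartile_mean(ratios)
    return turnover / typical_turnover_per_head


# Invented donor units; the last one is an outlier (turnover per head of 500).
donors = [(480, 10), (1000, 20), (520, 10), (1080, 20), (5000, 10)]
print(impute_employment(1040, donors))
```

Without trimming, the outlier would pull the average ratio far above the typical value and the imputed employment would be much too low.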


Timeliness

Two Issues

• Data arrive too late

• Data relate to a different time period


Data arrive too late

• Data from annual tax returns are often only available several months after the end of the tax year, so they are unsuitable for monthly or quarterly statistics

• Lags in registering “real world” events


Solutions

• Understand the length and impact of lags

• Adjust data accordingly

• Look for ways to reduce lags where possible


Different Time Periods

• Administrative reference period (e.g. Financial/tax year) may not be the same as the statistical reference period

• Monthly average versus point in time (e.g. employment data)

Different Time Periods

Statistical / Calendar Year

Financial Year 1: 146

Financial Year 2: 168

(0.25 x 146) + (0.75 x 168) = 162.5

(or more complex formulae)
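
The weighted sum on this slide can be written as a small function. The sketch assumes the variable is spread evenly over each financial year, so that a quarter of the calendar year is covered by Financial Year 1 and three quarters by Financial Year 2; the function and parameter names are illustrative.

```python
def calendar_year_estimate(fy1_value, fy2_value, share_from_fy1):
    """Estimate a calendar-year value from two overlapping financial years.

    share_from_fy1 is the fraction of the calendar year covered by the first
    financial year (0.25 in the slide's example).
    """
    if not 0 <= share_from_fy1 <= 1:
        raise ValueError("share_from_fy1 must be between 0 and 1")
    return share_from_fy1 * fy1_value + (1 - share_from_fy1) * fy2_value


print(calendar_year_estimate(146, 168, 0.25))  # 162.5, as on the slide
```

The "more complex formulae" mentioned on the slide would replace the even-spread assumption, for example with seasonal weights.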


Solutions

• Statistical corrections or estimations using data from other reference periods

• Be aware of possible biases when using point in time reference dates

Using data from different sources

• Data from different sources may not agree

• This may be due to:

– Different definitions, classifications, time periods, ...

– Errors


Solutions

• Data validation checks

• Benchmarking against other sources

• Priority rules for updating from different sources

• Knowledge of source quality

Priority Rules

• Different sources can be given different priorities for different variables

• To stop a “low priority” source overwriting a “high priority” one

– Use source codes

– Use priority / quality markers

– Store dates with variables

– Load data in reverse priority order
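
A priority rule of this kind can be sketched by storing a source code and a date alongside each value. Everything in the example is hypothetical: the source names, their ranking, and the rule that an equal-priority update must not be older than the stored value.

```python
from datetime import date

# Hypothetical ranking: a lower number means a higher priority.
SOURCE_PRIORITY = {"survey": 1, "vat": 2, "company_registration": 3}


def update_variable(unit, variable, value, source, as_of):
    """Update a register variable unless that would let a lower-priority
    (or equal-priority but older) source overwrite the stored value.

    Returns True if the update was applied, False if it was rejected.
    """
    current = unit.get(variable)
    if current is not None:
        new_rank = SOURCE_PRIORITY[source]
        old_rank = SOURCE_PRIORITY[current["source"]]
        if new_rank > old_rank:
            return False  # lower-priority source: keep the stored value
        if new_rank == old_rank and as_of < current["as_of"]:
            return False  # same source type, but older data
    unit[variable] = {"value": value, "source": source, "as_of": as_of}
    return True


unit = {}
update_variable(unit, "employment", 12, "vat", date(2015, 1, 1))
update_variable(unit, "employment", 15, "company_registration", date(2015, 6, 1))
print(unit["employment"]["value"])  # still 12: the lower-priority update was rejected
```

Loading sources in reverse priority order, as the last sub-bullet suggests, achieves a similar effect without the check, because the highest-priority source is written last.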

Resistance to Change

• Statisticians may resist the use of administrative data because they:

– Do not trust data unless they collect them themselves;

– Focus on negative quality aspects;

– Have an over-optimistic view of the quality of survey data;

– Assume survey respondents comply with statistical norms.

Solutions

• Education (courses like this!)

• Take a wider view of all the dimensions of quality, and focus on the impact on users

• Determine the real relative quality of survey and administrative data

• Identify how cost savings can be used to improve quality and / or increase outputs

Change Management

• Risk of changes in:

– Government / administrative policy

– Thresholds

– Definitions

– Coverage

– Systems


High-Risk Times

• Immediately after an election

• Change of minister

• Change of government policy

• Change in EU legislation

and….

• When you least expect it!

Solutions

Manage the risk by:

– Legal provisions

– Contractual agreements

– Regular contact with administrative colleagues

– Anticipating changes

– Contingency plans

Administrative Sources and Statistical Registers

Direct Use

• Simplest case - an administrative source is used directly as a sampling frame

• Easy and cheap

• Quality problems?

• Lack of control?


Indirect use (1)

• One or more administrative sources are used to construct a statistical register

• A statistical register can be a tool for integrating data from different sources


Indirect use (2)

• A statistical register can contain additional variables, e.g. from surveys, imputed, or calculated

• The statistical register is owned and controlled by statisticians

Multiple data sources

• Data from a single source

– no check on accuracy

• Data from several sources

– better view of the accuracy of the data

– increased range of variables available

– but how to deal with data conflicts?

Models for Creating and Maintaining Statistical Registers using Administrative Data

UK Business Register

[Diagram labels: Business Register; VAT; PAYE; Survey inputs; Geographic information systems; Company registrations; Dun and Bradstreet; Satellite registers]


Basic Registers in Sweden


Central Register

Cross-government projects to create central registers of person / business data to facilitate data sharing, e.g.

• Registers in Nordic Countries

• Australian Business Register

• UK Business Index

Satellite Register

A satellite register is a tool for incorporating administrative data that are only relevant for a sub-set of units in a statistical register

[Diagram labels: Statistical Register; Satellite Register]

Satellite registers may contain additional units, or variables, or both
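
The relationship between the two registers can be pictured as a keyed merge. In the sketch below both registers are plain dictionaries keyed by a unit identifier. The identifiers, variables and values are all invented, and a real register would of course live in a database rather than in memory.

```python
# Hypothetical statistical register: core variables for every unit.
statistical_register = {
    "U1": {"activity": "retail", "employment": 12},
    "U2": {"activity": "farming", "employment": 3},
}

# Hypothetical satellite register: extra variables for a sub-set of units
# (U2), plus a unit that is not in the statistical register at all (U9).
satellite_register = {
    "U2": {"farm_area_ha": 120},
    "U9": {"farm_area_ha": 15},
}


def combine(main, satellite):
    """Return a combined view: all units from both registers, with satellite
    variables added to matching units."""
    combined = {unit_id: dict(variables) for unit_id, variables in main.items()}
    for unit_id, extra in satellite.items():
        combined.setdefault(unit_id, {}).update(extra)
    return combined


view = combine(statistical_register, satellite_register)
print(view["U2"])    # core variables plus the satellite variable
print(sorted(view))  # includes the additional unit U9
```

Keeping the satellite data separate means the statistical register itself is left unchanged; the combined view is built only when it is needed.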