d efinitions : inconsistent, variable and subjective definitions based on the source … and new...

30
DATA QUALITY MANAGEMENT THE FUNDAMENTALS FOR TODAY’S SUCCESSFUL HEALTHCARE ENTERPRISE WVHIMSS FALL CONFERENCE NOVEMBER 14, 2014

Upload: horace-walters

Post on 22-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA QUALITY MANAGEMENT

THE FUNDAMENTALS FOR TODAY’S SUCCESSFUL HEALTHCARE ENTERPRISE

WVHIMSS FALL CONFERENCE

NOVEMBER 14, 2014

Page 2: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

AGENDA • DATA 101

• DATA 911

• DATA QUALITY EXPLAINED

• UAB HEALTH SYSTEM – A CASE STUDY

• LESSONS LEARNED AND CALL TO ACTION

• DISCUSSION

Page 3: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA 101

• HOW MUCH NEW DATA DOES YOUR ORGANIZATION CREATE IN A DAY? FROM WHICH SYSTEMS?

• WHERE DOES YOUR BUSINESS-CRITICAL DATA RESIDE ACROSS THE ENTERPRISE? WITH WHOM IS IT SHARED?

• DO YOU COLLECT AND STORE THE RIGHT INFORMATION TO SUPPORT THE ORGANIZATION?

• IS THE RIGHT DATA AVAILABLE TO SUPPORT TIMELY DECISION-MAKING?

• IS THE DATA CONSISTENT WITHIN AND ACROSS THE ENTERPRISE?

• CAN YOU GENERATE PERFORMANCE IMPROVEMENTS THROUGH DATA INSIGHTS?

Page 4: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA 911: WHY HEALTHCARE DATA IS ESPECIALLY COMPLEX AND DIFFICULT TO MANAGE• LOCATION: HEALTHCARE DATA TENDS

TO BE CREATED AND RESIDE IN MULTIPLE PLACES – DIFFERENT SOURCE SYSTEMS, DIFFERENT DEPARTMENTS, ON INDIVIDUAL DEVICES

• FORMAT: TEXT, NUMERIC, PAPER, DIGITAL, IMAGES, MULTIMEDIA, VIDEO…AND THE SAME DATA CAN EXIST IN DIFFERENT SYSTEMS IN DIFFERENT FORMATS

• STRUCTURE: STRUCTURED VS UNSTRUCTURED DESPITE BEST EFFORTS TO LEVERAGE THE EMR AS A PLATFORM FOR CONSISTENT DATA CAPTURE

• DEFINITIONS: INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE…AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING

• COMPLEXITY: CLAIMS DATA, CLINICAL DATA, MYRIAD VARIABLES RELATED TO AN AMALGAM OF SYSTEMS, SHIFTING BUSINESS RULES AND DEFINITIONS

• REGULATORY REQUIREMENTS: CMS REPORTING AND HIPAA ARE JUST THE BEGINNING AS THE SHIFT TO VALUE-BASED PURCHASING MODELS WILL LIKELY ADD TO THE REPORTING BURDEN

Page 5: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA 911:COMMON CULPRITS OF BAD DATA QUALITY• ARCHITECTURE AND APPLICATION COMPLEXITY: DIFFICULT TO TRACK HOW

CHANGES TO A SYSTEM FILTERS THROUGH AND AFFECTS OTHERS. MULTIPLE, OVERLAPPING APPLICATIONS, OFTEN WITH NON-INTEGRATED DATA SOURCES, CAN CREATE A DATA ENVIRONMENT HIGHLY SUSCEPTIBLE TO BAD DATA QUALITY.

• LACK OF OWNERSHIP AND RESPONSIBILITY FOR DATA QUALITY: WITHOUT OWNERSHIP OF DATA QUALITY AT EACH INTEGRATION POINT, THERE IS A POTENTIAL FOR ASSUMING GOOD DATA QUALITY WHEN IN REALITY IT IS VERY POOR.

• REPETITIVE OR AMBIGUOUS BUSINESS PROCESSES: CREATE REDUNDANCY IN HOW DATA QUALITY IS ADDRESSED, OFTEN RESULTING IN DATA QUALITY CONFLICTS.

• UNCLEAR AND MULTIPLE DEFINITIONS OF DATA ELEMENTS: MISMATCHED SYNTAX, FORMATS, AND STRUCTURE FROM DISPARATE DATA SOURCES REQUIRE CONSISTENT DEFINITIONS OF DATA ELEMENTS TO ENSURE CLEAR MAPPINGS ACROSS THE ENTERPRISE

• NO CLEARLY DEFINED DATA QUALITY ESCALATION PROCESS: WITHOUT CLEARLY DEFINED DATA QUALITY ESCALATION PROCESSES, KNOWN DATA QUALITY ISSUES MAY NEVER BE ADDRESSED.

Page 6: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

WHY DOES DATA QUALITY MATTER?

GOOD DATA IS KEY TO:

• EFFECTIVE DECISION-MAKING AT EVERY LEVEL: FROM THE BEDSIDE TO THE BOARDROOM

• STRATEGIC PLANNING

• CLINICAL QUALITY

• HEALTHCARE OPERATIONS AND FINANCIAL MANAGEMENT

• PUBLIC POLICY

• PUBLIC AWARENESS ABOUT FACTORS THAT AFFECT HEALTH

Page 7: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA QUALITY FRAMEWORK

• Discovery• Profiling• Analysis

Assess

• Cleansing• Enhancement• Consolidation

Remediate

• Measure and CompareMonitor

DATA QUALITY IS ONE COMPONENT OF A DATA MANAGEMENT PROGRAM

Page 8: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

ASSESSING DATA QUALITY

• DATA QUALITY IS A MULTI-DIMENSIONAL CONCEPT

• MEASURE. USABLE METRICS ARE REQUIRED TO DEFINE AND ASSESS QUALITY.

• DATA CAN BE ASSESSED BOTH OBJECTIVELY AND SUBJECTIVELY

• CONTEXT IS OFTEN IMPORTANT WHEN ASSESSING QUALITY

• QUALITY IS DEFINED BY THE INSTITUTION AND DOES NOT NECESSARILY MEAN PERFECTION

Page 9: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DIMENSIONS OF DATA QUALITY

QUANTITATIVE DIMENSIONS

• ACCURACY

• CONSISTENCY

• UNIQUENESS

• LINEAGE

• VOLUME

• COMPLETENESS*

• TIMELINESS*

QUALITATIVE DIMENSIONS

• COMPREHENSION

• RELEVANCE

• TRUST

• OBJECTIVITY

OTHERS COMMON DIMENSIONS

• ACCESS

• SECURITY

• INTERPRETABILITY

• REPUTATION

• EASE OF MANIPULATION

• MAINTAINABILITY

• RATE OF DECAY

Page 10: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

UAB HEALTH SYSTEMA CASE STUDY IN DATA QUALITY

Page 11: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

PROVIDER DATA QUALITY PROVIDING MULTIPLE CHALLENGES TO THE ORGANIZATION.

• DUPLICATE DATA

• MISSING DATA

• INACCURATE DATA

Page 12: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

UAB PROFILE

• LARGE ACADEMIC HEALTH SYSTEM IN CENTRAL ALABAMA.

• HEALTH SYSTEM COMPRISED OF MULTIPLE LEGAL AND OPERATIONAL ENTITIES

• TWO PRIMARY IT ORGANIZATIONS + DEPARTMENTAL• HEALTH SYSTEM INFORMATION

SERVICES (HSIS)

• MANAGEMENT SERVICES ORGANIZATION (MSO)

Page 13: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DISCOVERY

• DEFINE SCOPE OF DATA

• ORGANIZATIONS & INDIVIDUALS

• PHYSICIANS & MID-LEVEL PROVIDERS

• CREDENTIALED

• REFERRING

• RESIDENTS

• AMBASSADORS

WHAT IS “PROVIDER” DATA?

Page 14: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DISCOVERY

• HSIS – CERNER, HEALTHQUEST, LMS, CUSTOM DATABASES

• MSO – CENTRICITY

• MDSO – HEALTHQUEST, CUSTOM

• MARKETING - CUSTOM

• PHYSICIAN SERVICES – HEALTHQUEST, CUSTOM

• GME, CUSTOM

• HIM – TRANSCRIPTION

• OTHERS…

WHERE IS THE DATA?

Page 15: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DISCOVERY

WHERE DOES THE DATA COME FROM? WHERE DOES IT GO?

• PROVIDERS ARE PROVISIONED DEPARTMENTALLY

• ADDITIONS AND MODIFICATIONS ARE MANUALLY ENTERED INTO ELECTRONIC SYSTEMS AND DISTRIBUTED VIA EMAIL TO INDIVIDUALS OR GROUPS

• PROVISIONED: MSO, MDSO, GME, LAB, RAD, HIM, MARKETING, AMBULATORY CLINICS, HSIS, …

Page 16: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

PROFILING

• AN ALGORITHMIC ASSESSMENT OF DATA ELEMENTS, DATA RELATIONSHIPS, AND COMPLIANCE TO BUSINESS RULES.

• MEASURE AND SLICE AND DICE DATA TO IDENTIFY ANOMALIES

WHAT IS DATA PROFILING?

Page 17: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA PROFILINGDATASET STATISTICS

• 28K ACTIVE PROVIDER RECORDS

• 2.5K DUPLICATE PROVIDER RECORDS IDENTIFIED

• 3.5K RECORDS TO DEACTIVATE

• ~80% RECORDS WITH 1+ QUALITY ISSUES

Page 18: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

PROFILING A DATA ELEMENTEXAMPLE

DATA PROFILE: NATIONAL PROVIDER IDENTIFIER (NPI)

WHY DO WE CARE?

• SYSTEMS INTEGRATION

• BILLING IMPACT

WHAT WOULD WE EXPECT TO SEE?

• TYPICALLY A UNIQUE, 10 BYTE NUMERIC VALUE

Page 19: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA PROFILING EXAMPLE (COLUMN)NATIONAL PROVIDER IDENTIFIER

Type Count %

Null 4,576 19.31%

Non-Null 19,118 80.69%

Duplicates 69 .29%

Distinct 19,049

80.40%

Non-Unique 69 .29%

Unique 18,980

80.10%

Null; 19.31%

Duplicates; 0.29%

Distinct; 80.40%

CountsBasic Counts

Data Type: STRING

Why are there duplicates?

Should there be Nulls?

The NPI is a numerical value. Identified as a String?

Page 20: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA PROFILING EXAMPLENATIONAL PROVIDER IDENTIFIER

Value Count %

NULL 4,576 19.31

DDDDDDDDDD

17,117 72.24

LDDDDDDDDD 2,000 8.44

DDDDDDDDD 1 0.00

Pattern Frequency

NPI is a 10 digit identifier. Inconsistent findings

Type Value Frequency

Min Value 1000004138 1

Median Value 1558551127 1

Max Value T000199999 1

Type Value

Min Length 9

Median Length 10

Average Length 10

Maximum Length 10

Additional Counts

Page 21: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA PROFILING EXAMPLENATIONAL PROVIDER IDENTIFIER

WHAT DID WE FIND?• DISCOVERED UNKNOWN TEMPORARY “SPARSE PROVIDER”

PROCESS RESULTING (AND IDENTIFIED BY “T” PREFIX ON NPI NUMBER.

• 1 INVALID VALUE

• NULLS EXPLAINED AND EXPECTED TO BE REMEDIATED OVER TIME.

Page 22: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

DATA PROFILING EXAMPLECOMMON PROFILING ANALYSIS

Average Value Mean Value Median Value Range Analysis

Sparseness Cardinality Uniqueness Value Distribution

Value Absence Minimum Value Maximum Value Standard Deviation

Type Determination

Format Identification

Overloading Semantic Variance Analysis

Key Analysis Dependency Analysis

Orphaned Records

Redundancy Analysis

Integrity Analysis

Page 23: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

ANALYSIS

WHAT WAS LEARNED?

• MULTIPLE SOURCES OF DATA PROVISIONING

• VARIED PROCESSES SUPPORT PROVISIONING

• LACK OF AUTOMATION AND USE OF EMAIL AS WORKFLOW

• ONGOING INDEPENDENT DATA QUALITY INITIATIVES

Page 24: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

ANALYSIS

WHAT WAS LEARNED?

• ABSENCE OF DATA GOVERNANCE

• UNDEFINED DATA OWNERSHIP AND STEWARDSHIP

• INCONSISTENT DATA SEMANTICS

• INCONSISTENT POLICIES

• UNDOCUMENTED/UNKNOWN PROCEDURES

• BETTER COMPLIANCE AWARENESS NECESSARY

• NO MEASUREMENTS OR MONITORING

Page 25: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

ANALYSIS

WHAT WAS LEARNED?

• DEMOGRAPHICS ACCURACY OPPORTUNITIES

• INCONSISTENCIES IN DATA BETWEEN SYSTEMS

• TRUST ISSUES

• INADEQUATE SYSTEM & PROCESS INTEGRATION

• LACK OF ENTERPRISE DATA SEMANTICS

• LEGACY REQUIREMENTS RESULTING IN DATA HOARDING

Page 26: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

ONGOING REMEDIATION

• IT STEWARDSHIP INITIATIVES:

• ACQUISITION OF NEXTGATE MDM SOLUTION

• DATA CLEANUP

• DE-DUPLICATION

• NORMALIZATION

• DEPARTMENTAL INITIATIVES (IT LED)

• WORKFLOW OPTIMIZATION

• INSTITUTE DATA GOVERNANCE (IT INFLUENCED)

• DEFINE OWNERSHIP

• DEFINE STEWARDSHIP

• DEFINE ENTERPRISE SEMANTICS

• SET GOALS, MEASURE, REVIEW, AND REACT

DATA MANAGEMENT

IS AN ORGANIZATIONA

L ISSUE

Page 27: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

BEST PRACTICES IN DATA QUALITY MANAGEMENT

• UNDERSTANDING OF OVERALL BUSINESS OBJECTIVE – CLINICAL, OPERATIONAL, FINANCIAL. USE EXISTING OR UPCOMING INITIATIVE TO INCUBATE PROGRAM.

• ANALYSIS OF DATA “TOXICITY” – PRECISELY WHY IS YOUR DATA DIRTY?

• TAXONOMY/DATA DICTIONARY FRAMEWORK TO FACILITATE CREATION AND MAINTENANCE OF CLEAN DATA

• CONCERTED EFFORT TO CLEANSE AND ENRICH DATA ENTERPRISE WIDE

• SOFTWARE PLATFORM OR REPOSITORY TO HOUSE CLEAN DATA

• GOVERNANCE AND STEWARDSHIP AROUND DATA MAINTENANCE

Page 28: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

HOW TO DELIVER DATA QUALITY

• CREATE THE APPROPRIATE GOVERNANCE – DATA IS YOUR BIGGEST ASSET AND DESERVES TOP SPONSORSHIP

• TAKE A RISK-BASED APPROACH – FOCUS ON THE AREAS THAT POSE THE GREATEST RISK TO THE BUSINESS

• MAINTAIN BUSINESS BENEFIT – PERFECT DATA IS NOT THE AIM, ONLY DATA FIT FOR ITS PURPOSE

• PROACTIVE – ADDRESS KNOWN ISSUES THAT DERIVE FROM POOR DQ, INVESTIGATE AND ASSESS IMPACT ON BUSINESS STRATEGY

• ITERATIVE – FOCUS ON QUICK WINS, PROVE CONCEPT, FINE TUNE & REPLICATE

• INTEGRATED – PEOPLE, PROCESS AND TECHNOLOGY AND GOVERNANCE ALIGNED TO A COMMON GOAL

• SUSTAINABLE – ESTABLISH A LONG-TERM MODEL THAT CAN BE REPLICATED

• FLEXIBLE – FOCUS ON THE MOST PRESSING NEED

Page 29: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

WHERE TO START

• BE CAUTIOUS OF BIG DATA HYPE

• DO SOMETHING! PROFILE SOME DATA

• BE RESTLESSLY FIRM ON SCOPE

• CRAWL, WALK, RUN

• WITHOUT GOVERNANCE, IT’S JUST A CLEANUP

Page 30: D EFINITIONS : INCONSISTENT, VARIABLE AND SUBJECTIVE DEFINITIONS BASED ON THE SOURCE … AND NEW KNOWLEDGE KEEPS THIS TARGET MOVING D EFINITIONS : INCONSISTENT,

QUESTIONS?

Dan [email protected]

888.869.0984Immersivellc.com3411 High Cliff Road, Panama City, FL 32409

Brian BishopOperations and Interface ManagerUAB Health System Information [email protected]