a case study of icd-11 anatomy value set extraction from snomed ct guoqian jiang, phd ©2011 mfmer |...

24
A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics, Mayo Clinic College of Medicine ICBO 2011, Buffalo, NY July 29, 2011

Upload: dana-curtis

Post on 29-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT

Guoqian Jiang, PhD

©2011 MFMER | slide-1

Division of Biomedical Statistics & Informatics, Mayo Clinic College of MedicineICBO 2011, Buffalo, NYJuly 29, 2011

Page 2: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Acknowledgements

• Harold R. Solbrig

• Mayo Clinic

• Robert J.G. Chalmers

• University of Manchester, Manchester, U.K

• Kent Spackman

• International Health Terminology Standards Development Organisation, Copenhagen, Denmark

• Alan L. Rector

• University of Manchester, Manchester, U.K

• Christopher G. Chute, MD, Dr.PH

• Mayo Clinic

©2011 MFMER | slide-2

Page 3: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

©2011 MFMER | slide-3

Motivation

• The 11th revision of the International Classification of Diseases (ICD) was officially launched by the World Health Organization (WHO) in March 2007.

• The WHO has sought to reuse existing ontologies such as SNOMED CT for value set definition.

• One of the core value sets being developed is for anatomical site, defined by WHO as “the most specific level of the topographical location or the anatomical structure where the health-related problem can be found relevant to the condition”.

Page 4: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Value Sets

• A value set is a uniquely identifiable set of valid values that can be resolved at a given point in time to an exact collection of codes.

• Typically, value sets can be drawn from pre-existing coding schemes such as SNOMED CT by constraining the value selection based on a logical expression (e.g. all sub-codes of the code “breast cancer”).

©2011 MFMER | slide-4

Page 5: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Clinically meaningful value sets

• Generating clinically meaningful value sets in a (semi-) automatic way from a terminology/ontology service has been challenging for the community.

• A number of research and standardization efforts

• HL7 Common Terminology Services II (CTS2) specification,

• Mayo’s LexEVS 6.0 implementation on a value set definition service, and

• Manchester’s new OWL API 3 which contains a set of modularization tools.

©2011 MFMER | slide-5

Page 6: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Objectives of the study

• We perform a case study of ICD-11 anatomy value set extraction from SNOMED CT.

• We propose four semi-automatic value set extraction strategies based on different clinical context patterns.

• We evaluate the strategies and discuss their implications in terms of domain coverage, granularity and clinical usefulness from both technical and clinical perspectives

©2011 MFMER | slide-6

Page 7: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

ICD-11 Content Model

• To present the knowledge that underlies the definitions of an ICD entity.

• For example, we may choose a SNOMED CT concept “74181007 Myocardium structure (body structure)” as the value of the parameter “Body Structure” for the ICD category “Acute myocardial infarction”.

©2011 MFMER | slide-7

Page 8: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

SNOMED CT Structure-Entire-Parts (SEP)

• SEP triples are a means to avoid the use of transitive properties and to make clear which disorders and procedures apply to an entire structure and which to the structure and/or its parts.

©2011 MFMER | slide-8

Page 9: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

SNOMED CT Clinical Finding Concept Model

©2011 MFMER | slide-9

Page 10: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Materials

• The topographical term mappings from SNOMED CT to ICD-O Topography provided by the 20100731 International Release of SNOMED CT;

• A subset of SNOMED CT anatomy “base terms” extracted by IHTSDO (to provide familiar names to end users);

• A subset of all “Clinical Finding” concepts extracted from SNOMED CT stated form in Web Ontology Language (OWL);

• The entries from a download of the UMLS CORE Problem List Subset of SNOMED CT.

©2011 MFMER | slide-10

Page 11: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Methods

• We took advantage of a Manchester SNOMED module extraction API and developed an extension as a Protégé 4.1 plugin.

• In this way, we were able to load the SNOMED CT stated form as an OWL ontology into the Protégé 4.1 platform for module extraction.

©2011 MFMER | slide-11

Page 12: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

SNOMED CT Module Extraction Tool as a Protégé 4.1 Plug-in

©2011 MFMER | slide-12

Page 13: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Module Extraction

• Input (Anatomy/CF)• Control

• BaseTerm

• ClinicalFinding

• ProblemList

• Output (Anatomy)• Control

• BaseTerm

• ClinicalFinding

• ProblemList

©2011 MFMER | slide-13

* Control - The topographical term mappings from SNOMED CT to ICD-O

Page 14: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Evaluation Measures (I)

• Domain coverage

• 287 ICD-O topographical categories as domain anchors

• The ratio of the number of categories containing mappings over total number of categories (i.e. the 287 categories).

©2011 MFMER | slide-14

Page 15: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Evaluation Measures (II)

• Module granularity

• General granularity

• simply by the ratio of the number of concept IDs in the module over the number of concept IDs in a control module (i.e. the module of the first pattern).

• Adjusted granularity

• by the average ratio of the number of concept IDs in each of 287 categories in a module over that of the control module.

©2011 MFMER | slide-15

Page 16: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Evaluation Measures (III)

• Clinical usefulness of the module• We chose two categories out of the 287 ICD-O

categories “C44.3 skin of face” and “C44.5 Skin of trunk” in the dermatology domain.

• One of the authors (RC, a dermatology physician) reviewed the SNOMED CT mapping concepts to the two categories and marked those that should be considered as part of ICD-11 anatomy concepts.

• We measured the clinical usefulness by the ratio of the number of concepts checked (by RC’s ratings) over the number of total concepts in the two categories.

©2011 MFMER | slide-16

Page 17: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Results

©2011 MFMER | slide-17

Number of concept IDs in each module and their distribution

Page 18: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Evaluation results of domain coverage and granularity for four modules

©2011 MFMER | slide-18

Page 19: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Evaluation results of clinical usefulness

©2011 MFMER | slide-19

C44.5 Skin of trunk

C44.3 Skin of face

Page 20: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Discussion

• The strategy used for the module ClinicalFinding would be a good starting point for the ICD-11 anatomy use case.

• The module has good domain coverage

• while keeping a relatively small size and better outcome on clinical usefulness.

©2011 MFMER | slide-20

Page 21: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Discussion

• The upper level of the SNOMED CT anatomy may create some confusion because it has three main branches:

• 1) Anatomical structure, i.e. the normal anatomy,

• 2) Acquired body structure - mostly the results of surgery plus “scar (morphologic abnormality)”, and

• 3) Body structure altered from its original anatomic structure (morphologic abnormality) – the results of disease or repair.

• Given these different types of body structure, the question really is what is needed for ICD-11 development.

©2011 MFMER | slide-21

Page 22: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

Discussion

• The Protégé based OWL module extraction tool we developed in this study has been demonstrated very useful for achieving our goal.

• While we mainly use an external signature file for module extraction in this study, we have extended the tool and integrated it with the Protégé DL Query plugin, by which a signature can be defined through a semantic query which invokes a DL (description logic)-based reasoner.

• We consider that this provides a powerful and ease to use feature to define and extract a domain specific value set.

©2011 MFMER | slide-22

Page 23: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

In summary

• We performed a case study for ICD-11 anatomy value set extraction from SNOMED CT.

• We proposed four different clinical context patterns for the purpose of generating clinically meaningful value sets for ICD-11 anatomy.

• We evaluated the value sets in terms of domain coverage, granularity and clinical usefulness by defining quantitative measures, which provide effective metrics for helping us to select an approach for satisfying the ICD-11 anatomy use case.

•  

©2011 MFMER | slide-23

Page 24: A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,

©2011 MFMER | slide-24

Questions