ida sim, md, phd associate professor of medicine

43
Copyright 2007, Ida Sim Ida Sim, MD, PhD Associate Professor of Medicine Director, Center for Clinical and Translational Informatics University of California, San Francisco, CA Supported by The Trial Bank Project R01-LM-06780 National Center for Biomedical Ontology U54 HG004028-01 RCT Schema The Trial Bank Project

Upload: kiona-cantrell

Post on 02-Jan-2016

48 views

Category:

Documents


4 download

DESCRIPTION

RCT Schema The Trial Bank Project. Ida Sim, MD, PhD Associate Professor of Medicine Director, Center for Clinical and Translational Informatics University of California, San Francisco, CA Supported by The Trial Bank Project R01-LM-06780 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Ida Sim, MD, PhDAssociate Professor of Medicine

Director, Center for Clinical and Translational InformaticsUniversity of California, San Francisco, CA

Supported by The Trial Bank Project R01-LM-06780

National Center for Biomedical Ontology U54 HG004028-01

RCT SchemaThe Trial Bank Project

Page 2: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Outline

• Background• RCT Schema

» modeling approach» the class structure» evaluation

• Relationship to CTO • Summary

Page 3: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Goals of the CTO

• (1) fully and faithfully capture the types of entities and relationships involved in clinical trials

• (2) comprehend terms like: cohort, randomization, placebo, etc., including ... statistical terms and terms for ... meta-analysis;

• (3) organize these terms in a structured way, providing definitions and logical relations designed to enhance retrieval of, reasoning with, and integration of the data annotated in its terms

• ... • (6) draw on and seek maximal alignment with existing clinical

trial ontologies, including:» RCT Schema ontology used by theTrial Bank Project

Page 4: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Major Axes for Aligning Models• Domain of clinical trials

» design: (non-)randomized, crossover, cluster randomized, factorial...» objective: interventional, diagnostic, preventive...» clinical domain: drugs, procedures, organizational change...

• Task» trial design, execution, reporting, analysis, application» for individual trials, sets of clinically related trials

• Purpose (application vs. domain ontology)» to support accomplishment of domain task(s)» to define shared meaning for “integration of the data annotated in its

terms”

Page 5: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Systematic Review

Decision Models

Guidelines Electronic Patient Record

Trial Execution

Trial Interpretation

Trial Application

Regulatory Reporting Trial Conduct Trial Design

Feedback to Trial Design

Trial Bank

Scientific Reporting Trial Registration

Trial Tasks

Page 6: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Trial Bank Definition

• Computable repository of RCT information sufficiently detailed to support scientific analysis for » designing future clinical trials» evidence-based practice and policy making

• Detailed information on » study design» study execution» summary and individual participant-level results

• Trial Bank is NOT for running a trial

Page 7: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Systematic Review

Decision Models

Guidelines Electronic Patient Record

Trial Execution

Trial Interpretation

Trial Application

Regulatory Reporting Trial Conduct Trial Design

Feedback to Trial Design

Trial Bank

Scientific Reporting Trial Registration

Trial Bank Target Uses

Page 8: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Trial Bank Software• RCT Bank built on RCT Schema

» Ocelot frame-based ontology

• Bank-a-Trial» web-based program for trialists to enter trial instances into RCT Bank» clinical descriptions of trial features (slot values) are in UMLS

• RCT Presenter» web-based browser of individual trials

Bank-a-Trial RCT Presenter

RCT Bank

RCT Schema

Page 9: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Major Axes for RCT Schema• Domain of clinical trials

» design: (non-)randomized, crossover, cluster randomized, factorial...» objective: interventional, diagnostic, preventive...» clinical domain: drugs, procedures, organizational change...

• Task» trial design, execution, reporting, analysis, application» for individual trials, sets of clinically related trials

• Purpose» to support accomplishment of domain task(s) [application ontology]

» to define shared meaning for “integration of the data annotated in its terms”

Page 10: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Outline

• Background• RCT Schema

» modeling approach» the class structure» evaluation

• Relationship to CTO • Summary

Page 11: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Entity Specification Problem

• What RCT aspects to model in RCT Schema? What not to model?» multiple users (e.g., trialists, systematic reviewers)» multiple tasks (e.g., analysis, interpretation)

– multiple methods» no one correct RCT ontology

• Need principled, systematic approach» to specifying, documenting, and evaluating

Page 12: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Competency Decomposition Method• To define the entities that must be in a conceptual

model• General approach

» specify a task hierarchy of target tasks and subtasks» specify methods for each task» specify entities required for completing each task

using each method• Generates a specification of required entities

» the information requirements for the competencies (tasks and subtasks) that the model/knowledge base is to support

(Sim, et al, JBI 2004: 37(2):108-119)

Page 13: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Systematic Review

Decision Models

Guidelines Electronic Patient Record

Trial Execution

Trial Interpretation

Trial Application

Regulatory Reporting Trial Conduct Trial Design

Feedback to Trial Design

RCT Bank

Scientific Reporting Trial Registration

Target Task for RCT Bank• Using RCTs for trial design or clinical application requires synthesizing evidence across all trials on a topic

Page 14: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Systematic Reviewing

• Canonical method for synthesizing evidence across trials

• Major steps are» retrieve related RCTs (e.g., 32 trials of metformin for

diabetes)» analyze how comparable the trials are» statistically combine data if appropriate

– combining smaller trials increases statistical power to detect effects

Page 15: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Target Task = Systematic Review

• RCT Bank entities = sys. review information needs» identified all systematic review tasks

– review of literature and personal experience conducting 3 systematic reviews

» identified methods for completing these tasks» organized tasks and methods into a task hierarchy» derived RCT entities necessary and sufficient for each

(sub)task

Page 16: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Top-Level Sys Review Tasks• Trial retrieval• Trial critiquing

» judging internal validity » judging generalizability

• Meta-analysis of quantitative results» analysis of clinical and statistical heterogeneity

• Contextual interpretation» scientific, socio-economic, and ethical

http://rctbank.ucsf.edu/tasks/tasks.html

Page 17: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Judgment of Generalizability

• Were the people enrolled in the trial representative and unbiased?» were eligible patients randomly selected from the source

population?» were enrolled subjects a random subset of those

eligible?

• Are the trial subjects similar to mine? • Do I have the tested intervention available here?

Page 18: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Method-(In)dependent Entity Specification• Were the people enrolled in the trial representative

and unbiased?» were eligible patients randomly selected from the

source population?– method: no computable algorithm available

recruitment method» were enrolled patients a random subset of those

eligible?– method: using standard statistics

number and clinical characteristics of enrolled subjects number and clinical characteristics of eligible but non-

enrolled subjects

Page 19: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Results: Entity Requirements

Methods

High-levelTasks

SubTask I

SubTask II

SubTask III

Critiquing

ExternalValidity

13

4

. . .

. . .

InternalValidity

11

39. . .

. . .

QuantitativeSynthesis

4

7

. . .

. . .

Retrieval

1

1

2

2

ContextualInterpretation

3. . . . .

9. . .

2

1

. . .

22. . .

112 29 30171 Unique Entities Required

(n=35)

(n=74)

Page 20: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Entity Specification Evaluation

• Evaluated match between » the 171 information items» 388 requirements in 18 published trial-critiquing

instruments

• Results» entity specification is comprehensive» entity specification is reasonable

Sim, et al, KR-MED 2004; JBI 2004: 37(2):108-119

Page 21: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Benefits of Approach• Task hierarchy understandable by domain experts

» identifies which information items are required for which tasks

• Provides an evaluation “yardstick”» if an ontology contains all the information requirements for a

task– then is it “competent” for that task

» can evaluate and compare application ontologies

• Documents an (application/domain) ontology» states which tasks an ontology is competent for, and why» cross-indexes tasks and entities in the ontology

Page 22: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Outline

• Background• RCT Schema

» modeling approach» the class structure» evaluation

• Relationship to CTO • Summary

Page 23: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Implementation of 171 Items• Purpose of RCT Bank KB is to support scientific analysis of

trial evidence» needed a “data-schema” or “instance-style” ontology

• RCT Schema implemented the 171 information items in a frame-based ontology

Bank-a-Trial RCT Presenter

RCT Bank

RCT Schema

Page 24: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

RCT Schema Ontology

• Ocelot frame-based model» 7 levels deep

• 192 frames, 607 unique slots» avg. 9.8 slots/frame» 3 frames (1.6%) have multiple parents» 193/607 slots (32%) take other frames as values

• Available at http://rctbank.ucsf.edu/

Page 25: Ida Sim, MD, PhD Associate Professor of Medicine

• RCT Schema displayed in GKB Editor» classes are red boxes» instances are blue boxes

• Class hierarchy organizes entities as » trial concepts» trial descriptions (details)

• Not fully compliant with “Werner’s Rules”

Page 26: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Includes IS-A Hierarchies

Page 27: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Instantiating RCT Schema

• Clinical content described by terms from a clinical vocabulary, e.g., » for a breast cancer trial, instance of BASELINE-

CHARACTERISTIC is described by– term “menopause” from UMLS preferred term– the UMLS CUI

» Trial Bank software supports any vocabulary in UMLS (e.g., SNOMED)

• Each trial is a collection of instances of classes

Page 28: Ida Sim, MD, PhD Associate Professor of Medicine

• 518-TRIAL» 518-

BACKGROUND-DETAILS

» 518-ADMIN-DETAILS

» 518-EXECUTED-PROTOCOL– 518-ALL-SUBJECTS– 518-PRIMARY-

OUTCOME-1– etc.

» 518-ERRATUM» 518-CONCLUSION-

DETAILS

Page 29: Ida Sim, MD, PhD Associate Professor of Medicine

• 518-PRIMARY-OUTCOME-1 (e.g., all-cause mortality)» 518-STAT-ANALYSIS-AND-RESULTS-1 (e.g. t-test)

–518-ALL-COMPARISONS-AT-TIME-X-1 (e.g., at 6 months) 518-SINGLE-TIME-X-COMPARISON-1 (e.g., between

PCI and thrombolysis groups)

Page 30: Ida Sim, MD, PhD Associate Professor of Medicine

• 518-SINGLE-TIME-X-COMPARISON-1 » datapoint for PCI group

– numerator (all-cause deaths at 6 months)– denominator (had all-cause death outcome assessed at 6 months)– 518-STUDY-ARM-POPULATION-1 (the PCI group)

» datapoint for thrombolysis group» summary odds ratio under intention-to-treat analysis» summary odds ratio under efficacy analysis

Page 31: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Outline

• Background• RCT Schema

» modeling approach» the class structure» evaluation

• Relationship to CTO• Summary

Page 32: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Characteristic Examples

Clinical domains Cardiology, Radiology, Geriatrics, etc. Intervention Types

Procedures (thrombolysis), Single and Multi-step Drugs (aspirin, warfarin), Counseling, Multiple interventions in one arm

Outcome Types Dichotomous, continuous, univariate, multivariate, survival, regression, scored instruments (e.g., Wechsel Memory Scale)

Result Types Intention-to-treat, efficacy analysis, subgroup analyses

Expressivity Evaluation

• Captured 17 full and 20+ partial trials

Page 33: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Modeling Challenges Met

• Multi-armed, crossover, cluster randomized studies• Many variations of patient drop-out, loss to followup

(e.g., excluded after randomization)• For each outcome, the # of subjects assessed at each

timepoint in each subgroup• Blinding efficacy: did subjects know which arm they

were assigned to?• etc.

Page 34: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Modular, Extensible• Extensions possible with only minimal changes to 192

existing classes

• Extensible to new clinical domains (e.g., genomics) via clinical vocabularies» no clinical terms in RCT Schema

Modeling Extension

New Classes

New Slots

Old Classes Changed

Cluster randomized trials 2 10 3

Complex intervention regimens 4 8 3

Computable eligibilty rules ? ? 3

Page 35: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Limitations of Representation• Modeled, not yet tested

» participant-level data» factorial designs» designs with run-in and washout periods

• In development» computable eligibility rules

• Not yet modeled» genomic data » nested subgroups» secondary studies (e.g,. followup studies)

Page 36: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Trial Bank Publishing

• How to get trials into RCT Bank?• Collaborated with JAMA and Ann Int Med to explore co-

publishing trials as articles and RCT Bank entries» authors submit manuscripts for peer review as usual » trial-bank staff enter accepted trials into trial bank» co-published 14 trials in RCT Presenter

• Evaluation» 83 respondents evaluated a trial using both RCT

Presenter and the Journal Article– mostly trialists and meta-analysts

Page 37: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

RCT Presenter Evaluation

85 8165

85

15

73

42 46 42 50

12 1227

12

65

23

3139 50 39

0102030405060708090

100

% R

esp

ondents

Presenter Better Same as Article

• 70% of respondents rated RCT Presenter as good as or better than the Journal Article for all attributes

N=30

Page 38: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Outline

• Background• RCT Schema

» modeling approach» the class structure» evaluation

• Relationship to CTO • Summary

Page 39: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Alignment to CTO

• Have participated in CTO working group from inception (Simona Carini)

• Contributed RCT Schema classes and definitions to list of terms for consideration

• Contributed to draft of high-level concept hierarchy– http://www.bioontology.org/wiki/index.php/High-

level_Concepts_v0.2

• Contributed to creation of the draft CTO presented this morning

Page 40: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

How Would Trial Bank Use CTO?• Use CTO as common index into RCT Bank and into

other clinical trial data and information systems • Map RCT Schema class and slot names to terms in

CTO» so that RCT Bank instances can be made available to

machines and humans who wish to – query– reason, or – integrate

» clinical trial information using CTO» e.g., trials co-published with PLoS, etc. into RCT

Bank

Page 41: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Example Use (NCBO)

• Map class names from any trial bank to terms in CTO» e.g., in RCT Bank or “European Trial Bank”

– PRIMARY-OUTCOME to CTO term for this– BASELINE-CHARACTERISTIC to CTO term for this

• CTeXplorer an ontology-driven tool for visualizing complex design differences across trials [MA Storey, et al UVic, Canada]

» given variable HgbA1C from any trial bank» would know how to handle and display if annotated as

– PRIMARY-OUTCOME [CTO] or – BASELINE-CHARACTERISTIC [CTO]

Page 42: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

CTO for Integration with Trial Bank Collaborators...• National Center for Biomedical Ontology• Electronic Primary Care Research Network

» Primary Care Research Object Model• Global Trial Bank, with AMIA

» trial-bank publishing» “dedicated to assuring the implementation and maintenance

of an open global infrastructure for computable clinical trial results information”

• European Clinical Trial Data Repository» submitted to Framework Programme 7

• with BRIDG, clinical trial management systems, etc?

Page 43: Ida Sim, MD, PhD Associate Professor of Medicine

Copyright 2007, Ida Sim

Summary• Trial banks are computable repositories of trial information

for analysis, interpretation, and application of trial evidence to research and care

• We specified the entities for RCT Bank using competency decomposition method

• RCT Schema implemented entities specification as an “instance-style” frame-based ontology» expressive, extensible, useful for reporting and interpretation

• Mapping of RCT Schema terms to CTO terms makes RCT Bank instances available for CTO-driven integration and reasoning