A Semantic Web Approach to Integrative Biosurveillance Narendra Kunapareddy, UTHSC Zhe Wu, Ph.D., Oracle


Post on 04-Jun-2020


Page 1: A Semantic Web Approach to Integrative Biosurveillance...2- Jena from HP as API for Semantic Web 3- Eclipse Java Development Environment 4- Oracle Semantic Data Management (Started

A Semantic Web Approach to Integrative Biosurveillance

Narendra Kunapareddy, UTHSC
Zhe Wu, Ph.D., Oracle


This talk:
• Translational BioInformatics and Information Integration Dilemma
• Case Study: Public Health Preparedness
• Our Vision
• Our Implementation
• Challenges


Information Integration Dilemma

(Diagram; labels as recovered from the slide)
• Structural Heterogeneity: EMR, CPOE, Charts; Clinical Data, Genomics, Diagnostics
• Semantic Disparity; Standards and Frameworks: NHIN, HL7, UMLS
• Distributed Collaboration: Epidemiologist, Clinician, Basic Scientist
• Change in time: Schema Change, Semantic Drift, Framework Update; Dynamic Environment, New Hypothesis, Governance Planning
• Repurpose and Re-use: Translational Research, Protocols & Policies, Public Health
• Governance & Protection: Security, Access, Provenance


Public Health Preparedness


SNOMED NNDS

LOINC PHIN


Context is important


State of the art


An elephant with 2 trunks and 5 legs!!!


The Solution Framework

• Resource Description Framework (RDF), to enable unified information representation
  – Integrative, transdisciplinary, agile, collaborative
• Ontologies (OWL) and computer reasoning with description logics (OWL-DL), to enable knowledge representation and reasoning
  – Context aware, knowledge based, agile, transdisciplinary, collaborative
• Service-Oriented Architecture (SOA), for dynamic interoperability and reuse
  – Agile, interoperable, collaborative and distributed
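As a rough illustration of the first two framework items, the sketch below holds facts from two unrelated sources as subject-predicate-object triples and answers a class query by following subclass links. It is a minimal, library-free sketch: every identifier (`ex:Asthma`, `ex:station7`, `ex:ozonePpb`, and so on) is hypothetical and does not come from the talk, and a real deployment would use an RDF store and an OWL reasoner rather than Python sets.

```python
# Hypothetical vocabulary; none of these identifiers come from the talk.
SUBCLASS_OF = "rdfs:subClassOf"
TYPE = "rdf:type"

# One graph holds ontology statements and facts from different feeds.
triples = {
    ("ex:Asthma", SUBCLASS_OF, "ex:RespiratoryCondition"),
    ("ex:RespiratoryCondition", SUBCLASS_OF, "ex:ClinicalFinding"),
    ("ex:visit1", TYPE, "ex:Asthma"),      # e.g. from an ED record
    ("ex:station7", "ex:ozonePpb", "82"),  # e.g. from an air-quality feed
}


def superclasses(cls, graph):
    """Return every class reachable from cls via rdfs:subClassOf."""
    found, stack = set(), [cls]
    while stack:
        current = stack.pop()
        for s, p, o in graph:
            if s == current and p == SUBCLASS_OF and o not in found:
                found.add(o)
                stack.append(o)
    return found


def instances_of(cls, graph):
    """Return subjects typed as cls, directly or through a subclass."""
    result = set()
    for s, p, o in graph:
        if p == TYPE and (o == cls or cls in superclasses(o, graph)):
            result.add(s)
    return result
```

Calling `instances_of("ex:ClinicalFinding", triples)` returns the visit even though it was only asserted to be an asthma case, which is the kind of inference the ontology layer is meant to provide.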


Dimensions of SARA

(Diagram labels: Ontologies, Capability Cases, Web Services)


Data Sources - 1

• Triage Data
  – Patient Demographics (Age, Ethnicity, Gender)
  – Vital Signs (T, RR, PR, PO2)
  – Chief Complaints
• Nurse Notes
  – Vital Signs
  – Complete Review of Systems: General, Respiratory, Neurological, Gastrointestinal, Dermatological, etc.
  – Past Medical and Surgical Hx
  – Medications, Past Medications, Home Medications
  – Interventions, Procedures
  – Outcome
  – Discharge and Disposition

Data Sources - 2

• From 8 community hospitals and 16 different IT implementations
• Structured, semi-structured, and unstructured entries
• Automated submissions through HTTP
• Accounts for about 30% of Houston ED visits
• Data transmission every 10 minutes or less
• Over 250,000 concepts, 82 million instances and growing
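With 16 different IT implementations feeding the system, the same triage field can arrive under different names. The sketch below shows one way such records might be normalized into a common set of predicates before loading. The source names, field names, and `ex:` predicates are all invented for illustration; the talk does not describe its actual mapping.

```python
# Hypothetical per-source field maps; the real feeds are not described in the talk.
FIELD_MAPS = {
    "hospital_a": {"pt_age": "ex:age", "cc": "ex:chiefComplaint", "temp": "ex:temperature"},
    "hospital_b": {"Age": "ex:age", "ChiefComplaint": "ex:chiefComplaint", "T": "ex:temperature"},
}


def to_triples(source, visit_id, record):
    """Map one source-specific record onto shared predicates.

    Fields without a mapping and empty values are skipped.
    """
    mapping = FIELD_MAPS[source]
    subject = f"ex:visit/{visit_id}"
    triples = []
    for field, value in record.items():
        predicate = mapping.get(field)
        if predicate is not None and value not in (None, ""):
            triples.append((subject, predicate, str(value)))
    return sorted(triples)
```

Two records that describe the same visit in different layouts produce identical triples, so downstream queries do not need to know which hospital system sent the data.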


Data Sources - 3

Texas Commission on Environmental Quality (TCEQ)

- Pollution Parameters
  – CO, SO2, H2S, NO, NO2, O3, TNMOC, CH4, ...
- Meteorological Parameters
  – Temperature (Outdoor, Dew Point)
  – Relative Humidity
  – Radiation (Solar, Ultraviolet, Net Radiation)
  – Barometric Pressure
  – Precipitation, …
- Chromatography Data
  – Ethane, Methylcyclopentane, 1,2,4-Trimethylbenzene, Ethylene, 2,4-

• From 18 locations, 2 sensors each
• Data transmission from TCEQ hourly
• 250 concepts in each message
• Air quality indices calculated twice daily
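The feed figures above give a rough sense of daily volume. The slide does not say whether a message is sent per location or per sensor, so the sketch below computes both cases; that choice, and the function name, are assumptions made only for this back-of-the-envelope estimate.

```python
LOCATIONS = 18
SENSORS_PER_LOCATION = 2
CONCEPTS_PER_MESSAGE = 250
MESSAGES_PER_DAY = 24  # one transmission per hour


def daily_concept_values(senders):
    """Concept values per day if each sender emits one message per hour."""
    return senders * MESSAGES_PER_DAY * CONCEPT_COUNT


CONCEPT_COUNT = CONCEPTS_PER_MESSAGE

# Assumption 1: one message per location per hour.
per_location = daily_concept_values(LOCATIONS)
# Assumption 2: one message per sensor per hour.
per_sensor = daily_concept_values(LOCATIONS * SENSORS_PER_LOCATION)
```

Under the first assumption this comes to 108,000 concept values per day, and under the second to 216,000.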


Semantic Integration

(Architecture diagram; component labels as recovered from the slide)
• Publication and Subscription Service
• NLP/NLU Service
• Vocabulary Service
• Rules Service
• Pull
• Push
• SODS
• XML to Ontology Service
• Transformer Service
• Queue Service
• Notification Service
• Authorization Service
• Classification Service
• Event Store
• Fact Store
• Semantic Repository
• OPAL Service
• Signal Detection Service
• Ontology Service
• Semantic Application Programming Interface


SAPPHIRE Implementation

We Are Here


Proof of concept


Implementation Platform

1- TopBraid Composer as Ontology Management Tool
2- Jena from HP as API for Semantic Web
3- Eclipse Java Development Environment
4- Oracle Semantic Data Management (started with 10gR2 on Windows, currently 11gR1 on Linux)
5- Pellet/Jena OWL Micro Reasoner
6- Services Oriented Architecture
7- Microsoft SQL Server 2005 XML archive and Analysis Services
8- IBM Dual Xeon 2.8 GHz/3 GB RAM Blade Server
9- EqualLogic iSCSI SAN (4 TB)
10- Gigabit Ethernet LAN


Challenges

• State of the frameworks
• Maturity of Tools
• Knowledge Engineering and Ontology Development
• Reasoning and Rules Support
• Scalability
• Performance


Academic and Industrial

• Scalable, High Performance RDF/OWL Repositories (Oracle, Franz)

• Scalable Semantic Application Programming Interface (Oracle, TopQuadrant, HP)

• Ontology based Business Intelligence and Data Mining (TopQuadrant)


Scalable Semantic Application Programming

• Scalable SW application development interface for Oracle Semantic Data Management (SDM)

• Seamless integration of application development interfaces to Oracle SDM

• Without any intermediate or ‘in memory’ representation of semantic data


Method (conceptual)

• Adopt existing SW application development frameworks (e.g. Jena)

• Extend by eradicating the intermediate representation of Semantic data in memory

• Support SPARQL on the application side

• Enable invocation and use of integrated rules engines through the API
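The idea of removing the intermediate in-memory representation can be sketched as follows: instead of loading every triple into an application-side model and filtering there, each pattern lookup is pushed down to the repository as a query. This is a minimal illustration using Python's standard `sqlite3` module as a stand-in for a database-backed store; the table layout and the function names are assumptions, not the Oracle or Jena interfaces.

```python
import sqlite3


def open_store():
    """Create a small triple table.

    ":memory:" keeps the demo self-contained; a real repository
    would live in a database server, not in the application.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE triples (s TEXT, p TEXT, o TEXT)")
    return conn


def add(conn, s, p, o):
    """Insert one triple."""
    conn.execute("INSERT INTO triples VALUES (?, ?, ?)", (s, p, o))


def match(conn, s=None, p=None, o=None):
    """Yield triples matching the pattern; None acts as a wildcard.

    The filter runs inside the store, so the application only
    receives matching rows rather than the whole graph.
    """
    clauses, params = [], []
    for column, value in (("s", s), ("p", p), ("o", o)):
        if value is not None:
            clauses.append(f"{column} = ?")
            params.append(value)
    sql = "SELECT s, p, o FROM triples"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    yield from conn.execute(sql, params)
```

Because `match` is a generator over a database cursor, results stream to the caller row by row, and memory use on the application side stays independent of the size of the repository.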


Jena Conceptual Model As Is

(Diagram labels: SW Application; SW Application Programming Interface; Jena/Sesame Memory Model; Jena Persistence; RQL engine; URL based)

• Not scalable, depends on memory resources
• Does not support concurrent users and distributed applications
• Reasoning is not scalable and not integrated with repositories


(Diagram shown over three build slides; labels as recovered)
• SW Application
• SW Application Programming Interface
• Jena/Sesame/OWLAPI Model
• RQL engine
• Oracle Jena Adapter
• Rules Index/Entailment
• ORACLE RDF/OWL (SDM)
• Final build also shows the as-is components: Jena/Sesame Memory Model, Jena Persistence, RQL engine, URL based


Pros and Cons

Pros:
• Enables use of Oracle SDM for large scale implementations using Graph and Model objects
• Complete but indirect support of SPARQL
• Supports multi-user and distributed application environments
• Integrated support of Oracle reasoners (RDFS and OWLPrime) on the application side
• Robust performance through both the programming interface and SPARQL querying

Cons:
• Indirect support of SPARQL (not available outside of the API, through SQL Developer for example)
• OntModel to be supported in future releases
• Rules reasoning not real time


Contact

• Parsa Mirhaji, MD, Director
  The Center for Biosecurity and Public Health Informatics Research
  The School of Health Information Sciences
  The University of Texas - Health Sciences Center at Houston
  Office: (713) 500-3157
  Fax: (713) 500-0370
  Assistance (Namiko Burleson): (713) 500-3938

• http://www.phinformatics.org/ResearchProjects/SAPPHIRE/tabid/76/Default.aspx