allegrograph - cognitive probability graph webcast

The power of the Cognitive Probability Graph (aka Cognitive Computing) June 2016 Jans Aasman [email protected]

Upload: franz-inc-allegrograph

Post on 12-Apr-2017


TRANSCRIPT

Page 1: AllegroGraph - Cognitive Probability Graph webcast

The power of the Cognitive Probability Graph (aka Cognitive Computing)

June 2016

Jans Aasman

[email protected]

Page 2: AllegroGraph - Cognitive Probability Graph webcast

10 years ago

Structured Data

Page 3: AllegroGraph - Cognitive Probability Graph webcast

7 years ago

Structured Data Unstructured Data

Page 4: AllegroGraph - Cognitive Probability Graph webcast

4 to 5 years ago

Structured Data

Unstructured Data

Knowledge

Domain knowledge

Linked Open Data

Vocabularies

Taxonomies/Ontologies

Page 5: AllegroGraph - Cognitive Probability Graph webcast

New #1: Learning. Feed output of data science back into data infrastructure

Structured Data

Unstructured Data

Knowledge

Domain knowledge

Linked Open Data

Vocabularies

Taxonomies/Ontologies

Probabilistic Inferences

Page 6: AllegroGraph - Cognitive Probability Graph webcast

New #2: everything in one (distributed) semantic graph

Structured Data

Unstructured Data

Knowledge

Domain knowledge

Linked Open Data

Vocabularies

Taxonomies/Ontologies

Probabilistic Inferences

Page 7: AllegroGraph - Cognitive Probability Graph webcast

AKA: Cognitive Computing

Structured Data

Unstructured Data

Knowledge

Domain knowledge

Linked Open Data

Vocabularies

Taxonomies/Ontologies

Probabilistic Inferences

Page 8: AllegroGraph - Cognitive Probability Graph webcast

Examples

• Healthcare: if I have this class of diagnostics and I get this procedure, what are some of the new symptoms I might get in the next two years?

• eCommerce and brand protection: find all my products based on product similarity.

• Logistics: what can I statistically predict about part P breaking down, and what other parts do I usually buy after that part breaks down?

• Police intelligence: find the most plausible story as a temporally ordered shortest path between two criminals through observed (hard) facts and inferred (soft) facts.

• Fraud detection: find links between your local chamber of commerce and the Panama Papers through similar names and addresses.

Page 9: AllegroGraph - Cognitive Probability Graph webcast

Example healthcare

• Franz and Montefiore are partners in the Semantic Data Lake project.

Page 10: AllegroGraph - Cognitive Probability Graph webcast

One cognitive computing platform for all healthcare analytics

Page 11: AllegroGraph - Cognitive Probability Graph webcast

Example healthcare

• We created a single data-centric platform that can serve any type of analytic without building a new data mart for every new question.

• Currently 2.7 million patients with 10 years of data.

• All data is captured in a Unified Clinical Event model with 350 classes of events.

Page 12: AllegroGraph - Cognitive Probability Graph webcast

Healthcare: structured and unstructured data

Page 13: AllegroGraph - Cognitive Probability Graph webcast
Page 14: AllegroGraph - Cognitive Probability Graph webcast

Structured patient data combined with complex integrated terminology

Page 15: AllegroGraph - Cognitive Probability Graph webcast

Provenance for every value

Page 16: AllegroGraph - Cognitive Probability Graph webcast

Healthcare: the knowledge bases

• More than 180 vocabularies and terminology systems integrated in one unified terminology system (MeSH, SNOMED, UMLS, RxNorm, LOINC, etc.)

• External databases

• Linked Open Data

Page 17: AllegroGraph - Cognitive Probability Graph webcast

[Figure: Everything linked through SKOS]

Vocabularies shown: OMOP, LOINC, SNOMEDCT, MeSH, MedDRA, UMLS - MTH, UMLS - Semantic Net (Entity, Event)

Linking predicates shown: skos:semanticRelation, skos:broader, skos:narrower, skos:exactMatch, skos:notation, skos-xl:prefLabel, skos-xl:label, rdfs:subClassOf, rdf:type

SKOS/SKOS-XL classes shown: ConceptScheme, Concept, Label

UMLS fields shown: SAB, AUI, SUI, STR

Example concept: “Abdominal Pain”; identifiers shown near this label in the figure include M0024135, 9209005 and “C0000737”
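The linking idea on this slide can be sketched in a few lines: because skos:exactMatch is symmetric, following it in both directions collects every identifier that different vocabularies use for the same concept. This is a minimal sketch over plain Python pairs, not the AllegroGraph API, and all identifiers in it (`mesh:X1`, `snomed:Y1`, `umls:Z1` and so on) are hypothetical placeholders rather than real codes.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, Set, Tuple


def exact_match_closure(start: str,
                        exact_matches: Iterable[Tuple[str, str]]) -> Set[str]:
    """Return every concept reachable from `start` over skos:exactMatch links."""
    # skos:exactMatch is symmetric, so index each link in both directions.
    neighbours: Dict[str, Set[str]] = defaultdict(set)
    for a, b in exact_matches:
        neighbours[a].add(b)
        neighbours[b].add(a)

    # Breadth-first walk over the match links.
    seen = {start}
    frontier = deque([start])
    while frontier:
        node = frontier.popleft()
        for other in neighbours[node]:
            if other not in seen:
                seen.add(other)
                frontier.append(other)
    return seen


# Hypothetical identifiers, not real MeSH, SNOMED CT or UMLS codes.
EXACT_MATCHES = [
    ("mesh:X1", "umls:Z1"),
    ("snomed:Y1", "umls:Z1"),
    ("mesh:X2", "umls:Z2"),
]

equivalents = exact_match_closure("mesh:X1", EXACT_MATCHES)
print(sorted(equivalents))
```

Starting from one vocabulary's identifier, the walk reaches the matching identifiers in the other vocabularies without any vocabulary-specific code.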

Page 18: AllegroGraph - Cognitive Probability Graph webcast

SDL Paradigm: Analytic Tapestry (closed loop analytics)

[Figure labels: Population, Community; Time; Pt. (repeated); Diagnosis Codes; Disease Classification; OMIM, GONG; Genetic Profile; Procedure Code; HCPC; Manufacturers; PharmKGB; Drug Classification; Drug Codes; DrugBank; ClinicalTrials; CER; PubMed]

Page 19: AllegroGraph - Cognitive Probability Graph webcast
Page 20: AllegroGraph - Cognitive Probability Graph webcast
Page 21: AllegroGraph - Cognitive Probability Graph webcast

Healthcare: probabilistic inferences

Why is this so important?

• Usually the output of data science results in reports and publications, but:

• No formal trace of where the data came from

• No formal link to the actual methods you used, who did it, or when you did it

• Cannot be compared to earlier results

• Cannot be used as building blocks for further research

• In general: the output is not queryable

• This is not good for delivery of care, reproducibility of research findings, or security and compliance. It results in loss of value-added information and of enterprise intellectual property and assets, and in unnecessary duplication of effort.
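One way to make an analytic result queryable is to store it as triples together with its provenance: the method, the source data, who computed it and when. The sketch below uses a plain Python list of tuples instead of a graph database, and every name in it (the `ex:` identifiers, the predicate names, the dates and values) is a hypothetical illustration, not the schema used in the Semantic Data Lake.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]


def record_result(store: List[Triple], result_id: str, method: str,
                  source: str, analyst: str, date: str, value: str) -> None:
    """Append one analytic result plus its provenance to the triple list."""
    store.extend([
        (result_id, "rdf:type", "ex:AnalyticResult"),
        (result_id, "ex:method", method),        # which method produced it
        (result_id, "ex:derivedFrom", source),   # where the data came from
        (result_id, "ex:computedBy", analyst),   # who did it
        (result_id, "ex:computedOn", date),      # when it was done
        (result_id, "ex:value", value),
    ])


def match(store: List[Triple], s: Optional[str] = None,
          p: Optional[str] = None, o: Optional[str] = None) -> List[Triple]:
    """Return triples matching the pattern; None acts as a wildcard."""
    return [t for t in store
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]


# Hypothetical results; identifiers, dates and values are made up.
store: List[Triple] = []
record_result(store, "ex:result1", "ex:OddsRatio", "ex:cohortA",
              "ex:analyst1", "2016-06-01", "2.25")
record_result(store, "ex:result2", "ex:KMeans", "ex:cohortA",
              "ex:analyst2", "2016-06-02", "3 clusters")

# "Which stored results were produced with the odds-ratio method?"
odds_ratio_results = [s for s, _, _ in match(store, p="ex:method", o="ex:OddsRatio")]
print(odds_ratio_results)
```

Once results live in the same store as the data, earlier findings can be compared, reused and traced back to their inputs with ordinary pattern queries.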

Page 22: AllegroGraph - Cognitive Probability Graph webcast

Odds ratio
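For reference, the odds ratio of a 2x2 table is the odds of the outcome in the exposed group divided by the odds in the unexposed group, which simplifies to (a * d) / (b * c). The sketch below is a minimal illustration; the function name and the counts are made up and are not taken from the webcast.

```python
def odds_ratio(exposed_with: int, exposed_without: int,
               unexposed_with: int, unexposed_without: int) -> float:
    """Odds ratio of a 2x2 table: (a * d) / (b * c)."""
    if exposed_without == 0 or unexposed_with == 0:
        raise ValueError("odds ratio is undefined when b or c is zero")
    return (exposed_with * unexposed_without) / (exposed_without * unexposed_with)


# Made-up counts: 20 of 100 patients with condition A later show condition B,
# against 10 of 100 patients without condition A.
example = odds_ratio(20, 80, 10, 90)
print(example)  # 2.25
```

A value above 1 indicates that the outcome is more likely in the exposed group; a value of 1 indicates no association.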

Page 23: AllegroGraph - Cognitive Probability Graph webcast
Page 24: AllegroGraph - Cognitive Probability Graph webcast
Page 25: AllegroGraph - Cognitive Probability Graph webcast
Page 26: AllegroGraph - Cognitive Probability Graph webcast

Association rules
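An association rule "A implies B" is usually scored by support, confidence and lift. The sketch below computes all three over a handful of made-up patient event sets; the event names and data are hypothetical and only illustrate the arithmetic.

```python
from typing import FrozenSet, List, Set


def support(itemset: Set[str], transactions: List[FrozenSet[str]]) -> float:
    """Fraction of transactions that contain every item in `itemset`."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(antecedent: Set[str], consequent: Set[str],
               transactions: List[FrozenSet[str]]) -> float:
    """Estimate of P(consequent | antecedent)."""
    return support(antecedent | consequent, transactions) / support(antecedent, transactions)


def lift(antecedent: Set[str], consequent: Set[str],
         transactions: List[FrozenSet[str]]) -> float:
    """Confidence relative to the baseline frequency of the consequent."""
    return confidence(antecedent, consequent, transactions) / support(consequent, transactions)


# Made-up event sets, one per patient.
PATIENTS = [
    frozenset({"diabetes", "retinopathy"}),
    frozenset({"diabetes", "retinopathy", "hypertension"}),
    frozenset({"diabetes"}),
    frozenset({"hypertension"}),
]

rule_support = support({"diabetes", "retinopathy"}, PATIENTS)
rule_confidence = confidence({"diabetes"}, {"retinopathy"}, PATIENTS)
rule_lift = lift({"diabetes"}, {"retinopathy"}, PATIENTS)
print(rule_support, rule_confidence, rule_lift)
```

A lift above 1 means the consequent occurs more often with the antecedent than it does overall.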

Page 27: AllegroGraph - Cognitive Probability Graph webcast
Page 28: AllegroGraph - Cognitive Probability Graph webcast

K-means clustering
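As a reminder of what k-means does, the sketch below implements the standard assign-then-update loop in plain Python. It uses the first k points as starting centroids to keep the run deterministic, which is a simplification of typical random initialization, and the sample points are made up.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def squared_distance(p: Point, q: Point) -> float:
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(points: Sequence[Point]) -> Point:
    return tuple(sum(coords) / len(points) for coords in zip(*points))


def assign(points: Sequence[Point], centroids: Sequence[Point]) -> List[List[Point]]:
    """Put each point in the cluster of its nearest centroid."""
    clusters: List[List[Point]] = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: squared_distance(p, centroids[i]))
        clusters[nearest].append(p)
    return clusters


def k_means(points: Sequence[Point], k: int, iterations: int = 20):
    """Alternate assignment and centroid update until the centroids stop moving."""
    centroids = list(points[:k])  # naive deterministic initialization
    for _ in range(iterations):
        clusters = assign(points, centroids)
        # An empty cluster keeps its previous centroid.
        updated = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(points, centroids)


# Made-up two-dimensional points forming two obvious groups.
POINTS = [(1.0, 1.0), (1.0, 2.0), (9.0, 9.0), (10.0, 9.0)]
centroids, clusters = k_means(POINTS, k=2)
print(centroids)
```

On this toy input the loop settles after a few passes with one centroid in each group.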

Page 29: AllegroGraph - Cognitive Probability Graph webcast
Page 30: AllegroGraph - Cognitive Probability Graph webcast
Page 31: AllegroGraph - Cognitive Probability Graph webcast

And then a query you could never do before

• Using the Knowledge Base, the Structured Data and the Probabilistic Inferences all at the same time.

• To find the statistical links between Diabetes and Vision problems in our Semantic Data Lake:

• Find the set of ICD9s that are connected via one or more steps to concepts in the KB that mention Diabetes

• Find the set of ICD9s that are connected via one or more steps to vision* or eye* or retinal*

• And show how those two sets are related in the space of odds ratios
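The three steps above can be sketched over a toy graph. This is a minimal Python illustration of the logic, not the actual query that was run: the `BROADER` links, `LABELS`, ICD9 identifiers and odds ratios are all hypothetical, and "mention" is approximated by a word-prefix match on concept labels.

```python
from collections import deque
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical knowledge base: code or concept -> broader concepts.
BROADER: Dict[str, List[str]] = {
    "icd9:d1": ["c:diabetes"],
    "icd9:d2": ["c:t2"],
    "c:t2": ["c:diabetes"],
    "icd9:v1": ["c:retina"],
    "icd9:v2": ["c:cataract"],
    "c:cataract": ["c:eye"],
    "icd9:x1": ["c:fracture"],
}
LABELS = {
    "c:diabetes": "Diabetes mellitus",
    "c:t2": "Type 2 DM",
    "c:retina": "Retinal disorder",
    "c:cataract": "Cataract",
    "c:eye": "Eye disease",
    "c:fracture": "Fracture",
}
ICD9_CODES = ["icd9:d1", "icd9:d2", "icd9:v1", "icd9:v2", "icd9:x1"]
# Hypothetical precomputed odds ratios between pairs of codes.
ODDS_RATIOS: Dict[Tuple[str, str], float] = {
    ("icd9:d1", "icd9:v1"): 3.2,
    ("icd9:d2", "icd9:v2"): 1.8,
    ("icd9:d1", "icd9:x1"): 1.1,
}


def reachable_concepts(start: str) -> Set[str]:
    """Concepts connected to `start` via one or more broader steps."""
    seen: Set[str] = set()
    frontier = deque(BROADER.get(start, []))
    while frontier:
        node = frontier.popleft()
        if node not in seen:
            seen.add(node)
            frontier.extend(BROADER.get(node, []))
    return seen


def mentions(label: str, prefixes: Iterable[str]) -> bool:
    """True if any word of the label starts with one of the prefixes."""
    words = label.lower().split()
    return any(w.startswith(p) for w in words for p in prefixes)


def codes_mentioning(prefixes: List[str]) -> Set[str]:
    return {code for code in ICD9_CODES
            if any(mentions(LABELS.get(c, ""), prefixes)
                   for c in reachable_concepts(code))}


diabetes_codes = codes_mentioning(["diabetes"])
vision_codes = codes_mentioning(["vision", "eye", "retinal"])
# Keep only the odds ratios that connect the two sets.
related = [(a, b, ratio) for (a, b), ratio in sorted(ODDS_RATIOS.items())
           if a in diabetes_codes and b in vision_codes]
print(related)
```

The point of the sketch is that the knowledge base (labels and broader links), the structured data (ICD9 codes) and the probabilistic inferences (odds ratios) are all consulted in one pass.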

Page 32: AllegroGraph - Cognitive Probability Graph webcast
Page 33: AllegroGraph - Cognitive Probability Graph webcast
Page 34: AllegroGraph - Cognitive Probability Graph webcast

And then just a few other examples

Page 35: AllegroGraph - Cognitive Probability Graph webcast

In the ecommerce world: find similar objects based on > 10 criteria, including description, product codes, pictures, etc.
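Multi-criteria similarity of this kind is often computed as a weighted average of per-criterion scores. The sketch below covers only two criteria (description token overlap and product code equality); picture similarity is left out, and the field names, weights and products are hypothetical rather than the criteria used in the webcast.

```python
from typing import Dict


def jaccard(a: set, b: set) -> float:
    """Overlap of two sets: size of the intersection over size of the union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similarity(p: Dict[str, str], q: Dict[str, str],
               weights: Dict[str, float]) -> float:
    """Weighted average of per-criterion similarity scores in [0, 1]."""
    scores = {
        "description": jaccard(set(p["description"].lower().split()),
                               set(q["description"].lower().split())),
        "product_code": 1.0 if p["product_code"] == q["product_code"] else 0.0,
    }
    total = sum(weights.values())
    return sum(weights[name] * scores[name] for name in weights) / total


# Hypothetical products and weights.
WEIGHTS = {"description": 1.0, "product_code": 1.0}
original = {"description": "Red leather handbag", "product_code": "HB-100"}
suspect = {"description": "red leather bag", "product_code": "HB-100"}

score = similarity(original, suspect, WEIGHTS)
print(score)  # 0.75
```

Adding a criterion means adding one entry to `scores` and one weight, which is how the list could grow toward ten or more criteria.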

Page 36: AllegroGraph - Cognitive Probability Graph webcast
Page 37: AllegroGraph - Cognitive Probability Graph webcast

Returns a Graph in a Table (ughh)

Page 38: AllegroGraph - Cognitive Probability Graph webcast

But powerful when visualized

Page 39: AllegroGraph - Cognitive Probability Graph webcast

Or like this

Page 40: AllegroGraph - Cognitive Probability Graph webcast

And linking with the Panama Papers
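Linking through similar names can be approximated by normalizing both name lists and comparing them with a string-similarity ratio. The sketch below uses `difflib.SequenceMatcher` from the Python standard library as a stand-in for whatever matcher was actually used; the company names and the 0.85 threshold are made up for illustration.

```python
import re
from difflib import SequenceMatcher
from typing import List, Tuple


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]", " ", name.lower())
    return " ".join(cleaned.split())


def name_similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_links(local: List[str], leaked: List[str],
               threshold: float = 0.85) -> List[Tuple[str, str]]:
    """Pairs of names whose similarity reaches the threshold."""
    return [(a, b) for a in local for b in leaked
            if name_similarity(a, b) >= threshold]


# Made-up names standing in for a local registry and a leaked data set.
LOCAL_REGISTRY = ["ACME Holdings, Ltd.", "Zenith Trading SA"]
LEAKED_ENTITIES = ["Acme Holdings Ltd", "Northwind Shipping Inc"]

links = find_links(LOCAL_REGISTRY, LEAKED_ENTITIES)
print(links)
```

Each returned pair would become a candidate "soft" link in the graph for an investigator to confirm or reject, with addresses handled the same way.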

Page 41: AllegroGraph - Cognitive Probability Graph webcast

And now the researchers can start investigating

Page 42: AllegroGraph - Cognitive Probability Graph webcast

Summary: this is the new paradigm of computing

Structured Data

Unstructured Data

Knowledge

Domain knowledge

Linked Open Data

Vocabularies

Taxonomies/Ontologies

Probabilistic Inferences