ontology reasoning for the semantic web and its application to knowledge graph

53
1 Guilin Qi Knowledge Science and Engineering (KSE) Lab Southeast University [email protected] Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

Upload: jun-wang

Post on 26-Jun-2015

142 views

Category:

Technology


3 download

DESCRIPTION

Guilin Qi Knowledge Science and Engineering (KSE) Lab Southeast University [email protected]

TRANSCRIPT

Page 1: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 1

Guilin Qi

Knowledge Science and Engineering (KSE) Lab

Southeast University

[email protected]

Ontology Reasoning for the Semantic Web and Its Application to

Knowledge Graph

Page 2: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 2

Plan of the Talk Part I: Introduction of Ontology Languages for the

Semantic WebPart II: Application of Ontology Reasoning Part III: Reasoning with Large Imprecise Knowledge on

the Semantic Web

Page 3: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 3

Ontology

Different definitions: philosophy, AI,…

Definition in Semantic Web:

An ontology is an explicit specification of a conceptualization

Gruber, 1993

Page 4: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 4

What is an Ontology?

A model of (some aspect of) the worldIntroduces vocabulary

relevant to domain, e.g.:Anatomy

Page 5: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 5

A model of (some aspect of) the worldIntroduces vocabulary

relevant to domainSpecifies meaning (semantics)

of terms Heart is a muscular organ that is part of the circulatory system

What is an Ontology?

Page 6: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 6

Ontologies The core of an ontology is usually a taxonomy:

classes of things, arranged in a hierarchy

Human

MaleHuman [every MaleHuman is a Human]

Son [every Son is a MaleHuman]

Father [every Father is a Son]

FemaleHuman

Daughter

Aunt

Niece

Mother

GrandMother

Page 7: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 7

RDF (Resource Description Framework)Specifies relationship between data

RDFS(Resource Description Framework Schema)Specifies relationship between schema

OWL (Web Ontology Language)Specifies more complex relationship

between schema based on description logics

Ontology languages

Page 8: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 8

RDF idea

Use (directed) graphs as data model

Subject Predicate Object

“Resource Description Framework”

Page 9: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 9

RDF Schema (RDFS)

part of the W3C Recommendation RDFfor schema/terminological knowledgeuses RDF vocabulary with pre-defined

semantics

Page 10: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 10

Classes and Instances

Classes stand for sets of things.In RDF: Sets of URIs.

book:uri is a member of the class ex:Textbook

a URI can belong to several classes

classes can be arranged in hierarchies:each textbook is a book

Page 11: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 11

Implicit knowledge if an RDFS document contains

and

then

is implicitly also the case: it’s a logical consequence. (We can also say it is deduced (deduction) or inferred (inference)

Page 12: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 12

Implicit knowledge – another example

From

the following is a logical consequence:

I.e. rdfs:subClassOf is transitive.

Page 13: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 13

Using implicit knowledge

• Ontology (Knowledge Base)• e.g. RDF or OWL

• Reasoner (produces implicit knowledge)

• Completed (materialized) knowledge base

• Application

Used like a

databaseoffline

Page 14: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 14

The same as graph

Page 15: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 15

OWL – Web Ontology Language

<owl:Class rdf:about="project"> <rdfs:subClassOf> <owl:Class> <owl:unionOf rdf:paseType="Collection"> <owl:Class rdf:about="internal"/> <owl:Class rdf:about="external"/> </owl:unionOf> </owl:Class> </rdfs:subClassOf></owl:Class>

Why do we need OWL?

ProjectInternal

External

Is either

or

Page 16: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 16

OWL 2 Profiles

The OWL 2 spec describes three profiles (fragments, sublanguages) which have polynomial complexity.OWL EL (the description logic EL++)

• Represent medical knolwedge

OWL QL (the description logic DL LiteR)• Targeted to data integration

OWL RL (the description logic DLP)• inspired by intersecting OWL with Datalog• implemented e.g. in Oracle 11g

Page 17: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 17

Description LogicsDescription logics

Are (mostly) decidable fragments of first-order predicate logic

Provide logical underpinning of W3C standard OWL

Building blocksConcepts (unary predicates/formulae with

one free variable)o E.g., Person, Lawer ⊔ Doctor

Roles (binary predicates/formulae with two free variables)

o E.g., hasChildIndividuals (constants)

o E.g., John, Mary

Page 18: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 18

Description Logics (Syntax)Description languages

Defining complex concepts: sets of individuals

Defining complex roles: binary relations on individuals

Complex concepts are built byAtomic concepts: Tissue, HeartConstructors: Tissue⊓part-of.Heart

Complex roles are built byAtomic roles: part-of, has-locationConstructors: HasFatherˉ

Page 19: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 19

Heart is a muscular organ that is part of the circulatory system

Heart⊑MuscularOrgan⊓part-of.CirculatorySystem

Example

Page 20: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 20

Description Logics (Semantics)

Interpretation: I=(I,.I) Domain: I

Assignment function .I

individual names, class names and property names...

• .I

• aI • CI

• RI

•Δ• II • IC

• IR

Page 21: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 21

Description Logics (Cont.)

Interpretation: I=(I,.I)

Construct Syntax Example Semantics

Atomic concept A Heart AI I

Atomic role R part-of RI I I

Negation C Heart I \ CI

Conjunction C ⊓ D Lawyer⊓Doctor CI∩DI

Value restriction R.C part-of.Wood {a|b. (a,b) RI, b CI}

… … … …

Page 22: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 22

Description Logics (Ontology)

TBox T: defining terminology of application domain Inclusion assertion on concept :C ⊑ D

Inclusion assertion on roles: R ⊑ S

ABox A: stating facts about a specific “world” membership assertion: C(a) or R(a,b)

Pericardium ⊑ Tissue ⊓ part-of.Heart

Part-of ⊑ has-location

HappyMan(Bob), HasChild(Bob, Mary)

Page 23: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 23

Given an interpretation ISemantics of TBox axioms

I ⊨ C ⊑ D if CI DI

I ⊨ R ⊑ S if RI SI

Semantics of ABox assertionsI ⊨ C(a) if aI CI I ⊨ R(a,b) if (aI,bI) RI

Description Logics(Semantics)

Page 24: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 24

Model of an ontology O=<T, A>I is a model of O if it satisfies all axioms

in T and all assertions in AConcept satisfiability

Concept C is satisfiable in O if CI is nonempty for some model I of O

Ontology Entailment:O ⊨ iff I⊨ for all models I of O

Description Logics(Semantics)

Page 25: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 25

RDF is a flexible data model for Semantic Web

RDF Schema provides simple inference capability

OWL allows more expressive representation of knowledge but is hard to scale to Web data

Semantic technologies have been adopted by major companies such as Google, Yahoo and Facebook

Conclusion

Page 26: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 26

Reasoning Task: Classification

DiseaseHeartDiseaseMiocardial_Infarction⊑⊑

心肌梗塞 心脏病 疾病

Page 27: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 27

Example

Endocardium ⊑ Tissue cont-in.HeartWall ⊓ ∃ ⊓ ∃ cont-in.HeartValve

HeartWall ⊑ BodyWall part-of.Heart⊓∃

HeartValve ⊑ BodyValve part-of.Heart⊓∃

Endocarditis ⊑ Inflammation⊓ ∃ has-loc.Endocardium

Inflammation ⊑ Disease act-on.Tissue⊓∃

HeartDisease has-⊓∃loc.HeartValve

⊑ CriticalDisease

HeartDisease ⊑ Disease has-loc.Heart⊓∃

Endocarditis⊑HeartDisease

Endocarditis⊑CriticalDisease

Role of Classificationa) Enrich ontologyb) Query writtingc) Check satisfiability of KB

Reasoning Task: Classification

Page 28: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 28

Endocarditis⊑HeartDisease

Meningitis⊑HeartDisease

HeartDisease Disease⊑ ⊓ has-loc.Heart∃

Meningitis⊑ has-loc.Heart∃

Endocarditis ⊑ has-loc.Heart∃

Classification

Justification

Justification

Through classification, we have“Meningitis⊑ has-loc.Heart”∃ After finding justification, we found“Meningitis⊑HeartDisease” is

wrong

Repair KB using justifications

Reasoning Task: Finding Justification

Page 29: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 29

Suppose we have the following query “ 心脏病患者有哪些?”

This query can be used for medical statistics

Reasoning Task: Query Rewritting

Page 30: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 30• 04/13/2023• 3

0

王红

心内膜炎

心脏病

冠心病心肌梗塞

李明… …

We know“ 王红” is 心内膜炎患者, we should include her as 心脏病患者

List(X) :- Endocarditis(X) ⋁ Miocardial_Infarction(X) ⋁ Coronary_disease(X).

Reasoning Task: Query Rewritting

Suppose we have the following ontology

Page 31: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 31

Inconsistency may occur during ontology construction One source of inconsistency comes from disjoint axioms

心脏病

脑科疾病

心内膜炎

• ⊑

• ⊑

Reasoning Task: Checking Inconsistency

Page 32: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 32

Mining disjoint Concepts : Association rule mining, Inductive logic programming

In Zhishi.me, hudong: 大豆食心虫 not only belongs to animal but also contains in plant

In Zhishi.me, there are 50 common instances between animal and plant

• 植物

• 花

• 玫瑰

• 蔷薇

• 农作物

• 玉米

• 小麦

• 动物• 哺乳动物• 老

虎• 狮

RDF Data

Mining Axioms 植物 disjoint 动物

花 disjoint 农作物植物 disjoint 哺乳动物

……

Disjointness Axioms

Reasoning Task: Checking Inconsistency

Page 33: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 33

In Dbpedia, we find 42153 disjointness axioms by mining axioms algorithm

Reasoning Task: Checking Inconsistency

Number of Common individuals Pairs of disjoint classes

[1,10) 317

[10,100] 27

[100,1000) 7

Page 34: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 34

Detecting noisy type assertion

Experiment result of detecting noisy type assertion in DBpedia

Reasoning Task: Checking Inconsistency

Disjoint concepts with common instance

Classifier

Place(pope) errorPerson(Dibra ) error

…Place(Xbox_Music) and

Person(Xbox_Music) error

Noisy type assertion detection

< pope Place Person>< Dibra Place Person >

…< Xbox_Music Place Person >

Classifier Precision Recall F1-Measure

J48 93.6% 93.6% 93.6%

J48(boost) 95.6% 95.6% 95.6%

Page 35: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 35• 04/13/2023

Reasoning with Large Scale Imprecise Knowledge on the Semantic Web

Page 36: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 36• 04/13/2023

Motivation More and more semantic data are published and

linkedSemantic data are inherently imprecise

Data extraction may result in imprecisionData linking may result in imprecision Reasoning with large imprecise semantic data

Schema of the data may also be imprecise Schema inductionOntology enrichment Reasoning with large imprecise ontologies

Page 37: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 37• 04/13/2023

•  

Background: Fuzzy pD*

Page 38: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 38• 04/13/2023

•  

Background: Fuzzy pD*

Page 39: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 39

MapReduce

Page 40: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 40

• Mapper1

• Mapper2

Reducer1

(I)

Reducer2

(love)

Reducer3

(you)

Reducer4

(like)

I love you love

I likeyou

<I , 1>

<love , 1>

<you , 1>

<love , 1>

<I , 1>

<like , 1>

<you , 1>

<I , 2>

<love , 2>

<you , 2>

<like , 1>

Example

Page 41: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 41

Challenges

Ordering the rule applicationsBad orders will generate more non-BDB fuzzy triples

The shortest path calculationSome rules essentially calculates the all-pair shortest paths

Sameas rulesCanonical representation technique is not applicable to handle the semantics of vague sameas triples

Page 42: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 42

Ordering the rule applications

Control flow of the reasoning algorithms

•  

•  

New fuzzy triples derived?

New fuzzy triples derived?

New fuzzy triples derived?

Page 43: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 43

Shortest path calculation

•  

•  

•  

•  

•  

•  

•  

•  

•  

Page 44: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 44• 04/13/2023

Sameas rules

•  

Vague sameas triples

Page 45: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 45

Dataset Weighted DBPedia core ontology wpdLUBM 1000, 2000, 4000, 8000

Cluster 25 machine with at most 75 mapper/reducer slots

Setting

Page 46: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 46

#units 128 64 32 16 8 4 2

Time(sec.) 122.653 136.861 146.393 170.859 282.802 446.917 822.269

Speedup 6.70 6.01 5.62 4.81 2.91 1.84 1.00

Dataset: Weighted DBPedia core ontology

Results:

Experiment

Page 47: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 47

Scalability over number of units

Experiment

Page 48: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 48

Reasoning on fuzzy-EL+

R2

Classfication rules for fuzzy-EL+

∃performAs.ComplicationHeart_ComplicationMiocardial_Infarction⊑(1.0)⊑(0.2)

⊑(0.2)

心肌梗塞 心脏附近并发症 具有并发症表现的疾病

Page 49: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 49

Challenges and methods

Transforming rule-applying to an operation on tables The rules are given by operations on sets. It is more straightforward to treat them as operations on tables, in other words, relation algebra

Page 50: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 50

Handling multi-way join

•   •  

MapReduce can handle a 2-way join in one job.

Challenges and methods

Page 51: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 51

Evaluation on Galen

• EL-Galen• 8nodes, 16 units

Concepts Roles GCI RI

47,840 1,892 87,594 1,947

1. 82

3. 32 4. 45

5. 53

10. 63

0.00

2.00

4.00

6.00

8.00

10. 00

12. 00

0 2 4 6 8 10

hour

number of copi es

Scal abi l i t y Test 2

• Scalability test 1: The system’s performance speeds up linearly to the increase of number of units• Scalability test 2: The cost time increases linearly to the number of copies of the input ontology

1. 82 2. 02

2. 80

4. 09

0.000.501.001.502.002.503.003.504.004.50

0.0000 0.2000 0.4000 0.6000

hours

1/ number of uni t s

Scal abi l i t y Test 1

Page 52: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 52

Evaluation on Snomed-CT

• Snomed-CTConcepts Roles GCI RI

836,612 77 1,438,948 23

• Scalability test : The cost time increases linearly to the number of copies of the Snomed-CT

0.5 1 1.5 2 2.5 3 3.50

5

10

15

20

25

30

the number of copies

hours

Page 53: Ontology Reasoning for the Semantic Web and Its Application to Knowledge Graph

• 53

Conclusion and Discussion Ontology reasoning plays an important role for KG

Detecting logical inconsistency and repair knowledge bases (improve the quality of the knowledge)

Extend knowledge bases ( materialization, classification) Query rewritting and extension

Reasoning in KG need meta-reasoning Variety of knowledge: terminological knowledge, rule, probabilistic

knowledge ect. Challenging problems

Current work is mainly based on MapReduce and Hadoop, thus suffers from the problem of efficiency and dynamics

Only lightweight ontology languages, such as RDFS, OWL 2 RL are supported

Reasoning with imprecise knowledge is not well discussed