prompt: algorithm and tool for automated ontology merging and alignment

61
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy Stanford Medical Informatics Stanford University

Upload: thai

Post on 12-Jan-2016

45 views

Category:

Documents


2 download

DESCRIPTION

PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment. Natalya F. Noy Stanford Medical Informatics Stanford University. Outline. Definitions and motivation The PROMPT ontology-merging algorithm Incremental algorithm (PROMPT) Statistical algorithm (Anchor-PROMPT) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

PROMPT:Algorithm and Tool for Automated Ontology Merging and Alignment

Natalya F. Noy

Stanford Medical Informatics

Stanford University

Page 2: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Outline

Definitions and motivation The PROMPT ontology-merging algorithm

Incremental algorithm (PROMPT) Statistical algorithm (Anchor-PROMPT)

The tools Evaluation Future work

Page 3: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Ontologies

Characterize concepts and relationships in an application area, providing a domain of discourse

Enumerate concepts, attributes of concepts, and relationships among concepts

Define constraints on relationships among concepts

Page 4: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Why do we need ontologies

An ontology provides a shared vocabulary for different applications in a domain

An ontology enables interoperation among applications using disparate data sources from the same domain

Page 5: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Ontologies Are Everywhere

Ontologies have been used in academic projects for a long time Knowledge sharing and reuse Reuse of problem-solving methods

Ontologies are becoming widely used outside of academia Categorization of Web sites (e.g. Yahoo!) Product catalogs

Page 6: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Need for Ontology Merging

There is significant overlap in existing ontologies Yahoo! and DMOZ Open Directory Product catalogs for similar domains

Page 7: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Need for Ontology Merging and Integration

Need to merge or align overlapping ontologies Chemdex™—a portal for accessing life-

science–supply catalogs Workshop on “Ontologies and Information

Sharing” at IJCAI’2001 6 out of 18 papers (1/3) are about ontology

merging and integration

Page 8: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

What Is Ontology Merging

Page 9: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Existing Approaches

Ontology design and integration term matching (Stanford SKC, ISI) graph-based analysis (Stanford SKC) transformation operators (Ontomorph at ISI) merging tools (Chimaera at Stanford KSL)

Object-oriented Programming subject-oriented programming (IBM)

“subjective” views of classes transformation operations concentrates on methods rather than relations

Page 10: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Existing Approaches (II)

Databases develop mediators and provide wrappers define a common data model and mappings define matching rules to translate directly

Most of these approachesdo not provide any guidance to the user,

do not use structural information

Page 11: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Outline

Definitions and motivation The PROMPT ontology-merging algorithm

Incremental algorithm (PROMPT) Statistical algorithm (Anchor-PROMPT)

The tools Evaluation Future work

Page 12: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

PROMPT

Our approach is: Partial automation Algorithms based on

concept-representation structure relations between concepts user’s actions

Our approach is not: Complete automation Algorithm for matching concept names

Page 13: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Knowledge Model

A generic knowledge model of OKBC (Open Knowledge-Base Connectivity Protocol) Classes

Collections of objects with similar properties Arranged in a subclass–superclass hierarchy

Instances Slots

First-class objects in a knowledge base Binary relations describing properties of classes and instances

Facets Constraints on slot values (cardinality, min, max)

Page 14: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Make initial suggestions

Select the next operation

Perform automatic updates

Find conflicts

Make suggestions

The PROMPT Algorithm

Page 15: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Example: merge-classes

Agencyemployee

Agent

Customer

subclass of

agent for

Agent

Employee

Traveler

subclass of

has client

Agencyemployee

Agent

Employee

Customer Traveler

subclass of subclass of

agent for has client

Page 16: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Example: merge-classes (II)

Agencyemployee

Agent

Employee

Customer Traveler

subclass of subclass of

agent for has client

Agencyemployee

Agent

Employee

Customer Traveler

subclass of subclass of

agent for

Page 17: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Analyzing Global Properties Locally

Global properties classes that have the same sets of slots classes that refer to the same set of classes slots that are attached to the same classes

Local context incremental analysis consider only the concepts that were affected

by the last operation

Page 18: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

The PROMPT Operation Set

Extends the OKBC operation set with ontology-merging operations merge classes merge slots merge instances copy of a class

deep or shallow with or without subclasses with or without instances

Page 19: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

After a User Performs an Operation

For each operation perform the operation consider possible conflicts

identify conflicts propose solutions

analyze local context create new suggestions reinforce or downgrade existing suggestions

Page 20: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Conflicts

Conflicts that PROMPT identifies name conflicts dangling references redundancy in a class hierarchy slot-value restrictions that violate class

inheritance

Page 21: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agent Agent

Agent

Example: merge-classes

Page 22: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Operation Steps: merge-classes

Own slot and their values for the new classask the user in case of conflicts or use preferences

Template slots for the new classunion of template slots of the original classes

Subclasses and superclasses for the new class

Conflicts Suggestions

Page 23: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agent Agent

Agent

agent for

Template Slots

Copy template slots that don’t exist in the merged ontology

agent for

Page 24: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agent Agent

Agent

has client

clientclient

Template Slots

Attach the slots that have already been mapped

Page 25: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Employee

Subclasses And Superclasses

If a superclass (subclass) exists, re-establish the links

Agent Agent

Agent

Agencyemployee

superclass

superclass

Page 26: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agent

Dangling References

Agent Agentagent for

Customer

facet value

For example,allowed class

agent forfacet value

Customer _temp

dummy frame

Page 27: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agentclient

has client

Additional Suggestions: Merge Slots

If slot names at the merged class are similar, suggest to merge the slots

Page 28: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Agent

Additional Suggestions: Merge Classes

If the set of classes referenced by the merged class is the same as the set of classes referenced by another class, suggest a merge

ReservationClient

hasclients

handlesreservations

Agency employee

Page 29: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Employee Agencyemployee

Agent

If names of superclasses (subclasses) of the merged class are similar, suggest to merge the classes

superclasssuperclass

Additional Suggestions: Merge Classes

Page 30: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Check for Cycles

Person

Employee Agencyemployee

Agent

superclass

superclass

If there is a cycle, suggest removing one of the parents

Page 31: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

To Summarize

Perform the actual operation For the concepts (classes, slots, and

instances) directly attached to the operation arguments perform global analysis for new suggestions Perform global analysis for new conflicts

Page 32: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Non-local context

Classes directly referenced by C

Slots in C

Context

C

Page 33: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Anchor-PROMPT: Using Non-Local Contexts

Input: A set of anchor pairs

Output: A set of related terms with

similarity scores

Where do anchors come from? Lexical matching Interactive tools User-specified

Ontology 1 Ontology 2

Page 34: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Generating Paths in the Graph

Page 35: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Similarity Score

Generate a set of all paths (of length < L) Generate a set of all possible pairs of paths of

equal length For each pair of paths and for each pair of

nodes in the identical positions in the paths, increment the similarity score

Combine the similarity score for all the paths

Page 36: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Equivalence Groups

Page 37: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Anchor-PROMPT: Initial Results

TRIAL Trial

PERSON Person

CROSSOVER Crossover

PROTOCOL Design

TRIAL-SUBJECT Person

INVESTIGATORS Person

POPULATION Action_Spec

PERSON Character

TREATMENT-POPULATION Crossover_arm

Page 38: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Knowledge Model Assumptions

The only assumption:

An OKBC-compliant knowledge model

Page 39: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Outline

Definitions and motivation The PROMPT ontology-merging algorithm

Incremental algorithm (PROMPT) Statistical algorithm (Anchor-PROMPT)

The tools Evaluation Future work

Page 40: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Protégé-2000

An environment for Ontology development Knowledge acquisition

Intuitive direct-manipulation interface Extensibility

Ability to plug in new components

Page 41: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Ontologies in Protégé-2000

Page 42: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Protégé-200 plugins

Domain-specific user-interface plugins Alternative back ends for archival storage Utility programs for knowledge-acquisition

tasks End-user applications

Page 43: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Protégé-based PROMPT tool

Protégé-2000 has an OKBC-compatible knowledge model allows building extensions through a plug-in

mechanism can work as a knowledge-base server for the plug-

ins

Page 44: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

The PROMPT tool

Page 45: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment
Page 46: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

The PROMPT tool features

Setting a preferred ontology Maintaining the user’s focus Providing feedback to the user Preserving original relations

subclass-superclass relations slot attachment facet values

Linking to the direct-manipulation ontology editor Logging operations

Page 47: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Outline

Definitions and motivation The PROMPT ontology-merging algorithm

Incremental algorithm (PROMPT) Statistical algorithm (Anchor-PROMPT)

The tools Evaluation Future work

Page 48: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Evaluation

Knowledge-based systems are rarely evaluated

We can use software-engineering approaches to empirical evaluation of tools

We need to develop additional knowledge-base measurements

Page 49: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Questions we asked

How good are PROMPT’s suggestions and conflict-resolution strategies?

Does PROMPT provide any benefit when compared to a generic ontology-editing tool (Protégé-2000)?

Page 50: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

What we were trying to find out

The benefit that the tool provides Productivity benefit Quality improvement in the resulting

ontologies User satisfaction

Precision and recall of the tool’s suggestions

Page 51: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Source ontologies for the experiments

Two ontologies of problem-solving methods the ontology for the Unified Problem-solving

Method Development Language (UPML) the ontology for the Method-Description

Language (MDL)

Page 52: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Experiment 1: Evaluate the quality of PROMPT’s suggestions

Metrics Precision Recall

Method Automatic

logging Automatic data

reporting

Suggestions that the tool

produced

Operations that the user

performed

Suggestions that the user

followed

Page 53: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Results: the quality of PROMPT’s suggestions

Suggestions that users followed

Conflict-resolution strategies that users followed

Knowledge-base operationsgenerated automatically

90% 75%

74%

Page 54: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Experiment 2: PROMPT versus generic Protégé-2000

Metrics content of the resulting

ontologies number of explicit

knowledge-base operations

PROMPT

Page 55: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Results: PROMPT versus generic Protégé-2000

The resulting ontologies had only one difference Specifying operations explicitly

0

20

40

60

PROMPT Protégé

1660

Page 56: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Results

Experts followed most of the PROMPT’s suggestions

Using PROMPT has improved the efficiency of ontology merging

Page 57: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Anchor-PROMPT Evaluation

Experiment setup Two ontologies from the DAML ontology library Varying parameters

maximum path length number of anchor pairs

Experiment results Ratio of correct results above the median

similarity score

Page 58: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Anchor-PROMPT: Evaluation Results

Max path length

Number of anchors

Result precision

4 4 67%4 3 67%4 2 61%3 4 67%3 3 61%3 2 56%2 4 100%2 3 100%

Page 59: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Anchor-PROMPT Evaluation Results

Equivalence groups of size <= 2 are required

Maximum path lengths of 2 provides extremely high precision (but low recall)

75% precision with maximum path lengths 3 and 4

Page 60: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

Future work

Extend the set of heuristics that PROMPT uses for guiding the experts

Extend the techniques to ontology alignment and ontology refactoring

Develop protocols and metrics for a more detailed evaluation of the tools

Page 61: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment

http://protege.stanford.edu