Ontology Mapping and link discovery

Download Ontology Mapping and link discovery

Post on 28-Jan-2016




0 download

Embed Size (px)


Ontology Mapping and link discovery. Kunal Narsinghani Ashwini Lahane. Agenda. Introduction Levels of heterogeneity Previous work in the field PROMPT Suite of Tools Prompt on Protg The Web of Data CRS : Managing Co-references Silk A link discovery framework. Introduction. - PowerPoint PPT Presentation


<ul><li><p>Kunal NarsinghaniAshwini Lahane</p><p>Ontology Mapping and link discovery</p></li><li><p>Agenda</p><p>IntroductionLevels of heterogeneityPrevious work in the fieldPROMPT Suite of ToolsPrompt on ProtgThe Web of DataCRS : Managing Co-referencesSilk A link discovery framework</p></li><li><p>IntroductionCan a single ontology suffice for various applications?</p><p>Definition The task of relating the vocabulary of two Ontologies that share the same domain of discourse</p><p>Its a morphism that consists of a collection of functions assigning symbols used in one vocabulary to the symbols in the other[1]</p><p>This would provide a common layer from which ontologies can be accessed and exchange information.</p><p>Translation is different from mapping</p></li><li><p>IntroductionAn analogy to the problem Clocks</p><p>Levels of Heterogeneity in Ontologies Syntactic</p><p>Structural</p><p>Semantic</p></li><li><p>Mapping discoveryFirst approach is to use a reference ontology</p><p>Example the upper Ontologies SUMO and DOLCE</p><p>What when a shared ontology is not available?</p><p>Structural &amp; definitional information can be used to discover mappings</p><p>Example tools IF-Map, QOM, MAFRA &amp; Prompt</p></li><li><p>IF-MAP architectureFig: The steps in IF-MAP</p></li><li><p>PROMPT Suite of ToolsInteractive tools for ontology merging and mappingOntology formal specification of domain information facilitate knowledge sharing and reuseDifferent ontologies may overlap, need to be reconciledDetermine correlation Find all conceptsDetermine similaritiesChange source ontologies or remove overlapRecord mapping for future reference</p></li><li><p>Ontology ManagementTasksFinding correlationsMerging ontologiesVersion managementFactoring ontologiesToolsBenefit from being tightly integrated into single frameworkUniform user interfaceSame interaction paradigms Easy access from one tool to another</p></li><li><p>PROMPT Knowledge ModelBased on knowledge model of ProtgFrame based Types of framesClassSet of entities specifying a conceptSlots Attributes of class Has domain and range Must have unique namesInstances Elements of class</p></li><li><p>PROMPT FrameworkTools for multiple-ontology managementExtension to Protege ontology-editing environmentOpen architecture allows easy extension with pluginsTools in PROMPTIPROMPT Interactive ontology merging toolANCHORPROMPT a graph-based tool for finding similarities between ontologiesPROMPTDIFF for finding a diff between two versions of the same ontologyPROMPTFACTOR a tool for extracting a part of an ontology</p></li><li><p>PROMPT Framework</p></li><li><p>IPROMPT</p><p>Interactive ontology merging toolLeads user through merging processSuggestions for mergingIdentifies inconsistencies and potential problemsSuggests strategies for resolvingUses structure of concepts and their relation along with user inputDecision based on local contextIterative</p></li><li><p>IPROMPT Algorithm</p></li><li><p>IPROMPT AlgorithmCreates initial suggestion based on lexical similarity of namesMerged ontology contains frames which are similar to frames in input ontologies2 ontologies O1 and O2 are merged to form OmMerging decisions are designer and task dependentSet of knowledge based operations definedFor each operation:Changes performed automaticallyNew merging suggestionsInconsistencies and potential problems</p></li><li><p>Class hierarchies</p></li><li><p>Suggestion for merging </p></li><li><p>IPROMPT Operations</p><p>Merge classes Merge slotsMerge instancesShallow copy of a classCopy class from source ontology to mergedDeep copy of a classAlso copies all the parents of the class up to the root hierarchy</p></li><li><p>Inconsistencies &amp; Potential Problems</p><p>Name conflicts</p><p>Dangling references</p><p>Redundancy in the class hierarchy</p><p>Slot values violating slot-value restrictions</p></li><li><p>Additional features</p><p>Setting up preferred ontology</p><p>Maintaining user focus</p><p>Providing feedback to user</p><p>Logging of ontology merging and editing operations</p></li><li><p>ANCHORPROMPT</p><p>Graph based tool for finding similarities Compares larger portionsGoal : Augment IPROMPT by determining additional points of similarityInput : Anchors - Set of pairs of related termsAnchor identification Manual /AutomaticEach ontology is viewed as a directed labeled graph</p></li><li><p>ANCHORPROMPT representation</p></li><li><p>ANCHORPROMPT algorithm</p></li><li><p>AlgorithmBegins with anchor pair TRIAL, TrailPERSON, PersonPath 1: TRIAL -&gt; PROTOCOL -&gt; STUDY-SITE -&gt; PERSONPath 2: Trial -&gt; Design -&gt; Blinding -&gt; PersonDetermine similarity score for pair of related termsIf two pairs of terms from the source ontologies are similar and there are paths connecting the terms, then the elements in those paths are often similar as well</p></li><li><p>PROMPTDIFFTool for comparing ontology versionsVersion comparison in software code is based on comparing text filesOntologies have different text representationHeuristics algorithm that produces a structural diff between two versionsCompares the structure of the two ontology versionsIdentifies frames changed and what changes were made</p></li><li><p>PromptDiff AlgorithmAn extensible set of heuristic matchersFixed-point algorithm to combine the results of the matchers to produce a structural diff between two versions</p></li><li><p>PROMPTFACTOR</p><p>Tool for factoring out semantically independent part of an large ontology into a new sub-ontologyEnsures that severed links do not introduce ill-defined concepts in the sub-ontologyUser can specify concepts of interestPerforms the transitive closure of the superclass relation and all the relations defined by slotsTarget ontology works as stand-alone</p></li><li><p>PromptFactor Algorithm</p><p>User specifies the concept of interestPromptFactor traverses the ontology termDetermines transitive closure of all relations including subclass-of relationDetermines all the parents of selected term in hierarchyUser interactiveDetermines inconsistencies</p></li><li><p>Prompt DemoIt is available as a plug-in for Protg 3.4</p><p>Uses linguistic similarity matches between concepts</p><p>Also matches slot names and slot value types</p><p>In cases where automation is not possible, user intervention is needed; possible actions are suggested</p><p>Alignment is followed by merging</p><p>Alignment is establishing links between the ontologies</p><p>Merging is the creation of a single coherent ontology</p></li><li><p>Prompt Demo</p></li><li><p>The Web of DataData sources span a large range of domains</p><p>RDF data model is used to publish structured data on the web</p><p>Explicit RDF links exist between entities in different data sources</p><p>However, there is a lack of tools to set RDF links to other data sources </p></li><li><p>SilkIt is a link specification language </p><p>Allows specification of the links that should be discovered between data sources, as well as conditions to be fulfilled to be linked</p><p>Link conditions are specified using similarity metrics; they can use aggregation functions to combine similarity scores</p><p>Data access performed using SPARQL</p></li><li><p>Silk FeaturesSupport for owl:sameAs links and other types of RDF links</p><p>Provides a declarative language to specify link conditions</p><p>Datasets need not be replicated locally</p><p>Caching, indexing and entity pre-selection are used to enhance performance</p></li><li><p>Silk LSL example</p></li><li><p>Silk LSL example..contd</p></li><li><p>Silk similarity metricsSimilarity metrics can be combined using aggregation functionsSets of resources can be selected using Silk RDF path selector language</p></li><li><p>Silk Pre-MatchingComparison of all entities in Source S and Target T would need O(|S|*|T|)</p><p>Using pre-matching a limited set of target entities that are likely to match a given source entity is found</p><p>Performed by indexing the target resources based on their property values</p><p>Using this scheme reduces runtime to O(|S| + |T|)</p></li><li><p>Silk Implementation</p></li><li><p>Managing coreferences</p><p>Semantic web vision - Large quantities of information Readily available InterlinkedMachine readableFragmented webSignificant overlapNeed to identify duplicatesCo-reference resolution determining equivalent URIs</p></li><li><p>Co-reference Resolution Service (CRS)</p><p>Systematic analysis and heuristic based approach :IdentifyingPublishingManaging Using co-reference information</p><p>Most prevalent way owl:sameAsEquivalence context dependent</p></li><li><p>CRSes</p><p>Maintain sets of equivalent URIsStoring co-reference data separatelyURI definition and synonyms are kept separateManagement techniques - history, rollback, annotationUse of multiple CRSes that applications can useCore functionality in PHP easy integrationBacked by MySQL</p></li><li><p>Data representation in CRS</p><p>Equivalent URIs are stored in bundles1 URI in each bundle is considered as a canon- preferred URIFormation of bundles:Check if URI already exists in any bundleIf not, create a singleton bundle for new URIsPerform merge union of bundles with equivalent URIs Constituent bundles that were merged are marked inactive</p></li><li><p>Examples of bundle formation</p></li><li><p>Data representation</p><p>Data storage Indexed tables of hashed URIsPermits fast lookup to find:Canon of given URIAll URIs in a bundleDeprecate URIs by flagsFinding all equivalences - coref:coreferenceData links to the bundle for that URI and recursively repeat the process for each URI in that bundle</p></li><li><p>2009-01-16 11:11:40</p><p>RDF description of equivalent URIs in a bundle</p></li><li><p>Ways to speed up Look up only 1 URI from each CRSFollow only coref:canon predicate</p><p>Lookup would need O(log|S|+ log|T|)</p></li><li><p>References[1] The PROMPT Suite: Interactive Tools For Ontology Merging And Mapping Natalya F. Noy and Mark A. Musen;Stanford Medical Informatics, Stanford University</p><p>[2] Managing Co-reference on the Semantic Web - Hugh Glaser, Afraz Jaffri, Ian C. Millard School of Electronics and Computer Science University of Southampton Southampton, Hampshire, UK</p><p>[3] Ontology Mapping: The State of the Art Yannis Kalfoglou and Marco Schorlemmer</p><p>[4] Kalfoglou, Y. and Schorlemmer, M. (2003a). IFMap: an ontology mapping method based on information flow theory. Journal on Data Semantics, 1(1):98127.</p><p>[5] Silk A Link Discovery Framework for the Web of Data Julius Volz, Christian Bizer et al.</p></li></ul>


View more >