mapping concept hub berlin

Upload: robert-allen-rippey

Post on 07-Jan-2016

224 views

Category:

Documents


0 download

DESCRIPTION

atfj4imnvnm

TRANSCRIPT

  • Semantic Mappingthrough a concept hubDagobert SoergelCollege of Information Studies, University of Maryland Department of Library and Information Studies, University at Buffalo

  • *Hub

    Water transport Inland water transport Ocean transport

    Traffic station Water transport Traffic station Inland water tr. Traffic station Ocean transport

    Dewey

    387 Water, air, space transportation386 Inland waterway & ferry transportation387.5 Ocean transportation

    386.8 Inland waterway tr. > Ports387.1 PortsLCSH

    Shipping Inland water transportMerchant marine

    Harbors

    GermanHafenMapping through a Hub

  • *OutlineObjective: Interoperability PlusKOS concept hubMethod: Knowledge-based, computer-assisted of canonical representations of conceptsResulting knowledge base and applications

  • *ObjectiveImprove semantic-based search of digital content across multiple collections in multiple languages.Interoperability between any two participating KOS (Knowledge Organization Systems) Support for search, esp. facet-based search for any collection indexed by a participating KOSfor free-text searchAssistance in cataloging (metadata creation) by catalogers or users (social tagging)Long-range goal: Web service where a KOS can be uploaded and mappings to specified target KOS are returned

  • *KOS Concept HubThe backbone of the proposed system is a faceted core classification of atomic concepts together with a set of relationshipsInteroperability is achieved by expressing concepts from all participating KOS as a canonical representation: description logic formula using atomic concepts and relationshipsMapping from KOS to KOS is achieved by reasoning over these canonical representations

  • *Hub

    Water transport Inland water transport Ocean transport

    Traffic station Water transport Traffic station Inland water tr. Traffic station Ocean transport

    Dewey

    387 Water, air, space transportation386 Inland waterway & ferry transportation387.5 Ocean transportation

    386.8 Inland waterway tr. > Ports387.1 PortsLCSH

    Shipping Inland water transportMerchant marine

    Harbors

    GermanHafenMapping through a Hub

  • *Method: How to get DL formulasKey: Efficient creation of canonical representations (DL formulas) Apply existing knowledge: Large knowledge base less effort for processing a new KOSUse knowledge of KOS structure for hierarchical inheritanceUse linguistic analysis of terms and captions Eliminate redundant atomic conceptsCheck or produce mapping results from assignment of concepts to the same recordsGet human editors input and verification where needed through a user-friendly interfaceKOS owners may verify and edit data pertaining to their KOS

  • *Knowledge baseRequires an ever larger classification and lexical knowledge base containing many kinds of data:A faceted classification of atomic concepts Seeded from sources with well-developed facets such as the AOD Thesaurus, the Harvard Business Thesaurus, the Art and Architecture Thesaurus, various ontologiesLinguistic knowledge bases such as Wordnet and mono-,bi-, and multi-langual dictionaries and thesauriMany KOS (Knowledge Organization Systems), such as LCC, DDC, DMOZ directory, LCSH, Gene Ontology, SchlagwortnormdateiThese will over time be fused into one large multilingual knowledge base with many terminological and translation relationships and relationships linking terms to concepts, with an increasing number of concepts semantically represented by a DL formula.

  • *Examples of derivingDL formulas

  • * Underlying faceted classification

    L00Transportation and trafficL10Traffic system components L13Traffic facilities L15Traffic stations L17VehiclesL30Modes of transportation L33Air transportL37Water transportP00Buildings, constructionP23BuildingsP27Architecture P43ConstructionR00EngineeringR30AcousticsR37SoundproofingT70Military vs. civilian T73Military T77Civilian

  • * Method: Assigning atomic concepts 1

    HE TransportationHE550-560 Ports, harbors, docks, wharves, etc. L00 Transportation and traffic T77 CivilianInherited: L00 Transportation and traffic T77 CivilianAdded by editor:L15 Traffic stations L37 Water transportResolved to:L15 Traffic stations L37 Water transport T77 Civilian

  • * Method: Assigning atomic concepts 2

    NA6300-6307 Airport buildings

    From database already established:Airport = L15 Traffic stations L33 Air transport Buildings = P23 BuildingsAdded by editor T77 CivilianResolved toL15 Traffic stations L33 Air transport P23 Buildings T77 Civilian

  • * Method: Assigning atomic concepts 3

    TL681.S6 Airplanes. Soundproofing

    From database already established:Airplane = L17 Vehicles L33 Air transport Soundproofing = R37 SoundproofingAdded by editor: NothingResolved toL17 Vehicles L33 Air transport R37 Soundproofing

  • * Method: Assigning atomic concepts 4

    Aeroplanes-Soundproofing

    From database already established:Aeroplanes = Airplane [Spelling variant]

    ThereforeTerm is recognized as same asAirplanes. SoundproofingResolved toL17 Vehicles L33 Air transport R37 Soundproofing

  • * Method: Assigning atomic concepts 5

    Any class formed by geographical subdivisionSuch asNA6300-6307 Airport buildingsNA6305.E3 Egypt

    Recognized using a dictionary of geographical namesInherits from subject class above it; simply add the countryL15 Traffic stations L33 Air transport P23 Buildings T77 Civilian Egypt

    No editor checking needed

  • *Examples from the resulting knowledge base

  • *

    HE550-560 Ports, harbors, docks, wharves, etc.NA2800 Architectural acousticsNA6300-6307 Airport buildingsNA6330 Dock buildings, ferry houses, etc.TC350-374 Harbor works

    TH1725 Soundproof constructionTL681.S6 Airplanes. SoundproofingTL725-726 Airways (Routes). Airports and landing fields. Aerodromes VA67-79 Naval ports, bases, reservations, docksVM367.S6 Submarines. Soundproofing =L15 Traffic stations L37 Water transport T77 Civilian=P27 Architecture R30 Acoustics=L15 Traffic stations L33 Air transport P23 Buildings T77 Civilian=L15 Traffic stations L37 Water transport P23 Buildings T77 Civilian=L15 Traffic stations L37 Water transport R00 Engineering T77 Civilian=P23 Buildings P43 Construction R37 Soundproofing=L17 Vehicles L33 Air transport R37 Soundproofing=L13 Traffic facilities L33 Air transport Technical aspects=L15 Traffic stations L37 Water transport T73 Military=L17 Vehicles L37 Water transport R37 Soundproofing T73 Military Underwater

    Soergel,

  • *LC subject headings with combinations of atomic concepts

    Aeroplanes-Soundproofing Airports-Buildings Buildings-Soundproofing Ships-Soundproofing =L17 Vehicles L33 Air transport R37 Soundproofing=P23 Buildings L15 Traffic stations L33 Air transport=P23 Buildings P43 Construction R37 Soundproofing=L17 Vehicles L37 Water transport R37 Soundproofing

    Soergel,

  • *Hub

    L17 Vehicles L33 Air transport R37 Soundproofing

    L17 Vehicles L37 Water transport R37 Soundproofing

    L17 Vehicles L37 Water transport R37 Soundproofing T73 Military Underwater

    LCC

    TL681.S6 Airplanes. Soundproofing

    VM367.S6 Submarines. Soundproofing LCSH

    Aeroplanes-Soundproofing

    Ships-SoundproofingMapping through a Hub

  • *Hub

    Canonical form of query(DL formula)

    User queryFree textCombination of elemental concepts through facets (guided query formulation)Controlled term(s) from a KOS, possibly found through browsing a KOS Final query

    (Enriched) free text query

    Query in terms of a KOSMapping user queries

  • *Query: L17 Vehicles AND R37 Soundproofing

    TL681.S6 Airplanes. Soundproofing

    VM367.S6 Submarines. Soundproofing Aeroplanes-Soundproofing Ships-Soundproofing

    [L17 Vehicles L33 Air transport R37 Soundproofing][L17 Vehicles L37 Water transport R37 Soundproofing Military][L17 Vehicles L33 Air transport R37 Soundproofing][L17 Vehicles L37 Water transport R37 Soundproofing]

  • *Examples from NALT and LCSHNALTNational Agricultural Library ThesaurusLCSHLibrary of Congress Subject Headings

  • *Air pollution lawsLCSH termAir Pollution Laws and regulations[isa] Legal rule [appliedTo] {[isa] Condition [isConditionOf] Air [causedBy] Pollutant [property] Undesirable}NALT termsAir pollution [isa] Condition [isConditionOf] Air [causedBy] Pollutant [property] UndesirableLaws and regulations[isa] Legal rule Mapping LCSH NALTAir Pollution Laws and regulations Air pollution ANDLaws and regulationsInterpretation for indexing and searching in both directions

  • *Soil moisture vs. Soil waterLCSH termSoil moisture[isa] Water [containedIn] SoilNALT termSoil water[isa] Water [containedIn] SoilMapping LCSH NALTSoil moisture Soil water

  • *Greenhouse gardeningLCSH termGreenhouse gardening[isa] Gardening [inEnvironment] Greenhouse [inEnvironment] HomeNALT termsHome gardening [isa] Gardening [inEnvironment] HomeGreenhouse[isa] GreenhouseMapping LCSH NALTGreenhouse gardening Home gardening ANDGreenhouse

  • *Salad greensLCSH termSalad greens[isa] Green leafy vegetable [usedFor] SaladNALT termGreen leafy vegetables [isa] Green leafy vegetable Mapping LCSH NALTSalad greens BT Green leafy vegetables

  • *Emerging diseasesLCSH termEmerging infectious diseases[isa] Disease [hasProperty] Infectious [hasProperty] EmergingNALT termEmerging diseases[isa] Disease [hasProperty] Infectious ??? [hasProperty] Emerging

    Mapping LCSH NALTEmerging infectious diseases Emerging diseasesEmerging infectious diseases BT Emerging diseases

  • *Distributed implementationA KOS on the Web could assign DL formulas to its concepts let's call this a semantically enhanced KOS or SEKOSCould use any of a number of faceted core classifications or even several (using a unique URI for each elemental concept)Core classifications could be mapped to each otherIt is now a simple matter to map from any SEKOS to any other (somewhat dependent on the core classifications used)

  • *Take-home messageSemantics gives powerful systemsSemantik schafft maechtige Systeme

  • *LC

  • *This project will achieve the followingInteroperability between any two participating Knowledge Organization Systems (KOS) (to the extent the two schemes allow)Facet-based search for any collection indexed by a participating KOSfor free-text searchAssistance in cataloging (metadata creation) by catalogers or users (social tagging)Long-range goal: Web service where a KOS can be uploaded and mappings to specified target KOS are returnedMeansCreate a comprehensive knowledge base relating many classification schemes and subject heading lists used in libraries and in other contexts (LCC, DDC, DMOZ directory, LCSH, European schemes).Use combinations of atomic concepts taken from a well-structured underlying faceted classification to represent the meaning of classes and subject headings.

  • *

  • *Hub

    Water transport Inland water transport Ocean transport

    Traffic station Water transport Traffic station Inland water tr. Traffic station Ocean transport

    Dewey

    387 Water, air, space transportation386 Inland waterway & ferry transportation387.5 Ocean transportation

    386.8 Inland waterway tr. > Ports387.1 PortsLCSH

    Shipping Inland water transportMerchant marine

    Harbors

    GermanHafenMapping through a Hub

  • *Hub

    LCC LCSH

    Mapping through a Hub

  • Koeln 20090706ThemenRole indicators for building themesarrangement of themes for exploration under user controlcarry-over from citation orderPractical problem of connection to the participating systems should use IDs for combinations in Hub. Make sure that hub stays consistent with participating systems.*

    *