Building a National Ontology Infrastructure - a presentation at SWIB2013

Upload: matias-frosterus

Post on 27-Dec-2014

DESCRIPTION

Describes Finland's national effort to build an ontology service and the linked ontology approach, and reflects on the difficulties of developing multilingual ontologies.

TRANSCRIPT

Page 1: Building a National Ontology Infrastructure - a presentation at SWIB2013

THE NATIONAL LIBRARY OF FINLAND – Library Network Services

Building a National Ontology Infrastructure

Matias Frosterus, Mirja Anttila, Mikko Lappalainen, Susanna Nykyri, Tuomas Palonen, Sini Pessala

SWIB 2013

Page 2: Building a National Ontology Infrastructure - a presentation at SWIB2013

This presentation

Overview of the ONKI project

Linked ontology approach

Trilingual ontology

Page 3: Building a National Ontology Infrastructure - a presentation at SWIB2013

ONKI project

A joint project of the National Library of Finland, the Ministry of Finance and the Ministry of Education and Culture

The aim is to build a reliable, centralized, national ontology service named Finto

Page 4: Building a National Ontology Infrastructure - a presentation at SWIB2013

ONKI project

What does the ONKI project offer?

Publication of ontologies

Using ontologies in applications through various interfaces

The development of the General Finnish Upper Ontology YSO

Coordination of ontology work on a national scale

Improving interoperability across the spectrum by harmonizing annotations

Page 5: Building a National Ontology Infrastructure - a presentation at SWIB2013

ONKI project

Based on the FinnONTO research project, which ran at Aalto University and the University of Helsinki from 2003 to 2012

Focus on lightweight SKOS ontologies intended for annotations

Powered by ONKI Light

Open source

https://code.google.com/p/onki-light/

Page 6: Building a National Ontology Infrastructure - a presentation at SWIB2013

ONKI Light

Finto ontology service: Guidelines and support, Browsing interfaces, General Upper Ontology, Ontology publication

Users: Ontology developers, Annotators, Application developers, End users

Page 7: Building a National Ontology Infrastructure - a presentation at SWIB2013

The second part

Linked ontology approach

Page 8: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: two silos, each containing data, metadata and a thesaurus]

What we have:

Silos

Expert-made thesauri

A large amount of data and annotations

Page 9: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: two stacks of data, metadata and a thesaurus]

What we want:

Eliminate the silos

Harmonize the annotations

Page 10: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: two stacks of data and metadata, each with an ontology]

How?

Ontologies are much easier to link together than thesauri

Concepts as opposed to terms

Explicit relations

Page 11: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: five stacks of data and metadata, each with an ontology]

The problem:

Page 12: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: five stacks of data and metadata, each with an ontology]

The problem:

A lot of work!

[Speech bubbles: "I have an update!", "I must react!", "Me too!", "Me too!", "Me too!"]

Page 13: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: five stacks of data and metadata, each with a domain ontology, plus a General Upper Ontology]

The approach:

Limit the links between the ontologies
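
A rough way to see why limiting the links reduces the work: if every ontology were mapped directly to every other one, the number of mappings to maintain would grow quadratically, whereas linking each domain ontology only to a general upper ontology grows linearly. The sketch below is illustrative only; it assumes one mapping per pair of ontologies, and the function names are invented for this example and are not part of ONKI or Finto.

```python
def pairwise_links(n_ontologies: int) -> int:
    """Mappings needed if every ontology is linked directly to every other one."""
    return n_ontologies * (n_ontologies - 1) // 2


def hub_links(n_domain_ontologies: int) -> int:
    """Mappings needed if each domain ontology links only to one upper ontology."""
    return n_domain_ontologies


if __name__ == "__main__":
    for n in (5, 10, 15):
        print(f"{n} domain ontologies: "
              f"{pairwise_links(n)} pairwise links vs {hub_links(n)} links via the upper ontology")
```

With five domain ontologies, as in the diagram, that is 10 pairwise mappings against 5 links to the upper ontology, and the gap widens as more domain ontologies are added.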

Page 14: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach

[Diagram: five stacks of data and metadata, each with a domain ontology (JUHO, TERO, LIITO, MAO, AFO), plus YSO and KOKO]

In practice:

Page 15: Building a National Ontology Infrastructure - a presentation at SWIB2013

Linked ontology approach: KOKO

Ontology  Domain                  Concepts
YSO       General upper ontology    24 800
MAO       Museum artifacts           6 800
MUSO      Music                      1 000
TAO       Design                     3 000
TERO      Health                     6 500
VALO      Photography                2 000
AFO       Agriculture                7 000
JUHO      Government                 6 300
KAUNO     Literature                 5 000
KTO       Linguistics                  900
KITO      Literary research            850
KULO      Cultural research          1 500
LIITO     Economics                  3 000
MERO      Seafaring                  1 300
PUHO      Military                   2 000

Page 16: Building a National Ontology Infrastructure - a presentation at SWIB2013

Challenges to be tackled

Propagating the changes in the general upper ontology to the domain ontologies

Locating the overlapping concepts between the domain ontologies

Not always simple

Labels might be misleading

Ontological structure can help

Coordinating the use and development of ontologies on a national level
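
The point about misleading labels and helpful structure can be illustrated with a small sketch. It pairs concepts from two ontologies whose labels match and then checks whether they sit under the same upper-ontology concept. All identifiers and data here are invented for illustration and are not taken from any Finto ontology; a real matcher would need far more than an exact label comparison.

```python
from typing import Dict, List, Tuple

# Hypothetical toy data: concept id -> (label, upper-ontology concept it is placed under)
Ontology = Dict[str, Tuple[str, str]]

onto_a: Ontology = {
    "a:1": ("bank", "upper:organisation"),
    "a:2": ("loan", "upper:agreement"),
}
onto_b: Ontology = {
    "b:1": ("bank", "upper:landform"),       # river bank: same label, different meaning
    "b:2": ("bank", "upper:organisation"),   # financial institution
}


def overlap_candidates(a: Ontology, b: Ontology) -> List[Tuple[str, str, bool]]:
    """Pair concepts with equal labels and flag whether their upper-level parents agree."""
    result = []
    for id_a, (label_a, parent_a) in a.items():
        for id_b, (label_b, parent_b) in b.items():
            if label_a.lower() == label_b.lower():
                result.append((id_a, id_b, parent_a == parent_b))
    return result


if __name__ == "__main__":
    for id_a, id_b, same_parent in overlap_candidates(onto_a, onto_b):
        verdict = "likely the same concept" if same_parent else "label match only"
        print(id_a, id_b, verdict)
```

Here the label alone would match "a:1" with both "b:1" and "b:2"; the shared upper-level parent is what singles out "b:2" as the plausible overlap.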

Page 17: Building a National Ontology Infrastructure - a presentation at SWIB2013

The third part

Trilingual ontology

Page 18: Building a National Ontology Infrastructure - a presentation at SWIB2013

Ontology design

The relations between concepts can be designed in several ways

What affects these choices?

Corpora

Language

Culture

Page 19: Building a National Ontology Infrastructure - a presentation at SWIB2013

Trilingual ontology

In practice

YSO: General Finnish Upper Ontology

"Finnish" as a culture

Finland has two official languages: Finnish and Swedish

Very different from one another

Lingua franca: English

Page 20: Building a National Ontology Infrastructure - a presentation at SWIB2013

YSO

Topmost hierarchy is inspired by DOLCE

Offers the general concepts needed for annotation in many domains

Complemented with a number of domain ontologies for specific use cases

Based on the General Finnish Thesaurus YSA

Used and developed for decades in the annotation of all Finnish non-fiction literature

Page 21: Building a National Ontology Infrastructure - a presentation at SWIB2013

Language affects the hierarchy

Finnish word 'siirto' means transfer

[Diagram: maansiirto, hiustensiirto and voimansiirto, each linked to siirto by skos:broader]

Page 22: Building a National Ontology Infrastructure - a presentation at SWIB2013

Language affects the hierarchy

Finnish word 'siirto' means transfer

[Diagram: the same hierarchy with English labels: earthmoving, hair transplantation and power transmission, each linked to transfer by skos:broader]
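
The contrast between the two diagrams can be made concrete with a small sketch. It models each concept with one label per language and a single broader concept, then checks whether the label of the narrower concept contains the label of the broader one. The `Concept` class and `shares_head_word` function are invented for this illustration and only loosely mirror SKOS labels and skos:broader.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Concept:
    labels: Dict[str, str]                 # language code -> preferred label
    broader: Optional["Concept"] = None    # loosely mirrors skos:broader


siirto = Concept({"fi": "siirto", "en": "transfer"})
children = [
    Concept({"fi": "maansiirto", "en": "earthmoving"}, broader=siirto),
    Concept({"fi": "hiustensiirto", "en": "hair transplantation"}, broader=siirto),
    Concept({"fi": "voimansiirto", "en": "power transmission"}, broader=siirto),
]


def shares_head_word(concept: Concept, lang: str) -> bool:
    """True if the concept's label contains the label of its broader concept."""
    parent = concept.broader
    return parent is not None and parent.labels[lang] in concept.labels[lang]


if __name__ == "__main__":
    for lang in ("fi", "en"):
        print(lang, [shares_head_word(c, lang) for c in children])
```

In Finnish every narrower label contains "siirto", so the grouping looks natural; in English none of the labels contains "transfer", which is why the same hierarchy is far less obvious to an English-speaking annotator.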

Page 23: Building a National Ontology Infrastructure - a presentation at SWIB2013

Language affects the hierarchy

Finnish has a single concept for rivers

Swedish has three

Älv = Scandinavian river situated north of Göta älv (a specific river)

Å = Scandinavian river situated south of Göta älv

Flod = non-Scandinavian river

A distinction not used in Finland
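
The mismatch can be sketched as a many-to-one mapping. The functions below encode the three Swedish terms exactly as defined on this slide and a single Finnish term for all rivers. The Finnish word "joki" (river) is not on the slide and is supplied here; the function names and the boolean parameters are assumptions made for the example, and the sketch says nothing about how YSO actually models these concepts.

```python
def swedish_river_term(in_scandinavia: bool, north_of_gota_alv: bool = False) -> str:
    """Pick the Swedish term using the simplified definitions from the slide."""
    if not in_scandinavia:
        return "flod"
    return "älv" if north_of_gota_alv else "å"


def finnish_river_term(in_scandinavia: bool, north_of_gota_alv: bool = False) -> str:
    """Finnish uses one concept regardless of where the river is."""
    return "joki"


if __name__ == "__main__":
    cases = [(False, False), (True, True), (True, False)]
    for in_scandinavia, north in cases:
        print(swedish_river_term(in_scandinavia, north), "->",
              finnish_river_term(in_scandinavia, north))
```

Going from Swedish to Finnish loses nothing that Finnish annotators need, but going the other way requires geographic information that the Finnish concept does not carry.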

Page 24: Building a National Ontology Infrastructure - a presentation at SWIB2013

Culture before language

Looking beyond the language

Realizing that language does affect the way we perceive the world

Building an ontology for a specific cultural sphere

Key to the harmonization of different annotations in different domains

Page 25: Building a National Ontology Infrastructure - a presentation at SWIB2013

The development of YSO

Mapping to other ontologies

Mapping to LCSH is underway

Building the guidelines for the development

How to choose the correct approach when a language clash leads to a concept clash?