linked logainm: enhancing library metadata using linked data of irish place names

30
Digital Enterprise Research Institute www.deri.ie Enabling networked knowledge Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names Nuno Lopes Rebecca Grant Brian Ó Raghallaigh Eoghan Ó Carragáin Sandra Collins Stefan Decker September 26, 2013

Upload: nunoalexandrelopes

Post on 04-Dec-2014

315 views

Category:

Technology


1 download

DESCRIPTION

Presentation at the First Workshop on Linking and Contextualizing Publications and Datasets

TRANSCRIPT

Page 1: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Digital Enterprise Research Institute www.deri.ie

Enabling networked knowledge

Linked Logainm: Enhancing Library Metadatausing Linked Data of Irish Place Names

Nuno Lopes Rebecca Grant Brian Ó Raghallaigh Eoghan ÓCarragáin Sandra Collins Stefan Decker

September 26, 2013

Page 2: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

logainm.ie

The authority list of Irish placenames, validated by thePlacenames Branch.

Delivering a more detailed levelthan in DBpedia, Geonames.

Unique source of Irish languageplace names

But.. not easily accessibleautomatically

1 / 13

Page 3: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

logainm.ie

The authority list of Irish placenames, validated by thePlacenames Branch.

Delivering a more detailed levelthan in DBpedia, Geonames.

Unique source of Irish languageplace names

But.. not easily accessibleautomatically

1 / 13

Page 4: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

The NLI Longfield Map Collection

The Longfield Maps are a set of 1,570 surveys carried out inIreland between 1770 and 1840.

Currently catalogued in MarcXML

Integrating Logainm data into their workflow:for enabling searching for place names in Irish

using Linked Data

2 / 13

Page 5: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Longfield Map example

MARC/XML<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield>

3 / 13

Page 6: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Longfield Map example

MARC/XML<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield>

3 / 13

Page 7: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Approach for creating the dataset

1 Translate Logainm database dump into RDF

2 Determine links to other datasets based on:Place namesTypeGeographical coordinatesHierarchy of places

3 Evaluation of generated links

4 Library catalogue enhancement

4 / 13

Page 8: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Overview of GLD

Providers:DBpedia

Exported from WikipediaLinkedGeoData

Exported fromOpenStreetMap

GeoNames

GeoLinkedDataOrdnance Survey

Vocabularies:W3C Geo

SpatialThingNeoGeo

Feature vs GeometrySpatial Relations(is_part_of)

Most providers define their own

5 / 13

Page 9: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Overview of GLD

Providers:DBpedia

Exported from WikipediaLinkedGeoData

Exported fromOpenStreetMap

GeoNamesGeoLinkedDataOrdnance Survey

Vocabularies:W3C Geo

SpatialThingNeoGeo

Feature vs GeometrySpatial Relations(is_part_of)

Most providers define their own

5 / 13

Page 10: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Overview of GLD

Providers:DBpedia

Exported from WikipediaLinkedGeoData

Exported fromOpenStreetMap

GeoNamesGeoLinkedDataOrdnance Survey

Vocabularies:W3C Geo

SpatialThingNeoGeo

Feature vs GeometrySpatial Relations(is_part_of)

Most providers define their own

5 / 13

Page 11: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

1. Converting Logainm dump to RDF

SPA QLML

XDF

R

∼ 1.3M triples

Data provided in XML

Translated to RDF using XSPARQL

Exposed using Openlink Virtuoso

6 / 13

Page 12: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

1. Converting Logainm dump to RDF

SPA QLML

XDF

R

∼ 1.3M triples

Data provided in XML

Translated to RDF using XSPARQL

Exposed using Openlink Virtuoso

6 / 13

Page 13: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

1. Converting Logainm dump to RDF

SPA QLML

XDF

R

∼ 1.3M triples

Data provided in XML

Translated to RDF using XSPARQL

Exposed using Openlink Virtuoso

6 / 13

Page 14: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Linked Logainm

http://lod-cloud.net/

Government

Media

User-generated

Publications

Life sciencesCross-domain

GeoLogainm

OCLC FAST

7 / 13

Page 15: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Linked Logainm

http://lod-cloud.net/

Government

Media

User-generated

Publications

Life sciencesCross-domain

GeoLogainm

OCLC FAST

7 / 13

Page 16: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Linked Logainm

http://lod-cloud.net/

Government

Media

User-generated

Publications

Life sciencesCross-domain

GeoLogainm

OCLC FAST

7 / 13

Page 17: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

2. Place name matching using Silk

1 Place NameIsland, Cavan: 2641 "Place"s inDBpediaAirport, Dublin: 7828

2 Geographical Location

∼50% of place names in logainmcontain geographical information

3 Name of the county / parent placename

4 Mapping of types from Logainm totypes in other datasets

logainm.ie DBpedia LinkedGeoData Geonames

townlandPopulatedPlace

LocalityLCTY,PPLF

8 / 13

Page 18: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

2. Place name matching using Silk

1 Place NameIsland, Cavan: 2641 "Place"s inDBpediaAirport, Dublin: 7828

2 Geographical Location∼50% of place names in logainmcontain geographical information

3 Name of the county / parent placename

4 Mapping of types from Logainm totypes in other datasets

logainm.ie DBpedia LinkedGeoData Geonames

townlandPopulatedPlace

LocalityLCTY,PPLF

8 / 13

Page 19: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

2. Place name matching using Silk

1 Place NameIsland, Cavan: 2641 "Place"s inDBpediaAirport, Dublin: 7828

2 Geographical Location∼50% of place names in logainmcontain geographical information

3 Name of the county / parent placename

4 Mapping of types from Logainm totypes in other datasets

logainm.ie DBpedia LinkedGeoData Geonames

townlandPopulatedPlace

LocalityLCTY,PPLF

8 / 13

Page 20: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

2. Place name matching using Silk

1 Place NameIsland, Cavan: 2641 "Place"s inDBpediaAirport, Dublin: 7828

2 Geographical Location∼50% of place names in logainmcontain geographical information

3 Name of the county / parent placename

4 Mapping of types from Logainm totypes in other datasets

logainm.ie DBpedia LinkedGeoData Geonames

townlandPopulatedPlace

LocalityLCTY,PPLF

8 / 13

Page 21: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

3. Silk results

Entities IE # Links % LinksDBpedia1 10,715 1,552 14.5LinkedGeoData2 36,237 6,611 18GeoNames3 23,102 8,229 35.5

Links in other datasets

Entities # Links % LinksDBpedia 873,643 653,7074 74.84LinkedGeoData 6,251,067 462,098 7,4

1Entities of type “Place” or “Feature”2Entities of type “Node”3No hierarchy info4Including internal & Freebase links

9 / 13

Page 22: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

3. Silk results

Entities IE # Links % LinksDBpedia1 10,715 1,552 14.5LinkedGeoData2 36,237 6,611 18GeoNames3 23,102 8,229 35.5

Links in other datasets

Entities # Links % LinksDBpedia 873,643 653,7074 74.84LinkedGeoData 6,251,067 462,098 7,4

1Entities of type “Place” or “Feature”2Entities of type “Node”3No hierarchy info4Including internal & Freebase links

9 / 13

Page 23: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Evaluation Results

Links Checked CorrectDBpedia 1,552 1,552 (100%) 98%LinkedGeoData 6,611 500 (7.5%) 96%GeoNames 8,229 500 (6%) 99%

Same place names can be “towns”, “population centre”, and“townland” in logainm.ie. DBpedia contains only one entry:

Adrigole (population centre) and Adrigole (townland)http://dbpedia.org/resource/Adrigole

Similar for LinkedGeoData

10 / 13

Page 24: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Longfield Map example (Updated)

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield>

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield><marc:datafield tag="651" ind2="7" ind1=""><marc:subfield code="2">logainm.ie</marc:subfield><marc:subfield code="a">Rathdown</marc:subfield><marc:subfield code="0">http://data.logainm.ie/place/283</marc:subfield>

</marc:datafield>

11 / 13

Page 25: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Longfield Map example (Updated)

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield>

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield><marc:datafield tag="651" ind2="7" ind1=""><marc:subfield code="2">logainm.ie</marc:subfield><marc:subfield code="a">Rathdown</marc:subfield><marc:subfield code="0">http://data.logainm.ie/place/283</marc:subfield>

</marc:datafield>

11 / 13

Page 26: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Longfield Map example (Updated)

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield>

<marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land tenure</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Rathdown (Barony)</marc:subfield>

</marc:datafield><marc:datafield tag="650" ind1="" ind2=""><marc:subfield code="a">Land use surveys</marc:subfield><marc:subfield code="z">Ireland</marc:subfield><marc:subfield code="z">Wicklow (County)</marc:subfield>

</marc:datafield><marc:datafield tag="651" ind2="7" ind1=""><marc:subfield code="2">logainm.ie</marc:subfield><marc:subfield code="a">Rathdown</marc:subfield><marc:subfield code="0">http://data.logainm.ie/place/283</marc:subfield>

</marc:datafield>

11 / 13

Page 27: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Demo page:http://apps.dri.ie/locationLODer

12 / 13

Page 28: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Conclusions

Creation of a new Linked Data geographical DatasetLinking to other publicly available datasetsEnhancing of NLI’s MARC/XML records

Future workImprove the Silk matching rules to obtain better matching

Street level matching

Enhancing the NLI’s cataloguing system (VuFind)

Thank you! Questions?

13 / 13

Page 29: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Conclusions

Creation of a new Linked Data geographical DatasetLinking to other publicly available datasetsEnhancing of NLI’s MARC/XML records

Future workImprove the Silk matching rules to obtain better matching

Street level matching

Enhancing the NLI’s cataloguing system (VuFind)

Thank you! Questions?

13 / 13

Page 30: Linked Logainm: Enhancing Library Metadata using Linked Data of Irish Place Names

Conclusions

Creation of a new Linked Data geographical DatasetLinking to other publicly available datasetsEnhancing of NLI’s MARC/XML records

Future workImprove the Silk matching rules to obtain better matching

Street level matching

Enhancing the NLI’s cataloguing system (VuFind)

Thank you! Questions?

13 / 13