lodlam meeting melbourne museum

10
Department of Parliamentary Services Parliamentary Library Semantic Web Technologies at the Victorian Parliamentary Library Peter Neish, Systems Officer Victorian Parliamentary Library @peterneis h

Upload: peter-neish

Post on 16-May-2015

1.663 views

Category:

Education


1 download

DESCRIPTION

Lightning talk given at the LOD-LAM (Linked Open Data in Libraries, Archives and Museums) workshop in Melbourne on April 18, 2012.

TRANSCRIPT

Page 1: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Semantic Web Technologies at the Victorian Parliamentary Library

Peter Neish, Systems OfficerVictorian Parliamentary Library

@peterneish

Page 2: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

The Library• Established in 1851

• Clients:

– Members of Parliament and their staff

– Department of Parliamentary Services (especially committees)

– Academics and public

• Historical information, Hansard, parliamentary papers, government agencies, member biographies

• Provide current information to members e.g. news clips, video, journal articles, media releases

Page 3: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Number of Media Releases per year

0

1000

2000

3000

4000

5000

6000

7000

Page 4: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Semantic Tagging

• Increasing number of media releases meant that manual indexing was too time consuming

• Examined ways of automatically tagging media releases without human intervention

• Web services examined:

– Alchemy API– Evri– OpenAmplify– OpenCalais– Yahoo Term Extractor– Zemanta

Page 5: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Open Calais

• Product of Thomson Reuters – focus is on news articles

• Generous limits on API calls

• Data in RDF/XML, N3, Simple Text, Microformats, JSON

• Good documentation and community

• http://viewer.opencalais.com/

However: closed box (algorithm secret), recently company appear to have scaled back development

Page 6: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Number of Tags assigned by OpenCalais

0

500

1000

1500

2000

2500

3000

3500

4000

4500

0 20 40 60 80 100 120

Tags per item

To

tal

nu

mb

er

Page 7: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

85%

4%

6%5%

Correct Tags

Incorrect Tags

Repeated Tags

Redundant Tags

Tag Quality

Page 8: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Open Calais RDF

• OpenCalais links to its own ontology (rich in data for companies, but other classes have limited data)

• RDF has a lot of N-ary relationships (up to 1000 triple statements per article)

• SameAs or web links to:

– DBpedia, Wikipedia, Freebase, Reuters.com, GeoNames, Shopping.com, IMDB, LinkedMDB

Page 9: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

User interface

Page 10: LODLAM meeting Melbourne Museum

Department of Parliamentary ServicesParliamentary Library

Current Projects using Linked Data

• Government Agencies Database

• Parliamentary Papers

• Images