lodlam meeting melbourne museum
DESCRIPTION
Lightning talk given at the LOD-LAM (Linked Open Data in Libraries, Archives and Museums) workshop in Melbourne on April 18, 2012.TRANSCRIPT
Department of Parliamentary ServicesParliamentary Library
Semantic Web Technologies at the Victorian Parliamentary Library
Peter Neish, Systems OfficerVictorian Parliamentary Library
@peterneish
Department of Parliamentary ServicesParliamentary Library
The Library• Established in 1851
• Clients:
– Members of Parliament and their staff
– Department of Parliamentary Services (especially committees)
– Academics and public
• Historical information, Hansard, parliamentary papers, government agencies, member biographies
• Provide current information to members e.g. news clips, video, journal articles, media releases
Department of Parliamentary ServicesParliamentary Library
Number of Media Releases per year
0
1000
2000
3000
4000
5000
6000
7000
Department of Parliamentary ServicesParliamentary Library
Semantic Tagging
• Increasing number of media releases meant that manual indexing was too time consuming
• Examined ways of automatically tagging media releases without human intervention
• Web services examined:
– Alchemy API– Evri– OpenAmplify– OpenCalais– Yahoo Term Extractor– Zemanta
Department of Parliamentary ServicesParliamentary Library
Open Calais
• Product of Thomson Reuters – focus is on news articles
• Generous limits on API calls
• Data in RDF/XML, N3, Simple Text, Microformats, JSON
• Good documentation and community
• http://viewer.opencalais.com/
However: closed box (algorithm secret), recently company appear to have scaled back development
Department of Parliamentary ServicesParliamentary Library
Number of Tags assigned by OpenCalais
0
500
1000
1500
2000
2500
3000
3500
4000
4500
0 20 40 60 80 100 120
Tags per item
To
tal
nu
mb
er
Department of Parliamentary ServicesParliamentary Library
85%
4%
6%5%
Correct Tags
Incorrect Tags
Repeated Tags
Redundant Tags
Tag Quality
Department of Parliamentary ServicesParliamentary Library
Open Calais RDF
• OpenCalais links to its own ontology (rich in data for companies, but other classes have limited data)
• RDF has a lot of N-ary relationships (up to 1000 triple statements per article)
• SameAs or web links to:
– DBpedia, Wikipedia, Freebase, Reuters.com, GeoNames, Shopping.com, IMDB, LinkedMDB
Department of Parliamentary ServicesParliamentary Library
User interface
Department of Parliamentary ServicesParliamentary Library
Current Projects using Linked Data
• Government Agencies Database
• Parliamentary Papers
• Images