Download - LODLAM meeting Melbourne Museum
Department of Parliamentary ServicesParliamentary Library
Semantic Web Technologies at the Victorian Parliamentary Library
Peter Neish, Systems OfficerVictorian Parliamentary Library
@peterneish
Department of Parliamentary ServicesParliamentary Library
The Library• Established in 1851
• Clients:
– Members of Parliament and their staff
– Department of Parliamentary Services (especially committees)
– Academics and public
• Historical information, Hansard, parliamentary papers, government agencies, member biographies
• Provide current information to members e.g. news clips, video, journal articles, media releases
Department of Parliamentary ServicesParliamentary Library
Number of Media Releases per year
0
1000
2000
3000
4000
5000
6000
7000
Department of Parliamentary ServicesParliamentary Library
Semantic Tagging
• Increasing number of media releases meant that manual indexing was too time consuming
• Examined ways of automatically tagging media releases without human intervention
• Web services examined:
– Alchemy API– Evri– OpenAmplify– OpenCalais– Yahoo Term Extractor– Zemanta
Department of Parliamentary ServicesParliamentary Library
Open Calais
• Product of Thomson Reuters – focus is on news articles
• Generous limits on API calls
• Data in RDF/XML, N3, Simple Text, Microformats, JSON
• Good documentation and community
• http://viewer.opencalais.com/
However: closed box (algorithm secret), recently company appear to have scaled back development
Department of Parliamentary ServicesParliamentary Library
Number of Tags assigned by OpenCalais
0
500
1000
1500
2000
2500
3000
3500
4000
4500
0 20 40 60 80 100 120
Tags per item
To
tal
nu
mb
er
Department of Parliamentary ServicesParliamentary Library
85%
4%
6%5%
Correct Tags
Incorrect Tags
Repeated Tags
Redundant Tags
Tag Quality
Department of Parliamentary ServicesParliamentary Library
Open Calais RDF
• OpenCalais links to its own ontology (rich in data for companies, but other classes have limited data)
• RDF has a lot of N-ary relationships (up to 1000 triple statements per article)
• SameAs or web links to:
– DBpedia, Wikipedia, Freebase, Reuters.com, GeoNames, Shopping.com, IMDB, LinkedMDB
Department of Parliamentary ServicesParliamentary Library
User interface
Department of Parliamentary ServicesParliamentary Library
Current Projects using Linked Data
• Government Agencies Database
• Parliamentary Papers
• Images