Wikipedia as Knowledge Organization System
Digital Library
Jakob Voss
Verbundzentrale des GBV (VZG)
International UDC Seminar 2009
The Hague, 29 October
Unless noted otherwise, this presentation and all of its parts can be copied and reused under the terms of the Creative Commons Attribution-Share Alike 3.0 Unported license.
What do UDC and Wikipedia have in common?
Mundaneum
Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That's what we're doing.
Paul Otlet
Jimbo Wales
Wikimedia Commons
http://en.wikipedia.org/wiki/Infinite_monkey_theorem
The Happy Accident
Joseph Reagle: Wikipedia: The Happy Accident. ACM Interactions, Volume 16, Number 3 (2009), pp. 42-45. DOI: 10.1145/1516016.1516026
World Wide Web (Infrastructure)
Nupedia (Funding)
Wiki (Tool)
2001
It's a wiki!
one database of pages
heavily hyperlinked
everyone can edit
every edit is traced
and revertible
It's the authority!
Most visited websites
Google*
Yahoo!
YouTube
Windows Live
Wikipedia
* 95% of Wikipedia articles (en) rank in Google's top 10
http://www.alexa.com/topsites
UDC and Wikipedia
Monographic principle
one article per topic (and language)
Multilinguality
24 Wikipedias with > 1,000,000 articles
81 Wikipedias with > 1,000 articles
Perpetual modifications
Recent changes, page history
How is Knowledge in Wikipedia Organized?
1. Wikipedia Categories
Download: http://stats.wikimedia.org/EN/CategoryOverviewIndex.htm
Jakob Voss (2006): Collaborative thesaurus tagging the Wikipedia way.
http://arxiv.org/abs/cs/0604036
Each article can be sorted
into multiple categories
Multihierarchy of categories
It's a dynamic thesaurus
partly faceted, and precoordinated
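The multihierarchy described above can be pictured as a directed graph: an article is sorted into several categories, and each category may itself have several parents. A minimal sketch in Python, assuming a small hand-written sample (the category names and edges below are illustrative, not taken from a Wikipedia dump):

```python
from collections import deque

# Illustrative sample data (assumption): category -> parent categories.
PARENTS = {
    "Capitals in Europe": ["Capitals", "Geography of Europe"],
    "Regions of Belgium": ["Belgium"],
    "Geography of Europe": ["Europe", "Geography by continent"],
    "Geography by continent": ["Geography"],
    "Belgium": ["Europe"],
    "Capitals": ["Geography"],
    "Europe": [],
    "Geography": [],
}

# Illustrative sample data (assumption): article -> categories it is sorted into.
ARTICLE_CATEGORIES = {
    "Brussels": ["Capitals in Europe", "Regions of Belgium"],
}


def ancestors(article):
    """Return every category reachable from an article, following all parents."""
    seen = set()
    queue = deque(ARTICLE_CATEGORIES.get(article, []))
    while queue:
        category = queue.popleft()
        if category in seen:
            continue  # a category reached via several paths is visited only once
        seen.add(category)
        queue.extend(PARENTS.get(category, []))
    return seen
```

Because a category can be reached along more than one path, the traversal keeps a set of visited categories rather than assuming a tree.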
UDC
3 Social sciences
35 Public administration
352 Lowest level of administration
(4) Europe
(493) Belgium
352(493) Belgian cities
Brussels
Wikipedia Categories (en)
Europe
Capitals in Europe
Geography
Geography of Europe
Capitals
Brussels
Regions of Belgium
Geography by Continent
Geography by Place
... by Continent
Continents
Social Sciences
skipped 5 steps
Belgium
skipped 2 steps
http://toolserver.org/~dapete/catgraph/
Categories and Classifications
2. Wikipedia Article structure
substructure (sections, intro, sentences)
links between articles, to other language editions and to external resources
redirects (synonyms) and
disambiguation pages (homonyms)
lists, portals, and navigation boxes
structured infoboxes and geodata
(bibliographic) references
....
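Redirects and disambiguation pages play the roles that synonym and homonym control play in a thesaurus. A minimal sketch in Python of how a lookup might use both, assuming hand-written sample tables (the entries and the function name `resolve` are illustrative, not extracted from Wikipedia or taken from any API):

```python
# Illustrative sample data (assumption): redirect title -> target title.
REDIRECTS = {
    "Bruxelles": "Brussels",
    "Brussel": "Brussels",
}

# Illustrative sample data (assumption): ambiguous title -> candidate articles.
DISAMBIGUATION = {
    "Mercury": ["Mercury (planet)", "Mercury (element)", "Mercury (mythology)"],
}


def resolve(title):
    """Map a search term to a list of candidate article titles."""
    seen = set()
    # Follow redirects (synonyms) until a non-redirect title is reached;
    # the `seen` set guards against accidental redirect loops.
    while title in REDIRECTS and title not in seen:
        seen.add(title)
        title = REDIRECTS[title]
    # A disambiguation page (homonym) expands into several candidates.
    return DISAMBIGUATION.get(title, [title])
```

A synonym resolves to exactly one article, while a homonym returns several candidates that a user or an indexing tool still has to choose between.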
http://en.wikipedia.org/wiki/Brussels#History
WikiWord Thesaurus
multilingual thesaurus in SKOS, built by mining the Wikipedia link structure
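Daniel Kinzler (2008): Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia [Automatic construction of a multilingual thesaurus by extracting semantic and lexical relations from Wikipedia].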
http://brightbyte.de/page/WikiWord
DBpedia
extracts structured information from Wikipedia and converts it to RDF triples
Kobilarov, Bizer, Auer & Lehmann (2009): DBpedia - A Linked Data Hub and Data Source for Web and Enterprise Applications. http://www2009.eprints.org/228/
http://dbpedia.org/
RDF triples
Paul Otlet -- born in --> Brussels
Paul Otlet -- field --> Information science
Brussels -- capital of --> Belgium
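The triple model can be sketched in a few lines of Python. This is a minimal illustration, assuming the slide's labels pair up as subject, predicate, and object in the way shown in the comments; plain strings stand in for the URIs that DBpedia actually uses, and the function name `match` is hypothetical:

```python
# Each statement is a (subject, predicate, object) tuple.
# Assumption: this pairing of the slide's labels is a reading of the diagram.
TRIPLES = [
    ("Paul Otlet", "born in", "Brussels"),
    ("Paul Otlet", "field", "Information science"),
    ("Brussels", "capital of", "Belgium"),
]


def match(subject=None, predicate=None, obj=None):
    """Return all triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in TRIPLES
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    ]
```

For example, `match(subject="Paul Otlet")` returns both statements about Otlet, and `match(predicate="capital of")` returns the single statement about Brussels.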
Paul Otlet
same
PND authority file
DBpedia (from Wikipedia)
other databases
http://dbpedia.org/resource/Brussels
http://sws.geonames.org/2800866/
a hub on the Semantic Web
DBpedia
the Semantic Web as of March 2009
Wikipedia Encyclopedia
WikiWord & DBpedia
Used for NLP, database mapping, and semantic tagging
subject indexing with RDF concepts
for instance at BBC and by CommonTags
Wikipedia as KOS
Wikipedia and UDC
Link UDC and Wikipedia
Index by UDC, get Wikipedia
Index by Wikipedia, get UDC
Make UDC part of the Semantic Web!
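Linking in both directions amounts to keeping a mapping that can be read from either side. A minimal sketch in Python, assuming a tiny hand-written concordance (the pairs below are illustrative, reuse notations from the UDC example earlier in the talk, and are not an official UDC-to-Wikipedia mapping; all function names are hypothetical):

```python
# Illustrative concordance (assumption): UDC notation -> Wikipedia article titles.
UDC_TO_WIKIPEDIA = {
    "(4)": ["Europe"],
    "(493)": ["Belgium"],
    "35": ["Public administration"],
}


def invert(mapping):
    """Build the reverse index: article title -> UDC notations."""
    inverted = {}
    for notation, titles in mapping.items():
        for title in titles:
            inverted.setdefault(title, []).append(notation)
    return inverted


WIKIPEDIA_TO_UDC = invert(UDC_TO_WIKIPEDIA)


def wikipedia_for(notation):
    """Index by UDC, get Wikipedia articles."""
    return UDC_TO_WIKIPEDIA.get(notation, [])


def udc_for(title):
    """Index by Wikipedia, get UDC notations."""
    return WIKIPEDIA_TO_UDC.get(title, [])
```

Both directions return lists because one notation may correspond to several articles and vice versa.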
Thoughts and Questions?
Jakob Voss: Wikipedia as Knowledge Organization System. Verbundzentrale des GBV (VZG). International UDC Seminar 2009, The Hague, 29-30 October