wikipedia as knowledge organization system

Download Wikipedia as Knowledge Organization System

If you can't read please download the document

Upload: jakob-

Post on 16-Apr-2017

4.322 views

Category:

Documents


3 download

TRANSCRIPT

Wikipedia as Knowledge Organization System

Digitale Bibliothek

Jakob VoVerbundzentrale des GBV (VZG)

Wikipediaas KnowledgeOrganization System

International UDC Seminar 2009The Hague, 29 October

Unless noted differently this presentation and all of its parts can be copied and reusedunder the terms of the Creative Commons Attribution-Share Alike 3.0 Unported license.

What do
UDC and Wikipedia
have in common?

Mundaneum

Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That's what we're doing.

Paul Otlet

Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That's what we're doing.

Paul Otlet

Jimbo Wales

Wikimedia Commons

http://en.wikipedia.org/wiki/Infinite_monkey_theorem

The Happy Accident

Joseph Reagle: Wikipedia: The Happy Accident.ACM Interactions. Volume 16, Number 3 (2009), pp 42-45.
DOI: 10.1145/1516016.1516026

World Wide Web(Infrastructure)

Nupedia (Funding)

Wiki (Tool)

2001

Its a wiki!

one database of pages

heavily hyperlinked

everyone can edit

every edit is traced
and revertible

Its the authority!

Most visited websites

Google*

Facebook

Yahoo!

YouTube

Windows Live

Wikipedia

* 95% of Wikipedia articles (en)rank in Googles top 10

http://www.alexa.com/topsites

UDC and Wikipedia

Monographic principle
one article per topic (and language)

Multilinguality
24 Wikipedias with > 1,000,000 articles
81 Wikipedias with > 1,000 articles

Perpetual modifications
Recent changes, page history

How is Knowledge
in Wikipedia
Organized?

1. Wikipedia Categories

Download: http://stats.wikimedia.org/EN/CategoryOverviewIndex.htm

Jakob Voss (2006):Collaborative thesaurus tagging the Wikipedia way.
http://arxiv.org/abs/cs/0604036

Each article can be sorted
into multiple categories

Multihierarchy of categories

Its a dynamic thesaurus

partly faceted, and precoordinated

UDC

35 Public administration352(493) Belgian cities3 Sciences sociales352 Lowest level of administration

(4) Europe(493) Belgium

Brusseles

Wikipedia Categories (en)

EuropeCapitals in EuropaGeographyGeography of EuropeCapitals

BrusselesRegions of BelgiumGeography by ContinentGeography by Place

... by Continent

Continents

Social Sciences

skipped 5 steps

Belgium

skipped 2 steps

http://toolserver.org/~dapete/catgraph/

Categories and Classifications

2. Wikipedia Article structure

substructure (sections, intro, sentences)

links between articles, to other language editions and to external resources

redirects (synonyms) and
disambiguation pages (homonyms)

lists, portals, and navigation boxes

structured infoboxes and geodata

(bibliographic) references

....

http://en.wikipedia.org/wiki/Brussels#History

WikiWord Thesaurus

multilingual thesaurus in SKOS, build
by mining the Wikipedia link structure

Daniel Kinzler (2008): Automatischer Aufbau eines multilingualen Thesaurus durch
Extraktion semantischer und lexikalischer Relationen aus der Wikipedia.

http://brightbyte.de/page/WikiWord

DBpedia

extracts structured information from Wikipedia and converts it to RDF triples

Kobilarov, Bizer, Auer & Lehmann (2009): DBpedia - A Linked Data Hub and Data

Source for Web and Enterprise Applications. http://www2009.eprints.org/228/

http://dbpedia.org/

RDF triples

BrusselsPaul Otlet

Belgium

Information sciencefield

born in

capital of

Paul Otlet

same

PND authority file

DBpedia (from Wikipedia)

other databases

http://dbpedia.org/resource/Brussels

http://sws.geonames.org/2800866/

a hub on the Semantic Web

DBPedia

the Semantic Webas of March 2009

Wikipedia Encyclopedia

WikiWord & DBPedia

Used for NLP, database Mapping
and Semantic Tagging

subject indexing with RDF concepts

for instance at BBC and by CommonTags

Wikipedia as KOS

Wikipedia and UDC

Link UDC and Wikipedia

Index by UDC, get Wikipedia

Index by Wikipedia, get UDC

Make UDC part of the Semantic Web!

Thoughts and Questions?

Klicken Sie, um das Format des Titeltextes zu bearbeiten

Klicken Sie, um die Formate des Gliederungstextes zu bearbeiten

Zweite Gliederungsebene

Dritte Gliederungsebene

Vierte Gliederungsebene

Fnfte Gliederungsebene

Sechste Gliederungsebene

Siebente Gliederungsebene

Achte Gliederungsebene

Neunte Gliederungsebene

Jakob Voss: Wikipedia as Knowledge Organization System Verbundzentrale des GBV (VZG)International UDC Seminar 2009The Hague, 29-30 October

Klicken Sie, um das Format des Titeltextes zu bearbeiten

Klicken Sie, um die Formate des Gliederungstextes zu bearbeiten

Zweite Gliederungsebene

Dritte Gliederungsebene

Vierte Gliederungsebene

Fnfte Gliederungsebene

Sechste Gliederungsebene

Siebente Gliederungsebene

Achte Gliederungsebene

Neunte Gliederungsebene