web science: the digital heritage case
DESCRIPTION
29 January 2010: Keynote, SOFSEM 2010, Spindleruv Mlyn, Czech Republic
TRANSCRIPT
![Page 1: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/1.jpg)
Web Science: The Digital Heritage Case
Guus Schreiber
Informatics Dept., Web & Media
VU University Amsterdam
![Page 2: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/2.jpg)
21st century computer-science landscape
• Nowadays small (student) projects are achieving more than large multi-year multi-person projects 10 years ago
• Data storage and computing power are no longer an omnipresent problem
• Globalization of data and service availability
• Data and applications are moving to the Web
![Page 3: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/3.jpg)
Web science
• Web science is not computer science transported to the Web
• Web science is socially embedded
• Broad scope of research issues: trust, reputation, security, governance, social networks, economic models
• “Shift from studying chips to studying clicks”
![Page 4: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/4.jpg)
My journey: knowledge engineering
• design patterns for problem solving
• methodology for knowledge systems
• models of domain knowledge
• ontology engineering
![Page 5: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/5.jpg)
My journey: access to digital heritage
![Page 6: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/6.jpg)
My journey: Web standards
• OWL Web Ontology Language
• SKOS model for publishing vocabularies on the Web
![Page 7: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/7.jpg)
A definition of Web science
“Web science is the study of
(i) the social behavior in the Web at the personal, organizational and societal level,
(ii) the Web technology that enables this behavior, and
(iii) the interactions between technology and behavior”
![Page 8: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/8.jpg)
Good introduction: Shneiderman, Comm. ACM, 2007
![Page 9: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/9.jpg)
![Page 10: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/10.jpg)
We need to study the Web as a phenomenon
• Is the Web changing faster than our ability to observe it?
• How to measure or instrument the Web?
• How to identify behaviors and patterns?
• How to analyze the changing structure of the Web?
Sample research themes:
• Web dynamics
• Collective intelligence
• Privacy, trust and security
• Linked open data
22 January 2010 Information Day on FET Flagships, Brussels
![Page 11: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/11.jpg)
Web science issues: social computing
• What makes online communities successful?
– Role of moderation
– Issues of trust and identity
– Growth models
– Role of type of discussion topic
– Type of language used
• Driving role of technology
![Page 12: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/12.jpg)
![Page 13: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/13.jpg)
Web science issues: psychology and pedagogy
• Effect of chat/email on:
– Mental development of children
– Forming of relationships
– Changes in cultural preferences
• Distance learning
• Distance psychotherapy
![Page 14: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/14.jpg)
Web science issues: privacy
• Control over personal data
– Videos and pictures of you
• Identity theft
• Who is allowed to store what?
• What do you accept as a user?
![Page 15: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/15.jpg)
Web science issues: legal problems
• Lack of “location”
– Which law to apply?
• Copyright
– New types of licenses required
– See for example Creative Commons
• Data aggregation: what can Google/Facebook/… do with our data?
![Page 16: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/16.jpg)
Web science issues: new economic models
• Micro payments for mail messages?!
• Downloading material with copyright
– What is the price of a song?
• Global personalized services replace shops
– E.g., Web services for items now in museum shops
![Page 17: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/17.jpg)
New economic models for distribution of fees for music rights
![Page 18: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/18.jpg)
Web science issues: universal access
• Ideals: access for all!
• Limited Web access in particular countries
• Spreading of hate
• Quality of material gathered on a global scale
– Wikipedia
![Page 19: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/19.jpg)
Web for Social Development
![Page 20: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/20.jpg)
http://www.archive.com
![Page 21: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/21.jpg)
THE DIGITAL-HERITAGE CASE
![Page 22: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/22.jpg)
![Page 23: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/23.jpg)
The Web: resources and links
[Diagram: two resources, each identified by a URL, connected by a Web link]
![Page 24: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/24.jpg)
The Semantic Web: typed resources and links
[Diagram: the painting “Woman with hat” (SFMOMA) is connected by a typed link, Dublin Core creator, to Henri Matisse (ULAN); both resources are identified by URLs]
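The typed link on this slide can be sketched with plain Python tuples standing in for RDF triples. This is only an illustration: the `ex:` and `ulan:` identifiers and the `ex:heldBy` property are hypothetical placeholders, not real URIs, and `dc:creator` stands for the Dublin Core creator property named on the slide.

```python
# Each statement is a (subject, predicate, object) triple.
# All identifiers are illustrative placeholders, not real URIs.
TRIPLES = [
    ("ex:woman-with-hat", "rdf:type", "ex:Painting"),
    ("ex:woman-with-hat", "dc:creator", "ulan:henri-matisse"),
    ("ex:woman-with-hat", "ex:heldBy", "ex:SFMOMA"),
    ("ulan:henri-matisse", "rdfs:label", "Henri Matisse"),
]


def objects(subject, predicate, triples=TRIPLES):
    """Return all objects linked from `subject` by a link of type `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def subjects(predicate, obj, triples=TRIPLES):
    """Return all subjects that point to `obj` via `predicate`."""
    return [s for s, p, o in triples if p == predicate and o == obj]


if __name__ == "__main__":
    # Who created the painting, and what did that person create?
    print(objects("ex:woman-with-hat", "dc:creator"))
    print(subjects("dc:creator", "ulan:henri-matisse"))
```

Because every link carries a type, the same data can be queried in both directions: from a painting to its creator, or from a creator to all works attributed to them.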
![Page 25: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/25.jpg)
![Page 26: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/26.jpg)
![Page 27: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/27.jpg)
The myth of a unified vocabulary
• In large virtual collections there are always multiple vocabularies
– In multiple languages
• Every vocabulary has its own perspective
– You can’t just merge them
• But you can use vocabularies jointly by defining a limited set of links
– “Vocabulary alignment”
• It is surprising what you can do with just a few links
![Page 28: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/28.jpg)
Example use of vocabulary alignment
“Tokugawa”
SVCN period: Edo
SVCN is a local in-house ethnology thesaurus
AAT style/period: Edo (Japanese period); Tokugawa
AAT is Getty’s Art & Architecture Thesaurus
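A minimal sketch of how a single alignment link could make the “Tokugawa” example work, assuming the in-house thesaurus only knows the label “Edo” while the AAT concept also carries “Tokugawa”. The concept identifiers, the object annotations and the function names are hypothetical; this is not the system shown in the talk.

```python
# Labels per concept; identifiers are hypothetical stand-ins for thesaurus entries.
LABELS = {
    "svcn:edo": ["Edo"],
    "aat:edo": ["Edo (Japanese period)", "Tokugawa"],
}

# A single alignment link between the two vocabularies (treated as symmetric).
ALIGNMENTS = [("svcn:edo", "aat:edo")]

# Object annotations made with the in-house thesaurus (invented sample data).
ANNOTATIONS = {"object-1": ["svcn:edo"], "object-2": ["svcn:meiji"]}


def concepts_for(term):
    """Concepts whose labels match the term, plus concepts aligned with them."""
    term = term.lower()
    found = {c for c, labels in LABELS.items()
             if any(term == label.lower() for label in labels)}
    # One pass over the alignment links; chains of links are not followed.
    for a, b in ALIGNMENTS:
        if a in found or b in found:
            found |= {a, b}
    return found


def search(term):
    """Objects annotated with any concept reachable from the search term."""
    concepts = concepts_for(term)
    return sorted(obj for obj, tags in ANNOTATIONS.items()
                  if concepts.intersection(tags))


if __name__ == "__main__":
    print(search("Tokugawa"))
```

Without the entry in `ALIGNMENTS`, a search for “Tokugawa” would match only the AAT concept and return no objects, since the objects are annotated with the in-house term.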
![Page 29: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/29.jpg)
Semantic search: clustering and cluster-order principles
![Page 30: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/30.jpg)
![Page 31: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/31.jpg)
Web science issues: technical
• Information retrieval as graph search
– more semantics => more paths
– finding optimal graph patterns
• Vocabulary alignment
• Information extraction
– recognizing people, locations, …
– identity resolution
• Multi-lingual resources
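The “information retrieval as graph search” point can be illustrated with a small breadth-first path search. This is a sketch under stated assumptions: the graph, the node and edge names, and the hop limit are all invented, and ranking of paths (the “optimal graph patterns” question) is left out.

```python
from collections import deque

# Undirected labeled edges; all node and edge names are invented for illustration.
EDGES = [
    ("painting-1", "creator", "artist-a"),
    ("painting-2", "creator", "artist-b"),
    ("artist-a", "style", "style-x"),
    ("artist-b", "style", "style-x"),
]


def neighbours(node, edges):
    """Yield (edge_label, other_node) for every edge touching `node`."""
    for s, label, o in edges:
        if s == node:
            yield label, o
        elif o == node:
            yield label, s


def find_paths(start, is_target, edges, max_hops=3):
    """Breadth-first search returning all simple paths to target nodes.

    A path is a list alternating nodes and edge labels:
    [node, label, node, label, node, ...].
    """
    results = []
    queue = deque([[start]])
    while queue:
        path = queue.popleft()
        hops = len(path) // 2
        if hops > 0 and is_target(path[-1]):
            results.append(path)
        if hops == max_hops:
            continue
        for label, nxt in neighbours(path[-1], edges):
            if nxt not in path[::2]:  # keep paths simple (no repeated nodes)
                queue.append(path + [label, nxt])
    return results


def is_painting(node):
    return node.startswith("painting")


if __name__ == "__main__":
    for path in find_paths("artist-a", is_painting, EDGES):
        print(" -> ".join(path))
```

With only the two `creator` edges, the search from `artist-a` reaches one painting. Adding the `style` edges opens a second, longer path to `painting-2`, which is the “more semantics => more paths” effect: recall goes up, and the system then has to decide which path patterns are worth showing.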
![Page 32: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/32.jpg)
Search: WordNet patterns that increase recall without sacrificing precision
![Page 33: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/33.jpg)
Web science issues: economic
• Cultural heritage organizations find it difficult to “give away” their data
– concerns for quality
• Re-orientation: Web is not derivative of physical presence; they should stand side-by-side
• Universal access: everyone should be able to enjoy the Rijksmuseum Amsterdam
![Page 34: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/34.jpg)
Web science issues: economic
• Primary access free
• Secondary services cost money
– virtual museum shop can offer much larger collection
– access to high-resolution images
– tourist services on mobile devices
![Page 35: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/35.jpg)
Web science issues: legal
• Europeana.eu faces enormous rights issues, in particular with respect to recent works
• New licensing frameworks:
– Creative Commons
– Open Data Commons
![Page 36: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/36.jpg)
Availability of government data: http://data.gov.uk
![Page 37: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/37.jpg)
Linked Open Data initiative
![Page 38: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/38.jpg)
Handling billions of statements
![Page 39: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/39.jpg)
User-generated metadata
![Page 40: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/40.jpg)
Auto completion services
![Page 41: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/41.jpg)
Video tagging games
![Page 42: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/42.jpg)
Web science issues
• CH organizations need outside help to annotate their objects
• Tension between in-house conservation-biased metadata and user view
– who, what, where, when
• How can we derive trust levels?
• Auto completion is a difficult technical issue in case of many vocabularies
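One reason auto-completion gets harder with many vocabularies is that the same prefix matches labels from several sources. A minimal sketch, assuming a single sorted in-memory index; the sample labels, the vocabulary assignments and the `complete` function are invented for illustration and say nothing about how the services shown in the talk are built.

```python
import bisect

# Invented sample labels from three vocabularies.
VOCABULARIES = {
    "AAT": ["Edo (Japanese period)", "etching", "engraving"],
    "ULAN": ["Edouard Manet", "Henri Matisse"],
    "SVCN": ["Edo"],
}

# One sorted index of (lowercased label, label, vocabulary) entries.
INDEX = sorted(
    (label.lower(), label, vocab)
    for vocab, labels in VOCABULARIES.items()
    for label in labels
)


def complete(prefix, limit=5):
    """Return up to `limit` (label, vocabulary) pairs whose label starts with prefix."""
    prefix = prefix.lower()
    # Binary search for the first entry that is not smaller than the prefix.
    start = bisect.bisect_left(INDEX, (prefix,))
    matches = []
    for key, label, vocab in INDEX[start:]:
        if not key.startswith(prefix) or len(matches) == limit:
            break
        matches.append((label, vocab))
    return matches


if __name__ == "__main__":
    for label, vocab in complete("edo"):
        print(f"{label}  [{vocab}]")
```

Returning the vocabulary alongside each label lets the interface group or disambiguate suggestions, for example a period versus a person, instead of presenting one flat list.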
![Page 43: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/43.jpg)
http://chip-project.org/
![Page 44: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/44.jpg)
Mobile museum tour
![Page 45: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/45.jpg)
Web science issues
• Recommendation strategies
– content-based vs. collaborative
• Combining your Web profiles for interests
– OpenID
– Google Social Graph API: http://code.google.com/apis/socialgraph/
• Tour connects virtual and physical world
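The two recommendation strategies named above can be contrasted in a few lines. This is a sketch with invented artworks, topics and users, using simple set overlap (Jaccard similarity) as the similarity measure; it is not the method used in the project shown in the talk.

```python
# Invented sample data: artwork topics and the artworks each user liked.
TOPICS = {
    "art-1": {"portrait", "17th century"},
    "art-2": {"portrait", "impressionism"},
    "art-3": {"landscape", "17th century"},
    "art-4": {"sculpture"},
}
LIKES = {
    "ann": {"art-1", "art-2"},
    "bob": {"art-1", "art-4"},
    "eve": {"art-1"},
}


def jaccard(a, b):
    """Overlap of two sets, between 0.0 and 1.0."""
    return len(a & b) / len(a | b) if a | b else 0.0


def content_based(user):
    """Rank unseen artworks by topic overlap with the artworks the user liked."""
    liked = LIKES[user]
    profile = set().union(*(TOPICS[a] for a in liked))
    scores = {a: jaccard(profile, t) for a, t in TOPICS.items() if a not in liked}
    return sorted(scores, key=lambda a: (-scores[a], a))


def collaborative(user):
    """Rank unseen artworks by the likes of users with similar taste."""
    liked = LIKES[user]
    scores = {}
    for other, other_liked in LIKES.items():
        if other == user:
            continue
        weight = jaccard(liked, other_liked)
        for art in other_liked - liked:
            scores[art] = scores.get(art, 0.0) + weight
    return sorted(scores, key=lambda a: (-scores[a], a))


if __name__ == "__main__":
    print(content_based("eve"))
    print(collaborative("eve"))
```

The content-based ranking can surface `art-3`, which no user has liked yet, because it only needs object metadata. The collaborative ranking ignores metadata entirely and can therefore suggest `art-4`, which shares no topic with the user's profile.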
![Page 46: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/46.jpg)
www.iFanzy.nl
![Page 47: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/47.jpg)
[Screenshot: “Friends following this event”; comment from Billy: “That was never a corner..”]
![Page 48: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/48.jpg)
Web science issues
• TV and Web are becoming merged
– decoupling of media content from device
• Connecting Web content with media content
– use music preferences to suggest TV programs
• Trusted access to preferences of friends
• User profile standards: FOAF
![Page 49: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/49.jpg)
VU University Amsterdam
Department of Informatics
User experience Lab
![Page 50: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/50.jpg)
Some things I learned
![Page 51: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/51.jpg)
Principles of knowledge engineering on the Web
Principle 1: Be modest! allow for multiple views and realities
Principle 2: Think large! cf. Doug Lenat
Principle 3: Don't recreate but enrich and align!
Principle 4: Beware of ontological over-commitment!
![Page 52: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/52.jpg)
SKOS: metamodel for publishing vocabularies on the Web
![Page 53: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/53.jpg)
Issues in specification of SKOS semantics
• SKOS should cover a large range of “vocabularies”, “thesauri”, “terminologies”, “classification schemes”, etc.
• Therefore: objective was to define the minimal semantics
• Leave hooks for specializations
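A small sketch of what “minimal semantics” can mean in practice, using a handful of SKOS property names (`skos:prefLabel`, `skos:altLabel`, `skos:broader`) as dictionary keys. The concepts and their hierarchy are invented for illustration. SKOS does not declare `skos:broader` to be transitive (it offers `skos:broaderTransitive` as a separate property), so an application that wants all ancestors computes the closure itself, as below.

```python
# A tiny vocabulary expressed with a few SKOS property names.
# Concept identifiers, labels and hierarchy are invented for illustration.
CONCEPTS = {
    "ex:periods": {"skos:prefLabel": "periods", "skos:broader": []},
    "ex:japanese-periods": {
        "skos:prefLabel": "Japanese periods",
        "skos:broader": ["ex:periods"],
    },
    "ex:edo": {
        "skos:prefLabel": "Edo",
        "skos:altLabel": ["Tokugawa"],
        "skos:broader": ["ex:japanese-periods"],
    },
}


def broader_transitive(concept, concepts=CONCEPTS):
    """All ancestors reachable by following skos:broader repeatedly."""
    seen = []
    stack = list(concepts[concept].get("skos:broader", []))
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.append(parent)
            stack.extend(concepts[parent].get("skos:broader", []))
    return seen


if __name__ == "__main__":
    print(broader_transitive("ex:edo"))
```

Keeping the direct `skos:broader` statements separate from the derived closure is one example of leaving a hook: a thesaurus with strict hierarchies and a loose classification scheme can both be published with the same property without being forced into the same inferences.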
![Page 54: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/54.jpg)
![Page 55: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/55.jpg)
Large organizations have adopted SKOS
![Page 56: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/56.jpg)
Web science education
Typical curriculum topics
• Web technology
• Web communication
• Web society
• Web data
http://wiki.websciencetrust.org/w/Curriculum_topics
![Page 57: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/57.jpg)
http://webscience.org/webscience.html
![Page 58: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/58.jpg)
![Page 59: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/59.jpg)
Take home message
• Web science is much more than computer science for the Web
• Web science is strongly interdisciplinary
• Computer science is an important enabler for Web developments
![Page 60: Web Science: the digital heritage case](https://reader038.vdocuments.net/reader038/viewer/2022102815/5578f5f7d8b42a675b8b4721/html5/thumbnails/60.jpg)
Thank you!
• This talk represents the work of many people, in particular colleagues at the VU and in the Web Science Trust. Lora Aroyo, Nigel Shadbolt, Jacco van Ossenbruggen and Michiel Hildebrand provided slides.