crowdsourcing in the audiovisual domain

CROWDSOURCING in the Audiovisual Domain Maarten Brinkerink Netherlands Institute for Sound and Vision @mbrinkerink

Upload: maartenbrinkerink

Post on 19-Feb-2017


TRANSCRIPT

CROWDSOURCING in the Audiovisual Domain

Maarten Brinkerink Netherlands Institute for Sound and Vision

@mbrinkerink

SOUND AND VISION

The Netherlands Institute for Sound and Vision is a cultural-historical organization of national interest. It collects, preserves and opens up the audiovisual heritage for as many users as possible: media professionals, education, science and the general public.

1,000,000 HOURS OF A/V

SMART, CONNECTED & OPEN

ASSUMPTIONS FOR THIS TALK

• Users and collections share the same information space online

• This is a win-win situation

TYPOLOGY OF CROWDSOURCING

1. Tagging and Classification
2. Collection Acquisition
3. Contextualisation
4. Correction and Transcription
5. Co-curation
6. Crowdfunding

(Oomen & Aroyo 2011)

TAGGING AND CLASSIFICATION

VIDEO LABELING GAME

GAME WITH A PURPOSE: FINE-GRAINED TAGS, MATCHING CONTROLLED VOCABULARIES

GAME MECHANISMS

OBJECTIVE: TYPE WHAT YOU HEAR AND SEE AT ANY GIVEN TIME

REWARD: YOU SCORE POINTS WHEN YOU MATCH ANOTHER PLAYER
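The matching rule on this slide can be illustrated with a minimal sketch. It is not the game's actual implementation: the 10-second window, the 50 points per match, the text normalisation and all names (`Tag`, `find_matches`, `score`) are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tag:
    player: str
    text: str
    time_s: float  # position in the video, in seconds


def normalise(text: str) -> str:
    # Assumption: tags match regardless of case and extra whitespace.
    return " ".join(text.lower().split())


def find_matches(tags, window_s=10.0):
    """Pair up equal tags from different players entered close together in time."""
    matches = []
    ordered = sorted(tags, key=lambda t: t.time_s)
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.time_s - a.time_s > window_s:
                break  # later tags are even further away in time
            if a.player != b.player and normalise(a.text) == normalise(b.text):
                matches.append((a, b))
    return matches


def score(tags, points_per_match=50, window_s=10.0):
    """Give both players points for every match (hypothetical scoring)."""
    totals = {t.player: 0 for t in tags}
    for a, b in find_matches(tags, window_s):
        totals[a.player] += points_per_match
        totals[b.player] += points_per_match
    return totals


example_tags = [
    Tag("anna", "Windmill", 12.0),
    Tag("bram", "windmill", 15.5),
    Tag("anna", "bicycle", 40.0),
    Tag("bram", "canal", 41.0),
]
example_scores = score(example_tags)
```

In this example only the two "windmill" tags agree, so both hypothetical players receive points for one match; the unmatched tags score nothing.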

RESULTS

OVER 240,000 TAGS, 143,000 MATCHES

FROM RESEARCH: SEARCH BASED ON USER TAGS MOST EFFECTIVE

COLLECTION ACQUISITION

SOUNDS OF THE NETHERLANDS

WE UPLOADED 2,000 HIGH QUALITY FIELD RECORDINGS TO SOUNDCLOUD AND SHARED OPENLY

COLLABORATIVE SOUND MAPPING

WE PUT ALL THE SOUNDS ON A MAP & ASKED PEOPLE TO FILL THE GAPS

UNEXPECTED RESULTS

ENRICHMENTS ON SOUNDCLOUD

REUSE IN WIKIPEDIA

APPS

CONTEXTUALISATION

OPEN IMAGES

• Open content

• Open software

• Open formats

• Open metadata

• Open API

VIDEO ON WIKIMEDIA

• Through Open Images, NISV shares over 4,000 open video items

• This is 8% of the total amount of video available on Wikimedia Commons

• This includes almost 3,500 historical newsreels, and recent raw nature footage

• This is a mix of PD and openly licensed material ‘owned’ by the institute

OUR NEWSREELS ON WIKIPEDIA

WIKIPEDIA ARTICLE ‘MOTHER’S DAY’ REUSES OUR NEWSREEL TO ILLUSTRATE INTERNATIONAL HISTORY

REUSE AND PUBLIC REACH

• Over 40% of the available open video items are being reused at least once

• In total almost 4,000 Wikipedia articles reuse the items, spread over 98 language versions of Wikipedia

• On a monthly basis, these articles are viewed over 4,000,000 times

• Since 2010 this growing open collection has attracted over 160,000,000 pageviews

FIGURES IN PERSPECTIVE

• The entire NISV collection contains 1,000,000 hours of audiovisual material

• Through Open Images NISV shares 150 hours of video and 150 hours of audio

• This is just 0.03% of the collection

• The potential is enormous!!!
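The 0.03% figure follows directly from the numbers on this slide. As a quick check, using only those figures (the variable names are illustrative):

```python
total_hours = 1_000_000   # entire NISV collection
open_video_hours = 150    # shared through Open Images
open_audio_hours = 150    # shared through Open Images

shared_hours = open_video_hours + open_audio_hours
share_percent = shared_hours / total_hours * 100

print(f"{shared_hours} of {total_hours:,} hours = {share_percent:.2f}%")
# prints "300 of 1,000,000 hours = 0.03%"
```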

CORRECTION AND TRANSCRIPTION

SUBTITLES AND TRANSLATIONS

CO-CURATION

PERSONAL COLLECTIONS AND MOTI

SUBMIT YOUR OWN COLLECTION

VOTE FOR OTHER COLLECTIONS

ONLINE EXHIBIT

CROWDFUNDING

FOR THE ARTS

DIGITAL CONTENT LIFE CYCLE

FINAL THOUGHTS

• Crowdsourcing is not (only) about substitution, but also about user engagement and closing (semantic) gaps

• Determine what type of crowdsourcing could benefit your specific collection

• Look at existing (open source) platforms, tools and solutions, to avoid the ‘not invented here syndrome’

• If possible, focus on specific collections, tasks and communities (of interest)

• Allow for unexpected outcomes, by embracing openness

THANKS!!! QUESTIONS?

Maarten Brinkerink

@mbrinkerink

Thanks to Johan Oomen for his contributions!

Digital Content Life Cycle based on: http://beta.digipedia.org.uk/wiki/Digital_content_life_cycle (CC-BY)