presentation 16 may morning casestudy 2 xavier jacques jourion

27
© 2013 RTBF DGTE

Upload: nederlands-instituut-voor-beeld-en-geluid

Post on 18-Jun-2015

387 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Page 2: Presentation 16 may morning casestudy 2 xavier jacques jourion

GEMSThe future is now

Semantics for [audiovisual] dummies

Xavier Jacques-Jourion

FIAT-IFTA Media Management Seminar

Beeld & Geluid, Hilversum, May 16th, 2013

Page 3: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Agenda

• Introduction

• Semantics 101

• Linked Data

• Demonstration

• Conclusion

3

Page 4: Presentation 16 may morning casestudy 2 xavier jacques jourion
Page 5: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

• Public broadcaster

• French-speaking

• 3 TV stations6 Radio stationsInternet portals

• Around 200.000 hours of archives (radio & TV)

• Digitisation in progress (SONUMA)

5

Page 6: Presentation 16 may morning casestudy 2 xavier jacques jourion

Semantics 101

An introduction to the semantic web

Page 7: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐   7

Page 8: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

From data to knowledge

8

131573076011/09/2001 - 08:46 EST

First plane hits the World Trade Center North Tower in New York

Page 9: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

From data to knowledge

9

Raw data

Information / Content

Knowledge

Page 10: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Data triplets

• Data inside the system is qualified

• Model: subject - predicate - object

• Examples:§ Steve is Peter’s son.

§ Peter is John’s brother.

10

has the colourthe sky blue

Subject ObjectPredicate

Page 11: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

From searching to knowing

11

Page 12: Presentation 16 may morning casestudy 2 xavier jacques jourion

As of September 2011

MusicBrainz

(zitgist)

P20

Turismo de

Zaragoza

yovisto

Yahoo! Geo

Planet

YAGO

World Fact-book

El ViajeroTourism

WordNet (W3C)

WordNet (VUA)

VIVO UF

VIVO Indiana

VIVO Cornell

VIAF

URIBurner

Sussex Reading

Lists

Plymouth Reading

Lists

UniRef

UniProt

UMBEL

UK Post-codes

legislationdata.gov.uk

Uberblic

UB Mann-heim

TWC LOGD

Twarql

transportdata.gov.

uk

Traffic Scotland

theses.fr

Thesau-rus W

totl.net

Tele-graphis

TCMGeneDIT

TaxonConcept

Open Library (Talis)

tags2con delicious

t4gminfo

Swedish Open

Cultural Heritage

Surge Radio

Sudoc

STW

RAMEAU SH

statisticsdata.gov.

uk

St. Andrews Resource

Lists

ECS South-ampton EPrints

SSW Thesaur

us

SmartLink

Slideshare2RDF

semanticweb.org

SemanticTweet

Semantic XBRL

SWDog Food

Source Code Ecosystem Linked Data

US SEC (rdfabout)

Sears

Scotland Geo-

graphy

ScotlandPupils &Exams

Scholaro-meter

WordNet (RKB

Explorer)

Wiki

UN/LOCODE

Ulm

ECS (RKB

Explorer)

Roma

RISKS

RESEX

RAE2001

Pisa

OS

OAI

NSF

New-castle

LAASKISTI

JISC

IRIT

IEEE

IBM

Eurécom

ERA

ePrints dotAC

DEPLOY

DBLP (RKB

Explorer)

Crime Reports

UK

Course-ware

CORDIS (RKB

Explorer)CiteSeer

Budapest

ACM

riese

Revyu

researchdata.gov.

ukRen. Energy Genera-

tors

referencedata.gov.

uk

Recht-spraak.

nl

RDFohloh

Last.FM (rdfize)

RDF Book

Mashup

Rådata nå!

PSH

Product Types

Ontology

ProductDB

PBAC

Poké-pédia

patentsdata.go

v.uk

OxPoints

Ord-nance Survey

Openly Local

Open Library

OpenCyc

Open Corpo-rates

OpenCalais

OpenEI

Open Election

Data Project

OpenData

Thesau-rus

Ontos News Portal

OGOLOD

JanusAMP

Ocean Drilling Codices

New York

Times

NVD

ntnusc

NTU Resource

Lists

Norwe-gian

MeSH

NDL subjects

ndlna

myExperi-ment

Italian Museums

medu-cator

MARC Codes List

Man-chester Reading

Lists

Lotico

Weather Stations

London Gazette

LOIUS

Linked Open Colors

lobidResources

lobidOrgani-sations

LEM

LinkedMDB

LinkedLCCN

LinkedGeoData

LinkedCT

LinkedUser

FeedbackLOV

Linked Open

Numbers

LODE

Eurostat (OntologyCentral)

Linked EDGAR

(OntologyCentral)

Linked Crunch-

base

lingvoj

Lichfield Spen-ding

LIBRIS

Lexvo

LCSH

DBLP (L3S)

Linked Sensor Data (Kno.e.sis)

Klapp-stuhl-club

Good-win

Family

National Radio-activity

JP

Jamendo (DBtune)

Italian public

schools

ISTAT Immi-gration

iServe

IdRef Sudoc

NSZL Catalog

Hellenic PD

Hellenic FBD

PiedmontAccomo-dations

GovTrack

GovWILD

GoogleArt

wrapper

gnoss

GESIS

GeoWordNet

GeoSpecies

GeoNames

GeoLinkedData

GEMET

GTAA

STITCH

SIDER

Project Guten-berg

MediCare

Euro-stat

(FUB)

EURES

DrugBank

Disea-some

DBLP (FU

Berlin)

DailyMed

CORDIS(FUB)

Freebase

flickr wrappr

Fishes of Texas

Finnish Munici-palities

ChEMBL

FanHubz

EventMedia

EUTC Produc-

tions

Eurostat

Europeana

EUNIS

EU Insti-

tutions

ESD stan-dards

EARTh

Enipedia

Popula-tion (En-AKTing)

NHS(En-

AKTing) Mortality(En-

AKTing)

Energy (En-

AKTing)

Crime(En-

AKTing)

CO2 Emission

(En-AKTing)

EEA

SISVU

education.data.g

ov.uk

ECS South-ampton

ECCO-TCP

GND

Didactalia

DDC Deutsche Bio-

graphie

datadcs

MusicBrainz

(DBTune)

Magna-tune

John Peel

(DBTune)

Classical (DB

Tune)

AudioScrobbler (DBTune)

Last.FM artists

(DBTune)

DBTropes

Portu-guese

DBpedia

dbpedia lite

Greek DBpedia

DBpedia

data-open-ac-uk

SMCJournals

Pokedex

Airports

NASA (Data Incu-bator)

MusicBrainz(Data

Incubator)

Moseley Folk

Metoffice Weather Forecasts

Discogs (Data

Incubator)

Climbing

data.gov.uk intervals

Data Gov.ie

databnf.fr

Cornetto

reegle

Chronic-ling

America

Chem2Bio2RDF

Calames

businessdata.gov.

uk

Bricklink

Brazilian Poli-

ticians

BNB

UniSTS

UniPathway

UniParc

Taxonomy

UniProt(Bio2RDF)

SGD

Reactome

PubMedPub

Chem

PRO-SITE

ProDom

Pfam

PDB

OMIMMGI

KEGG Reaction

KEGG Pathway

KEGG Glycan

KEGG Enzyme

KEGG Drug

KEGG Com-pound

InterPro

HomoloGene

HGNC

Gene Ontology

GeneID

Affy-metrix

bible ontology

BibBase

FTS

BBC Wildlife Finder

BBC Program

mes BBC Music

Alpine Ski

Austria

LOCAH

Amster-dam

Museum

AGROVOC

AEMET

US Census (rdfabout)

Media

Geographic

Publications

Government

Cross-domain

Life sciences

User-generated content

©  2013  RTBF  -­‐  DGTE  -­‐  

Linked Open Data (LOD)

12

Page 13: Presentation 16 may morning casestudy 2 xavier jacques jourion

But what does it do?

The power of linked data

Page 14: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Do not read this.

Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. More specifically, Wikipedia defines Linked Data as "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF."

14

Page 15: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  15

Page 16: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  16

Page 17: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  17

Page 18: Presentation 16 may morning casestudy 2 xavier jacques jourion

The GEMS project

Page 19: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

GEMS

• Goal: build a proof of concept for a semantic-based multimedia browser interface, using raw extracts from our media databases.

• De-mystify the field of semantics.

• Developed with two external partners:

19

Page 20: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Project intentions

20

• Use semantics to assemble the knowledge previously spread across multiple databases.

• Connect to public data sources using LOD.

• Propose a new research tool for journalists and production assistants.

• Cross-media searches.

• Speech-to-text engine.

• Ideally: change the way research is done by giving access to the knowledge harvested from the different media collection(s).

Page 21: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Principle

21

Nétia Tramontane Radio Dalet Tramontane

TV

GEMS

Page 22: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐  

Content

22

• Medias linked to the end of the Belgian government crisis in July 2011§ 4 “JT 19h30”, week starting July 4th, 2011

§ 3 “Invités de Matin Première” (Radio show)

§ 1 “Mise au point” (August 28, 2011)

§ Metadata linked to the above medias

Page 23: Presentation 16 may morning casestudy 2 xavier jacques jourion

Demo

Page 24: Presentation 16 may morning casestudy 2 xavier jacques jourion

Conclusion

Page 25: Presentation 16 may morning casestudy 2 xavier jacques jourion

Questions?

Page 26: Presentation 16 may morning casestudy 2 xavier jacques jourion

Thank you!

Page 27: Presentation 16 may morning casestudy 2 xavier jacques jourion

©  2013  RTBF  -­‐  DGTE  -­‐