Paola Mazzucchi, AIE and mEDRA Project Manager: Converging metadata for converging media @ Converging Media 2014
Good metadata help sell more books.
But that’s not the end of the story
Paola Mazzucchi
Converging metadata for converging media
Converging Media Conference, Gent, 24th September 2014
2
TISP is the platform where the publishing industry and the ICT industry discuss innovation, collaboration and partnerships within an international network.
TISP helps publishing and ICT converge on specific business needs and on a strategic view for the future.
TISP is an EU-funded Thematic Network.
About TISP:
Twitter: @tispnetwork
LinkedIn group: TISP - Technology and Innovation for Smart Publishing
www.smartbook-tisp.eu
Technology and Innovation for Smart Publishing
3
People make Stuff. People use Stuff.
People do deals about Stuff.
Scenario setting: convergence
• «What is a product» in the digital environment?
• Granularity and Complexity
• Beyond the product: works and abstractions
• Relations among stuff
• Same people, multiple names and identities
• Same people, multiple roles in multiple products
• Relations among people
• Events
• Actions people do with stuff
• Relations among people and stuff
4
These basic principles help us elaborate further on the challenges of today's digital environment
5
People make Stuff. People use Stuff.
People do deals about Stuff.
Scenario setting: convergence
Identify unambiguously the relevant entities
Describe relevant features of the relevant entities (rich metadata)
Create semantic relationships among different entities (products, works, people, parties, etc.)
Respond to the demand by users to re-use content
Enrich the experience of the end user
Provide information about how end users can get appropriate permission to re-use content
Increase discoverability of content (SEO)
Optimise supply chain operations, from royalty distribution to sales management
6
It’s all about metadata!
The ability to create and maintain relationships among entities relies on a network of metadata records and persistent identifiers that ensures interoperability across market segments
This applies to each of the media sectors as a self-contained domain
This applies more and more to a cross-media environment
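As a rough illustration of that "network of metadata records and persistent identifiers", the sketch below models entities keyed by an identifier scheme and value, with named links between them. The class names, the "stuff"/"people" labels as field values and the `related_to` relation are assumptions made for this example; the two identifiers are the ones quoted on the closing slide of this deck, and the link between them is illustrative only.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    """One identified thing: a product, a work, a party, and so on."""
    scheme: str   # identifier scheme, e.g. "ISBN" or "ISNI"
    value: str    # the identifier itself
    kind: str     # "stuff", "people" or "deal" (labels borrowed from the slides)
    label: str    # human-readable name

    @property
    def key(self):
        # The (scheme, value) pair is what makes the entity unambiguous.
        return (self.scheme, self.value)


@dataclass
class MetadataGraph:
    """A tiny in-memory network of records joined by persistent identifiers."""
    entities: dict = field(default_factory=dict)
    links: list = field(default_factory=list)

    def add(self, entity):
        self.entities[entity.key] = entity
        return entity

    def relate(self, source, relation, target):
        # Only registered entities can be linked.
        for entity in (source, target):
            if entity.key not in self.entities:
                raise KeyError(f"unregistered entity: {entity.key}")
        self.links.append((source.key, relation, target.key))

    def neighbours(self, entity):
        # Follow links in both directions.
        found = []
        for src, relation, dst in self.links:
            if src == entity.key:
                found.append((relation, self.entities[dst]))
            elif dst == entity.key:
                found.append((relation, self.entities[src]))
        return found


graph = MetadataGraph()
book = graph.add(Entity("ISBN", "9781847678355", "stuff", "The four Gospels"))
band = graph.add(Entity("ISNI", "0000 0001 1958 9618", "people",
                        "Nick Cave and the Bad Seeds"))
graph.relate(book, "related_to", band)  # hypothetical relation name
```

Because every link stores only identifier pairs, records from different sectors can be joined without agreeing on anything beyond the identifier schemes themselves.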
7
Our tool kit: Identifiers & Metadata
ISBN, DOI, ISTC
ONIX4B, ONIX4DOI
MARC, Dublin Core
GTIN, GRid & ISRC
ISWC
DDEX
ISAN, EIDR
PLUS ID
IPTC, PLUS
XMP & Exif
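Several of these identifiers carry a check digit, so that a mistyped value can be caught before it enters a metadata record. As a minimal sketch, the ISBN-13 check digit (weights 1 and 3 alternating over the first twelve digits) can be verified as follows; the function names are chosen for this example only.

```python
def isbn13_check_digit(first12: str) -> int:
    """Compute the ISBN-13 check digit from the first twelve digits."""
    if len(first12) != 12 or not (first12.isascii() and first12.isdigit()):
        raise ValueError("expected exactly 12 digits")
    # Digits in even positions (0-based) weigh 1, the others weigh 3.
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(first12))
    return (10 - total % 10) % 10


def is_valid_isbn13(isbn: str) -> bool:
    """Check length, digits and check digit; hyphens and spaces are ignored."""
    digits = isbn.replace("-", "").replace(" ", "")
    if len(digits) != 13 or not (digits.isascii() and digits.isdigit()):
        return False
    return isbn13_check_digit(digits[:12]) == int(digits[12])
```

The three ISBNs quoted later in this deck (9781847678355, 9780862417963 and 9781408835029) all pass this check.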
8
So far so good, as far as the theory goes…
but
Let’s come back to our daily job
9
Metadata in everyday life in the publishing industry/1
[Diagram: metadata flow in the book supply chain]
Stages: Metadata creation; Metadata management and supply; Metadata consumption
Actors: Publishers, Distributors, E-book platforms, BIP, Wholesalers, Online retailers, Libraries, Bookshops, Search engines, Social networks, RRO
Steps and flows: ISBN assignment; Quality check; Quality check and data enrichment; Metadata or book; Metadata
10
Metadata in everyday life in the publishing industry/2
Good metadata help sell more books,
as I hope you all know by now…
[Chart labels: Basic metadata; Enhanced metadata]
11
But that’s not the end of the story
Let’s look at some initiatives and services that take the same good metadata that help sell more books and use them in a different context, to enable a different range of services
Publishing industry
Converging metadata
Converging media sectors
12
Rights information management services
ARROW is a system that streamlines “rights information discovery” for a book or a collection of books, so that the European cultural heritage can be lawfully digitised and made available.
ARROW operations and algorithms are powered by metadata:
- national bibliographies (stuff)
- authors authority files (people)
- book supply chain data (BIP) (stuff)
- rights management data (RRO and CMO) (events)
Media sector: books and audio-visual
FORWARD will build a rights discovery service for the audio-visual sector through an automated system that will search, harvest and process metadata from film archives and producers.
FORWARD and the ARROW system will be fully interoperable and accessible to queries from all users across the EU.
13
Accessibility services for the visually impaired
Media sector: books
LIA (Accessible Italian Books) is a dedicated service to increase the number of accessible e-books available on the Italian market for blind and visually impaired readers.
LIA operations are powered by metadata:
- Title metadata from publishers (stuff)
- Supply chain metadata (BIP) (stuff)
- Accessibility metadata created in the certification service (stuff and event)
PRODUCTION: Semantic mark-up
CATALOGUING: Metadata management
DISTRIBUTION: Metadata supply
USE: Mark-up and metadata use
In LIA, accessibility metadata are merged with title information from E-Kitab (the BIP of Italian e-books). The resulting ONIX 3.0 metadata convey information on e-book accessibility all along the publishing supply chain: stores, libraries, aggregators.
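The merge step can be pictured as a join on the product identifier. The sketch below is not LIA's actual pipeline: the dictionary layout, the field names and the placeholder identifiers are all assumptions, and the real output is ONIX 3.0 rather than Python dictionaries.

```python
def merge_records(title_records, accessibility_records):
    """Attach accessibility metadata to title records that share an ISBN."""
    # Copy each title record so the input feed is left untouched.
    by_isbn = {rec["isbn"]: dict(rec) for rec in title_records}
    for acc in accessibility_records:
        target = by_isbn.get(acc["isbn"])
        if target is None:
            continue  # no matching title record to enrich
        target["accessibility"] = {k: v for k, v in acc.items() if k != "isbn"}
    return list(by_isbn.values())


# Placeholder data: identifiers and field names are invented for the example.
titles = [
    {"isbn": "ISBN-A", "title": "First example title"},
    {"isbn": "ISBN-B", "title": "Second example title"},
]
certified = [
    {"isbn": "ISBN-A", "certified": True, "features": ["tableOfContents"]},
]
merged = merge_records(titles, certified)
```

Only the first record gains an `accessibility` block; the second passes through unchanged, which mirrors a catalogue where only some titles have been certified.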
14
The implementation of Schema.org mark-up on the LIA website was tested by mapping product and accessibility metadata from ONIX 3.0 records to the Schema.org Book type.
Unfortunately, the Stanca Act, which regulates web accessibility in Italy, excludes the use of HTML5 to develop accessible websites.
Improve discoverability and SEO
microdata RDFa JSON-LD
✔ ✖ ✖
microdata RDFa JSON-LD
✖ ✔ ✔
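To make the mapping concrete, here is a minimal sketch of a Schema.org Book description serialised as JSON-LD, one of the three syntaxes compared above. It does not reproduce LIA's actual ONIX 3.0 mapping: the input record, its field names and the sample values are placeholders, and only the Schema.org property names (`isbn`, `bookFormat`, `accessibilityFeature`, `accessibilityHazard`) come from the vocabulary itself.

```python
import json


def to_schema_org_book(record):
    """Map a simple product record (hypothetical layout) to Schema.org Book."""
    return {
        "@context": "https://schema.org",
        "@type": "Book",
        "name": record["title"],
        "isbn": record["isbn"],
        "bookFormat": "https://schema.org/EBook",
        "accessibilityFeature": record["accessibility_features"],
        "accessibilityHazard": record["accessibility_hazards"],
    }


# Placeholder record: not a real title, identifier or certification result.
sample = {
    "title": "Example accessible e-book",
    "isbn": "ISBN-PLACEHOLDER",
    "accessibility_features": ["tableOfContents", "structuralNavigation"],
    "accessibility_hazards": ["noFlashingHazard"],
}

json_ld = json.dumps(to_schema_org_book(sample), indent=2, ensure_ascii=False)
```

The resulting string is what a page would embed so that search engines can read both the product data and the accessibility statements.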
15
RDI is an EU-funded project under the CIP framework that aims to demonstrate how to efficiently manage and trade intellectual property rights online for any and all types of usage, across any and all types of content, in any and all media.
RDI operations are powered by metadata provided by different sources:
- Content metadata (stuff)
- Rightsholders metadata: authors/creators and publishers/producers (people)
- Rights and licensing metadata (RRO/CMO) (events)
Streamline online rights transaction services
Media sector: books and journals; film and audio-visual; music; images
At the core of RDI is the creation of an interoperable communication layer between data sources and users, for example consumers looking for a license to use a piece of content, or “B2B” users looking for permission to re-sell or re-purpose existing content to create new content.
RDI implements the principles of the Linked Content Coalition to facilitate and expand the legitimate use of content in the digital network through the effective use of interoperable identifiers and metadata
16
Use case in the books and journal sector
Users, other aggregators, other services
17
One of the best-known DOI-based services is in the academic and research environment, where the DOI makes the semantic relations among different content types resolvable. Cross-linking and citation services between journals, datasets, researchers and funders are powered by DOI resolution and DOI metadata.
DOI
Media sector: books and journals; film and audio-visual; datasets; PSI
DOI (Digital Object Identifier) is a persistent, cross-media and resolvable identifier. Resolution is the process of going from an identifier to information about the identified entity (metadata) and, in some cases, to the entity itself. DOI has been made interoperable with other identifiers (e.g. the ISBN), so it can support the online use of those identifiers to access the metadata and services associated with the identified content.
Media sector: books and journals; datasets
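In practice, resolution usually starts from a URL built on the public DOI proxy at doi.org. The sketch below only splits a DOI into prefix and suffix and builds that URL; it performs no network call, and the helper names are chosen for this example. The DOI used in the test is the EIDR one quoted on the closing slide.

```python
from urllib.parse import quote


def split_doi(doi: str):
    """Split a DOI into its prefix (starting with "10.") and its suffix."""
    prefix, sep, suffix = doi.strip().partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI: {doi!r}")
    return prefix, suffix


def resolver_url(doi: str, proxy: str = "https://doi.org/") -> str:
    """Build the URL that asks the proxy to resolve the DOI."""
    prefix, suffix = split_doi(doi)
    # Percent-encode characters that are unsafe in a URL path.
    return f"{proxy}{prefix}/{quote(suffix, safe='/')}"
```

Following the resulting URL in a browser is the simplest form of resolution; the multiple-resolution and content-negotiation services described next build on the same entry point.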
18
DOI Context-aware multiple resolution
DOI
Metadata services
Registration and bibliographic metadata
Rights information metadata
Accessibility metadata
DOI kernel metadata
Content services
Reuse (licensing/permission)
Buy
Access
Other resolutions
Alternative version
about the contributors
about the content
Get metadata in Turtle
Get metadata in citation format
Get metadata in XML
Get metadata in RDF/XML
Media sector: books and journals; film and audio-visual; datasets; PSI
19
Content Negotiation is an application that makes DOI metadata available in different formats, regardless of the Registration Agency where the DOI has been registered and where the associated metadata are stored.
Content negotiation of DOI metadata
Media sector: books and journals; datasets
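From the client side, content negotiation amounts to asking the DOI proxy for a specific media type through the HTTP `Accept` header. The sketch below only prepares such requests and sends nothing. The media types are the ones commonly documented for DOI content negotiation; which of them a given Registration Agency actually serves is not verified here, and the dictionary keys and function name are assumptions of this example.

```python
from urllib.request import Request

# Short labels (invented here) mapped to commonly documented media types.
FORMATS = {
    "turtle": "text/turtle",
    "rdf_xml": "application/rdf+xml",
    "citation": "text/x-bibliography; style=apa",
    "csl_json": "application/vnd.citationstyles.csl+json",
}


def negotiation_request(doi: str, fmt: str) -> Request:
    """Prepare (but do not send) a content-negotiated request for a DOI."""
    return Request(f"https://doi.org/{doi}", headers={"Accept": FORMATS[fmt]})


# The DOI is the EIDR one quoted on the closing slide of this deck.
req = negotiation_request("10.5240/FE76-07CC-CACA-7BE7-C49E-K", "turtle")
```

Passing the prepared request to `urllib.request.urlopen` would perform the lookup; the same DOI with a different `Accept` value should then yield the same metadata in another serialisation.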
20
Formatting service for content-negotiated DOI metadata
Media sector: books and journals; datasets
21
Sooner or later in a converging future…
Conclusions
22
What do these have in common?
The four Gospels
Columbo (aka Peter Falk)
Harry Potter and the Deathly Hallows Part 1
Einstürzende Neubauten
What else?
23
Nick Cave and the Bad Seeds: ISNI 0000 0001 1958 9618
The four Gospels: ISBN 9781847678355
Columbo (aka Peter Falk): ISNI 0000 0000 7823 8642
Harry Potter and the Deathly Hallows Part 1: ISAN 0000-0002-C755-0000-D-0000-0002-V (DVD)
Einstürzende Neubauten: ISNI 0000 0001 2291 0804
Related products: ISBN 9780862417963
Related work: ISTC A02-2012-000013F3-7
Related product: ISBN 9781408835029
Related work: ISTC A02-2013-000003E3-0
Related work: ISAN 0000-0000-32FE-0000-T-0000-0000-O (Der Himmel über Berlin)
Related party: 0000 0000 7839 5824 (Blixa Bargeld)
Related work: ISAN 0000-0002-C755-0000-D-0000-0000-Z; EIDR 10.5240/FE76-07CC-CACA-7BE7-C49E-K
Related version: ISAN 0000-0002-C755-0000-D-0000-0001-X (Blu-ray)
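Identifiers like these only connect records reliably if they are transcribed correctly, which is why the final character of an ISNI is a check character computed with ISO 7064 MOD 11-2. The sketch below verifies it; the function names are chosen for this example, and the three values labelled as ISNI on this slide pass the check.

```python
def isni_check_character(first15: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for 15 digits."""
    total = 0
    for digit in first15:
        total = (total + int(digit)) * 2
    result = (12 - total % 11) % 11
    # A result of 10 is written as "X".
    return "X" if result == 10 else str(result)


def is_valid_isni(isni: str) -> bool:
    """Validate a 16-character ISNI; spaces and hyphens are ignored."""
    compact = isni.replace(" ", "").replace("-", "").upper()
    body = compact[:15]
    if len(compact) != 16 or not (body.isascii() and body.isdigit()):
        return False
    return isni_check_character(body) == compact[15]
```

A single mistyped digit changes the expected check character, so most transcription errors are caught before a wrong link is created.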
My metadata
Name: Paola
Surname: Mazzucchi
Professional Affiliation: AIE/mEDRA
Role: Project Manager
Email: [email protected]
Twitter: @MetalGoddess
LinkedIn: https://www.linkedin.com/in/paolamazzucchi