metadata for the gpii liddy nevile. drd metadata
TRANSCRIPT
Metadata for the GPII
Liddy Nevile
DRD metadata
GPII
• Offers a chance for cumulative, crowd-sourced, cloud-based, just-in-time provision of accessible resources through AccessForAll services.
• Needs lots of quality metadata.
So let’s take a use case - a resource about dogs.
• Martha’s preferred language is English, so when video is not in English, she wants English subtitles.
• 6 am. to 6 pm., the subtitles should be large.• 6 pm. to 6 am., the subtitles should be extra large.• 9 am. to 7 pm., the volume should be loud.• At other times, medium volume is OK. • The video brightness level should be at maximum
brightness until 6 pm. • but 80% of its maximum brightness in the evenings.
There are two ways to find what we want:
1.a typical database method and
2.a typical metadata method
A database solution
• If it is about dogs, does it have video?• If so, is the time between 6 pm. and 6 am?• If so, does it have extra large font subtitles?• If so, is it after 7 pm.?• If so, does it have loud volume?• If so, deliver it!
The algorithmic approach to discovery based on lots of conditionals.
Behind the database solutionResource x
Font sizeTime of day
Media type
video
text
image
6am - 6pm
7pm - 6am
midnight
large
Extra large
small
v. small
A metadata solution
Match• Media, time, font size, volume, to the metadata of the resource using
standard ‘advanced search’ matching algorithm
Note that changing size of font, e.g. might have to be implemented by a (cloud) service on the way.
valuepropertydata
Behind the metadata solution
Resource x
has mediumvideo
textimage
…
valuepropertydata
A metadata solution
Resource x
has medium
videotext
image…
is video with subtitles
V smallsmall
largeX large
valuepropertydata
Behind the metadata solution
Resource xhas medium
videotext
image…
Video with subtitles
V smallsmall
largeX large
Video with volumelow
mediumhigh
…
But, structure is often in the value• Perhaps, I have a general value - ‘loud.• Then I refined it to be more specific - ‘very
loud’.• Then I refined it again to be more specific -
‘low-pitch very loud.• Etc.(the core metadata value is loud and the
refinements are structured so they are always associated with it)
Essentials for metadata
• Sharing of metadata happens only if it is interoperable and that means interoperable syntax and interoperable semantics.
• Interoperable semantics depends on a combination of common words and common structure.
• So the GPII needs to specify common syntax, semantics and structure
What’s new, what’s available?
• The syntax of the Semantic Web is already there and very interoperable with the various computer languages people use - not a problem if we use a standard syntax.
• The semantics of terms, elements, vocabularies, whatever can be defined and published so everyone can share them.
But ….
• The Semantic Web is ‘wonderful’ but too broad and unmanageable for us so, perhaps like the DC people, we will need to constrain it?
• So let’s see what DC people do.
The ‘Semantic Web’ does what?
• By having a very clear and atomic, uniform structure it supports computer decision-making and so linking of one data resource to another, sometimes extending across a chain of resources(because computers are capable of simple logic).
The ‘Semantic Web’ does what?
• If a computer knows:– Fred is son of Bill and Mary– Tom is son of Bill and Mary
• then Fred and Tom are brothers, and if– Bill and Mary have son Fred– Bill and Mary have son Tom
• then Fred and Tom are brothers.
Semantic Web statements
• They are ALWAYS the same VERY simple statements - subject, relationship, value
• i.e. there is an invariable structure.• Note that this is the case even when we have
– Tom is son of Bill and Mary, and – Bill and Mary have son Tom.
• The structure is really important.
Dublin Core statements
• DC metadata people use the SemWeb but they don’t (yet) use it backwards and forwards. They say:– (the story) My Holidays was written by Jack, and– My Holidays was published by PBS
• because they tend to want to keep the descriptions of the story together, and elsewhere they describe the author, for simplicity.
• But using SemWeb technology, one can associate a description of what PBS publishes with what is known about Jack, or the story My Holidays.
Dublin Core statements
• Graphically, the difference is between a set of descriptions all flowing from single resources, even when they are later linked up (DC metadata), and a web where all sorts of things are linked one to another (in any direction).
• Either way, both use triples.
DC Metadata (bottom) and the Semantic Web (top)
Making metadata for ISO 24751
The steps are:– Analyse and determine the needs of the
client base (use domain expertise)– Sort the properties (domain and metadata
expertise)– Represent the properties using the
Resource Description Framework– Provide examples of encoding of metadata– Develop cross-walks where necessary
Making good metadata
Simple PNP descriptions
• People can be in a hurry and want immediate changes to their resources.
• A wizard can solve this problem, only presenting the detailed or complicated possibilities if they are indicated by the user.
Other important notes
• Metadata is generated in many ways - by people, by workflow processes, by usage (paradata), etc.
• Full-text searches etc are not useless - they are just different (and complementary, in most cases)
• Quantity has a quality all of its own but in the field of accessibility, quantity is not common.
Remember, we are aiming to support the user’s choice, not to impose constraints on their access to resources!
• Thank you!