“indexing” non-text assets download resource handout for seminar at: david riecks, project...
TRANSCRIPT
“INDEXING” NON-TEXT ASSETS
Download resource handout for seminar at:http://www.controlledvocabulary.com/sla/sla-chi.pdf
David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org);IPTC Photo Metadata Working Group Member; and Founder of http://ControlledVocabulary.com/Twitter: @davidriecks
www.controlledvocabulary.comtwitter: @davidriecks
THIS IS NOT A WORD
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
THIS IS NOT A WORD
This is not a Word!
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS A DIGITAL IMAGE?
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS A DIGITAL IMAGE?
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS A DIGITAL IMAGE?
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS A DIGITAL IMAGE?
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS METADATA?
• Standard definition: • “Metadata is data about data”
www.controlledvocabulary.comtwitter: @davidriecks
WHAT IS METADATA?
• Standard definition: • “Metadata is data about data”
• Better definition:
• Metadata is information about a thing, apart from the thing itself
• Metadata surrounds us…
www.controlledvocabulary.comtwitter: @davidriecks
REAL-LIFE EMBEDDED METADATA
www.controlledvocabulary.comtwitter: @davidriecks
REAL-LIFE EMBEDDED METADATA
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
• What makes an asset smart?
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
• What makes an asset smart?• A description that tells you about the asset.• “Controlled” Keywords are used to “tag” its “aboutness”• You can easily find the creator, and copyright holder• You know how to credit the asset if published• You know what rights you have licensed (if not your own)
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
• How do we add this “smartness”?• By using “Standard” Metadata Schemas
• Exif• IPTC-IIM• IPTC Core• IPTC Extension• Dublin Core• PLUS• PMI (PRISM Metadata for Images)
For details visit: http://www.photometadata.org/META-101-metadata-types
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
• How do we add this “smartness”?• By using “Standard” Metadata Schemas• By embedding the metadata values with software like:
• Adobe Bridge• Adobe Creative Suite Apps (Photoshop/Illustrator/InDesign)• Photo Mechanic (Camerabits.com)• Media Pro (PhaseOne.com)• Apple Aperture• Or Other DAM Software
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS
• “Smart” Assets are ideal for distribution:• Self-describing• Recipient can see all (or nearly all) the data you can.• All “derivative” files inherit this “smartness” as well
Read about a new initiative…..
www.controlledvocabulary.comtwitter: @davidriecks
EMBEDDED METADATA MANIFESTO
• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital
media and should be applied to all media items which are exchanged as files or by other means such as data streams.
View more at: http://www.embeddedmetadata.org/
www.controlledvocabulary.comtwitter: @davidriecks
EMBEDDED METADATA MANIFESTO
• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital
media and should be applied to all media items which are exchanged as files or by other means such as data streams.
• 2. Media file formats should provide the means to embed metadata in ways that can be read and handled by different software systems.
www.controlledvocabulary.comtwitter: @davidriecks
EMBEDDED METADATA MANIFESTO
• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital
media and should be applied to all media items which are exchanged as files or by other means such as data streams.
• 2. Media file formats should provide the means to embed metadata in ways that can be read and handled by different software systems.
• 3. Metadata fields, their semantics (including labels on the user interface) and values, should not be changed across metadata formats.
www.controlledvocabulary.comtwitter: @davidriecks
EMBEDDED METADATA MANIFESTO
• The Five Guiding Principles:• 4. Copyright management information metadata must never
be removed from the files.
www.controlledvocabulary.comtwitter: @davidriecks
EMBEDDED METADATA MANIFESTO
• The Five Guiding Principles:• 4. Copyright management information metadata must never
be removed from the files.• 5. Other metadata should only be removed from files by
agreement with their copyright holders.
View more at: http://www.embeddedmetadata.org/
www.controlledvocabulary.comtwitter: @davidriecks
TESTING ASSETS FOR METADATA PRESERVATION
• Test by Reading One or All After Processing:• Use off the shelf DAM “Cataloging” software
• Media Pro• Idimager• Extensis Portfolio• Canto Cumulus
• Use Originating software (Photoshop / Bridge)• Use Command Line tools (ExifTool)• Use free reader tools
• Mac: Apple Preview, Spotlight, Search function in Finder• Windows: IrfanView, Exifer, etc. • AIR: EMET (Embedded Metadata Extraction Tool)
www.controlledvocabulary.comtwitter: @davidriecks
TESTING ASSETS FOR METADATA PRESERVATION
• ALWAYS! Test After Moving or Posting Online
Jeffrey’s Online Metadata Viewer
http://regex.info/exif.cgi
Sample image available at:
http://www.controlledvocabulary.com/socialmedia/cv-testbed_social-media.jpg
The base URL http://www.controlledvocabulary.com/socialmedia/ for info on how the various social media and Photo Sharing sites fare.
www.controlledvocabulary.comtwitter: @davidriecks
TESTING ASSETS FOR METADATA PRESERVATION
www.controlledvocabulary.comtwitter: @davidriecks
WHAT KIND OF ASSETS CAN WE MAKE “SMART”
• Image Files• JPEG• TIFF• PSD• EPS• DNG
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
WHAT KIND OF ASSETS CAN WE MAKE “SMART”
• Document / Illustration Files• PDF• Adobe Illustrator (AI)• Adobe InDesign (INDD)• EPS
www.controlledvocabulary.comtwitter: @davidriecks
WHAT KIND OF ASSETS CAN WE MAKE “SMART”
• Document / Illustration Files• PDF• Adobe Illustrator (AI)• Adobe InDesign (INDD)• EPS
www.controlledvocabulary.comtwitter: @davidriecks
WHAT KIND OF ASSETS CAN WE MAKE “SMART”
• Audio Files*• XMP or ID3 metadata tags?• mp3, aiff,aif, wav, m4p, m4a, snd
• Video Files*• XMP or QuickTime wrapper?• mov, mpg, mp4, divx, qtz, avi, wmv, dv
*These file formats have decreased interoperability
www.controlledvocabulary.comtwitter: @davidriecks
WHAT DO YOU ADD TO MAKE AN ASSET “SMART”?
Use the “Guide To Photo Metadata Fields”
http://photometadata.org/META-Resources-Field-Guide-to-Metadata
www.controlledvocabulary.comtwitter: @davidriecks
WHAT DO YOU ADD TO MAKE AN ASSET “SMART”?
Use the “IPTC / CEPIC Image Metadata Handbook
http://www.iptc.org/goto?imagemetadatahandbook
www.controlledvocabulary.comtwitter: @davidriecks
QUICK REVIEW: UPSIDES
• Advantages to Embedded Metadata?• Metadata makes it easy to find/locate assets in collections• Find the File >> Find the Info• Metadata travels with any “derivative” files you distribute.• Can be “leveraged” in downstream uses.
www.controlledvocabulary.comtwitter: @davidriecks
CONNECTING THE DOTS: REVIEW
• Advantages to Embedded Metadata?• Metadata makes it easy to find/locate assets in collections• Find the File >> Find the Info• Metadata travels with any “derivative” files you distribute.• Can be “leveraged” in downstream uses.
www.controlledvocabulary.comtwitter: @davidriecks
QUICK REVIEW: DOWNSIDES
• What are the Downsides to embedding metadata?• Not all applications “know” to use this info• Some applications may inadvertently remove some/all info• Increased time to update info in large files (photos or videos)
www.controlledvocabulary.comtwitter: @davidriecks
HOW DO I FIND OR CREATE METADATA / TAGS?
• Tips to create “Smart” Collections• Add metadata before placing assets into database• Make adding metadata part of a documented workflow• Require users to annotate and tag assets they contribute• “Batch” the addition of metadata whenever possible• Use a “controlled vocabulary” when adding keywords
www.controlledvocabulary.comtwitter: @davidriecks
HOW DO I FIND OR CREATE METADATA / TAGS?
• How do you want to “Tag” your Assets?• Manually: In-house• Manually:
• Request to Suppliers• Out-source to Third Party
• Auto-magically
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS WITH CONTROLLED VOCAB
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS WITH CONTROLLED VOCAB
Start with:
feet
shoes
girl
youth
grass
humor
outside
©D
avid
R
ieck
s
www.controlledvocabulary.comtwitter: @davidriecks
MAKING “SMART” ASSETS WITH CONTROLLED VOCAB
w/ Controlled Vocabulary:
child, juvenile, 4-12 years old, people, human beings, humans, person, body, foot, feet, fashion, clothing, apparel, clothes, womens clothing, womens apparel, women’s clothing, attire, shoes, shoe, female, girl, lass, plants, grass, humor, outside, outdoors©
Dav
id
Rie
cks
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package• Documents give sample workflow overviews• Special PDF allows you to make and print your own
http://www.iptc.org/goto?imagemetadatahandbook
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
www.controlledvocabulary.comtwitter: @davidriecks
DATA EXPORT & IMPORT
www.controlledvocabulary.comtwitter: @davidriecks
DATA EXPORT & IMPORT
• Code Replacement& Variables in PhotoMechanic used to Import Keywords
• See handout for URL
www.controlledvocabulary.comtwitter: @davidriecks
DOCUMENTING YOUR WORKFLOW
• Use the IPTC / CEPIC package
Download resource handout for seminar at:http://www.controlledvocabulary.com/sla/sla-chi.pdf
David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org);IPTC Photo Metadata Working Group Member; and Founder of http://ControlledVocabulary.com/Twitter: @davidriecks
QUESTIONS?
“INDEXING” NON-TEXT ASSETS