“indexing” non-text assets download resource handout for seminar at: david riecks, project...

52
“INDEXING” NON-TEXT ASSETS Download resource handout for seminar at: http://www.controlledvocabulary.com/sla/ sla-chi.pdf David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org ); IPTC Photo Metadata Working Group Member; and Founder of http

Upload: olivia-dean

Post on 29-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

“INDEXING” NON-TEXT ASSETS

Download resource handout for seminar at:http://www.controlledvocabulary.com/sla/sla-chi.pdf

David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org);IPTC Photo Metadata Working Group Member; and Founder of http://ControlledVocabulary.com/Twitter: @davidriecks

Page 2: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

THIS IS NOT A WORD

©D

avid

R

ieck

s

Page 3: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

THIS IS NOT A WORD

This is not a Word!

©D

avid

R

ieck

s

Page 4: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS A DIGITAL IMAGE?

©D

avid

R

ieck

s

Page 5: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS A DIGITAL IMAGE?

©D

avid

R

ieck

s

Page 6: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS A DIGITAL IMAGE?

©D

avid

R

ieck

s

Page 7: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS A DIGITAL IMAGE?

©D

avid

R

ieck

s

Page 8: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS METADATA?

• Standard definition: • “Metadata is data about data”

Page 9: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT IS METADATA?

• Standard definition: • “Metadata is data about data”

• Better definition:

• Metadata is information about a thing, apart from the thing itself

• Metadata surrounds us…

Page 10: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

REAL-LIFE EMBEDDED METADATA

Page 11: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

REAL-LIFE EMBEDDED METADATA

Page 12: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

• What makes an asset smart?

©D

avid

R

ieck

s

Page 13: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

• What makes an asset smart?• A description that tells you about the asset.• “Controlled” Keywords are used to “tag” its “aboutness”• You can easily find the creator, and copyright holder• You know how to credit the asset if published• You know what rights you have licensed (if not your own)

Page 14: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 15: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

• How do we add this “smartness”?• By using “Standard” Metadata Schemas

• Exif• IPTC-IIM• IPTC Core• IPTC Extension• Dublin Core• PLUS• PMI (PRISM Metadata for Images)

For details visit: http://www.photometadata.org/META-101-metadata-types

Page 16: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

©D

avid

R

ieck

s

Page 17: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

Page 18: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

• How do we add this “smartness”?• By using “Standard” Metadata Schemas• By embedding the metadata values with software like:

• Adobe Bridge• Adobe Creative Suite Apps (Photoshop/Illustrator/InDesign)• Photo Mechanic (Camerabits.com)• Media Pro (PhaseOne.com)• Apple Aperture• Or Other DAM Software

Page 19: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS

• “Smart” Assets are ideal for distribution:• Self-describing• Recipient can see all (or nearly all) the data you can.• All “derivative” files inherit this “smartness” as well

Read about a new initiative…..

Page 20: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital

media and should be applied to all media items which are exchanged as files or by other means such as data streams.

View more at: http://www.embeddedmetadata.org/

Page 21: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital

media and should be applied to all media items which are exchanged as files or by other means such as data streams.

• 2. Media file formats should provide the means to embed metadata in ways that can be read and handled by different software systems.

Page 22: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:• 1. Metadata is essential to describe, identify and track digital

media and should be applied to all media items which are exchanged as files or by other means such as data streams.

• 2. Media file formats should provide the means to embed metadata in ways that can be read and handled by different software systems.

• 3. Metadata fields, their semantics (including labels on the user interface) and values, should not be changed across metadata formats.

Page 23: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:• 4. Copyright management information metadata must never

be removed from the files.

Page 24: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

EMBEDDED METADATA MANIFESTO

• The Five Guiding Principles:• 4. Copyright management information metadata must never

be removed from the files.• 5. Other metadata should only be removed from files by

agreement with their copyright holders.

View more at: http://www.embeddedmetadata.org/

Page 25: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

TESTING ASSETS FOR METADATA PRESERVATION

• Test by Reading One or All After Processing:• Use off the shelf DAM “Cataloging” software

• Media Pro• Idimager• Extensis Portfolio• Canto Cumulus

• Use Originating software (Photoshop / Bridge)• Use Command Line tools (ExifTool)• Use free reader tools

• Mac: Apple Preview, Spotlight, Search function in Finder• Windows: IrfanView, Exifer, etc. • AIR: EMET (Embedded Metadata Extraction Tool)

Page 26: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

TESTING ASSETS FOR METADATA PRESERVATION

• ALWAYS! Test After Moving or Posting Online

Jeffrey’s Online Metadata Viewer

http://regex.info/exif.cgi

Sample image available at:

http://www.controlledvocabulary.com/socialmedia/cv-testbed_social-media.jpg

The base URL http://www.controlledvocabulary.com/socialmedia/ for info on how the various social media and Photo Sharing sites fare.

Page 27: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

TESTING ASSETS FOR METADATA PRESERVATION

Page 28: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT KIND OF ASSETS CAN WE MAKE “SMART”

• Image Files• JPEG• TIFF• PSD• EPS• DNG

©D

avid

R

ieck

s

Page 29: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT KIND OF ASSETS CAN WE MAKE “SMART”

• Document / Illustration Files• PDF• Adobe Illustrator (AI)• Adobe InDesign (INDD)• EPS

Page 30: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT KIND OF ASSETS CAN WE MAKE “SMART”

• Document / Illustration Files• PDF• Adobe Illustrator (AI)• Adobe InDesign (INDD)• EPS

Page 31: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT KIND OF ASSETS CAN WE MAKE “SMART”

• Audio Files*• XMP or ID3 metadata tags?• mp3, aiff,aif, wav, m4p, m4a, snd

• Video Files*• XMP or QuickTime wrapper?• mov, mpg, mp4, divx, qtz, avi, wmv, dv

*These file formats have decreased interoperability

Page 32: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT DO YOU ADD TO MAKE AN ASSET “SMART”?

Use the “Guide To Photo Metadata Fields”

http://photometadata.org/META-Resources-Field-Guide-to-Metadata

Page 33: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

WHAT DO YOU ADD TO MAKE AN ASSET “SMART”?

Use the “IPTC / CEPIC Image Metadata Handbook

http://www.iptc.org/goto?imagemetadatahandbook

Page 34: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

QUICK REVIEW: UPSIDES

• Advantages to Embedded Metadata?• Metadata makes it easy to find/locate assets in collections• Find the File >> Find the Info• Metadata travels with any “derivative” files you distribute.• Can be “leveraged” in downstream uses.

Page 35: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

CONNECTING THE DOTS: REVIEW

• Advantages to Embedded Metadata?• Metadata makes it easy to find/locate assets in collections• Find the File >> Find the Info• Metadata travels with any “derivative” files you distribute.• Can be “leveraged” in downstream uses.

Page 36: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

QUICK REVIEW: DOWNSIDES

• What are the Downsides to embedding metadata?• Not all applications “know” to use this info• Some applications may inadvertently remove some/all info• Increased time to update info in large files (photos or videos)

Page 37: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

HOW DO I FIND OR CREATE METADATA / TAGS?

• Tips to create “Smart” Collections• Add metadata before placing assets into database• Make adding metadata part of a documented workflow• Require users to annotate and tag assets they contribute• “Batch” the addition of metadata whenever possible• Use a “controlled vocabulary” when adding keywords

Page 38: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

HOW DO I FIND OR CREATE METADATA / TAGS?

• How do you want to “Tag” your Assets?• Manually: In-house• Manually:

• Request to Suppliers• Out-source to Third Party

• Auto-magically

Page 39: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS WITH CONTROLLED VOCAB

Page 40: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS WITH CONTROLLED VOCAB

Start with:

feet

shoes

girl

youth

grass

humor

outside

©D

avid

R

ieck

s

Page 41: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

MAKING “SMART” ASSETS WITH CONTROLLED VOCAB

w/ Controlled Vocabulary:

child, juvenile, 4-12 years old, people, human beings, humans, person, body, foot, feet, fashion, clothing, apparel, clothes, womens clothing, womens apparel, women’s clothing, attire, shoes, shoe, female, girl, lass, plants, grass, humor, outside, outdoors©

Dav

id

Rie

cks

Page 42: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package• Documents give sample workflow overviews• Special PDF allows you to make and print your own

http://www.iptc.org/goto?imagemetadatahandbook

Page 43: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 44: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 45: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 46: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 47: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 48: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 49: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DATA EXPORT & IMPORT

Page 50: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DATA EXPORT & IMPORT

• Code Replacement& Variables in PhotoMechanic used to Import Keywords

• See handout for URL

Page 51: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

www.controlledvocabulary.comtwitter: @davidriecks

DOCUMENTING YOUR WORKFLOW

• Use the IPTC / CEPIC package

Page 52: “INDEXING” NON-TEXT ASSETS Download resource handout for seminar at:  David Riecks, Project Leader,

Download resource handout for seminar at:http://www.controlledvocabulary.com/sla/sla-chi.pdf

David Riecks, Project Leader, http://photometadata.org; Chief Technical Advisor, PLUS (http://www.useplus.org);IPTC Photo Metadata Working Group Member; and Founder of http://ControlledVocabulary.com/Twitter: @davidriecks

QUESTIONS?

“INDEXING” NON-TEXT ASSETS