A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill

Post on 17-Dec-2015

TRANSCRIPT

Page 1

A Peek Inside the Carolina Digital Repository

Michael Daines
Digital Repository Analyst

UNC – Chapel Hill

Page 2
Page 3

Goals

Page 4

What’s in the repository?

Page 5

What’s in the repository?

• 41158 images
• 18671 texts (PDF, Microsoft Word, text files)
• 11856 audio files
• 1438 datasets
• 54 video files

(As of July 17, 2013)
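As a quick sense of scale, the counts above can be totalled and turned into shares. This is a minimal sketch that assumes the five categories do not overlap, which the slide does not state; the names `counts` and `shares` are illustrative only.

```python
# Counts by type as listed on the slide (as of July 17, 2013).
# Assumption: the categories are disjoint, so they can simply be summed.
counts = {
    "images": 41158,
    "texts": 18671,
    "audio files": 11856,
    "datasets": 1438,
    "video files": 54,
}


def shares(counts):
    """Return the total and each category's percentage of that total."""
    total = sum(counts.values())
    return total, {name: 100 * n / total for name, n in counts.items()}


total, percent = shares(counts)
print(f"total: {total}")
for name, value in percent.items():
    print(f"{name}: {value:.1f}%")
```

Under that assumption, images make up a little over half of the listed files.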

Page 6

What’s in the repository?

• Research Laboratories of Archaeology: 35502 images (photographs and scans)

• Electronic Theses and Dissertations: 4035 PDFs

• BioMed Central: 1777 PDFs (articles)

(As of July 17, 2013)

Page 7

How to show what we have?

Page 8
Page 9
Page 10

“Peek”

https://github.com/UNC-Libraries/peek

Page 11
Page 12
Page 13
Page 14
Page 15

How do we find interesting images?

Page 16

Cover pages?

Page 17

Random pages?

Page 18
Page 19

How do we find interesting images?

Query → Download → Split → Resize → Choose

Page 20

Query, Download

Solr query
Download public datastreams
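The query step might look roughly like the sketch below. It only builds a request URL for Solr's standard `select` handler and reads identifiers out of a parsed JSON response; it does not contact any server. The endpoint URL, the `id` and `title` field names, and the match-all query are hypothetical, since the slides do not give the repository's actual Solr schema or query. The default of 2000 rows is likewise only an illustrative choice.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the real Solr URL is not given in the slides.
SOLR_SELECT_URL = "https://solr.example.org/solr/repository/select"


def build_query_url(query, fields, rows=2000, base_url=SOLR_SELECT_URL):
    """Build a Solr select URL using the common q, fl, rows and wt parameters."""
    params = {"q": query, "fl": ",".join(fields), "rows": rows, "wt": "json"}
    return f"{base_url}?{urlencode(params)}"


def extract_ids(response, id_field="id"):
    """Collect identifiers from a parsed Solr JSON response (response -> docs)."""
    return [doc[id_field] for doc in response["response"]["docs"] if id_field in doc]


# "*:*" is Solr's match-all query; the field names are assumptions.
url = build_query_url("*:*", ["id", "title"])
print(url)
```

The identifiers returned by such a query would then drive the download of each object's public datastreams, a step not sketched here because the slides do not describe the download interface.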

Page 21

Split, Resize

CoreGraphics
ImageMagick
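For the resize step, one possible approach is to call ImageMagick from a script. The sketch below assumes the `convert` program is on the path (ImageMagick 7 installs it as `magick`), and the 300-pixel bound and file names are made up for illustration. The slides do not say which tool handled which step, so the split step is not shown.

```python
import subprocess


def resize_command(source, destination, max_side=300, program="convert"):
    """Build an ImageMagick command that fits an image inside a square box.

    A plain WIDTHxHEIGHT geometry keeps the aspect ratio and scales the
    image to fit within the given box.
    """
    geometry = f"{max_side}x{max_side}"
    return [program, str(source), "-resize", geometry, str(destination)]


def resize(source, destination, max_side=300):
    """Run the command; raises CalledProcessError if ImageMagick fails."""
    subprocess.run(resize_command(source, destination, max_side), check=True)


# Hypothetical file names; only the command is built here, nothing is executed.
print(resize_command("page-001.png", "thumb-001.png"))
```

Building the argument list separately from running it makes the command easy to inspect or log before processing tens of thousands of images.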

Page 22

Choose

Page 23

Initial set

2000 objects
35855 images split

425 images for homepage
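The numbers for the initial set imply a steep funnel, which a few lines of arithmetic make concrete. This sketch assumes the 35855 split images all came from the 2000 objects and that the 425 homepage images were chosen from those 35855, which is how the slide reads but is not stated outright.

```python
# Figures from the "Initial set" slide.
objects = 2000
split_images = 35855
chosen = 425

# Average number of images produced per object by the split step.
images_per_object = split_images / objects

# Share of split images that were kept for the homepage.
chosen_percent = 100 * chosen / split_images

print(f"{images_per_object:.1f} images per object")
print(f"{chosen_percent:.2f}% of split images chosen")
```

On these assumptions, each object yielded about 18 images on average, and roughly 1 in 84 split images reached the homepage.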

Page 24

Further work

• Larger sample?
• Automation?
• Integration with repository?
• Collaborative filtering?
• Image classification?
• No processing step?
• A/V objects?
• Bias?

Page 25

Try it!

https://cdr.lib.unc.edu/

https://github.com/UNC-Libraries/peek

https://github.com/UNC-Libraries/peek-data