SCELC Colloquium: Shared Print Analysis
SCELC Colloquium
What OCLC data analysis reveals about SCELC libraries
John McDonald
CIO, Claremont University Consortium
March 6, 2013
SCELC’s Need for DATA
• Nascent resource sharing program (CAMINO)
What can I get out of this if I join?
• Interest in shared print preservation program
What will I be obligated to keep if I join?
• Some have interest in closer collaborative collection
development
What can I stop buying or what else can I buy?
OCLC Data Analysis
• SCELC officially requested provision of print book
holdings from OCLC for a portion of its members
• Holdings requested for 56 SCELC schools (50% of membership)
• Simple Data provided:
By OCLC Number
Holding Libraries by Symbol
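The extract described here is essentially a mapping from each OCLC number to the symbols of the libraries holding it. A minimal sketch of loading and summarizing such data follows. The tab-separated layout, the function names, and the symbols AAA/BBB/CCC are all assumptions made for illustration; the slides do not show the actual file layout.

```python
from collections import Counter

# Hypothetical sample rows: OCLC number <TAB> space-separated holding symbols.
# The real extract's layout is not given in the slides.
SAMPLE = """\
1001\tAAA BBB CCC
1002\tAAA
1003\tBBB CCC
1004\tCCC
"""

def parse_holdings(text):
    """Return {oclc_number: set of holding symbols} from the assumed layout."""
    holdings = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        oclc, symbols = line.split("\t")
        holdings[oclc] = set(symbols.split())
    return holdings

def summarize(holdings):
    """Count distinct titles, total holdings, and titles held per library."""
    per_library = Counter()
    for symbols in holdings.values():
        per_library.update(symbols)
    return len(holdings), sum(per_library.values()), per_library

holdings = parse_holdings(SAMPLE)
titles, total_holdings, per_library = summarize(holdings)
```

Run against the full extract, `titles` and `total_holdings` would correspond to the book and holdings totals reported on the next slide.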
OCLC Data Analysis
• 2.2 Million Books (or 2,190,464 to be exact)
• 5.5 Million Holdings (or 5,558,921 to be exact)
Data looks a little like this…
[Chart: Total Books Held, by Library. Vertical axis: 0 to 600,000 books. Libraries in chart order: Claremont, Santa Clara, LMU, USF, Oxy, Fuller, Pepperdine, Caltech, University of the Pacific, Biola, La Sierra, Azusa Pacific, Loma Linda, St. Mary's, La Verne, Pacific Union College, Point Loma Nazarene, California Lutheran, Claremont School of …, Golden Gate Baptist …, Mills College, American Jewish …, Westmont College, Simpson University, Vanguard University, Cal Arts, Cal Baptist, Monterey Institute, Dominican, Mount St. Mary's, Whittier, Woodbury, San Diego Christian, Golden Gate, Hope International, John F. Kennedy, Menlo College, William Jessup, Holy Names, Marymount College, Cal Inst of Integral …, Sierra Nevada, Western University of …, City of Hope, Alliant San Diego, Wright Institute, Charles Drew, Palo Alto University, Alliant - SF, San Francisco …, Alliant International …, Alliant - Fresno, Inst of Transpersonal …, Notre Dame de Namur, SF Center for …, Alliant - Irvine]
So what? What will the data tell us…
Who makes a good resource sharing partner?
Who makes a good shared print partner?
What traits can influence a Library to join a
program or start a partnership?
Who is best to collaborate with on collections in the future?
[Chart: scatter plot of libraries. Vertical axis: "Total Portion of Collection" (0% to 30%). Horizontal axis: "Unique across all Libraries" (0.0% to 70.0%). Labeled points: Fuller Theological Seminary, 100K; Caltech, 75K; Claremont, 180K; LMU, USF, Santa Clara, 70-80K each; American Jewish University, 50K; Occidental, 50K]
Shared Print: Find Unique Holdings to Maximize Preservation
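One way to find preservation candidates is to count, for each library, the titles that no other library in the data set holds. The sketch below assumes holdings are already loaded as a mapping from OCLC number to a set of library symbols. The toy data, the symbols, and the function name are hypothetical, not taken from the presentation.

```python
from collections import Counter

# Hypothetical toy data: OCLC number -> symbols of holding libraries.
holdings = {
    "1001": {"AAA", "BBB", "CCC"},
    "1002": {"AAA"},
    "1003": {"BBB", "CCC"},
    "1004": {"CCC"},
    "1005": {"AAA"},
}

def uniqueness_by_library(holdings):
    """Return {symbol: (unique_count, total_count, unique_share)}."""
    total = Counter()
    unique = Counter()
    for symbols in holdings.values():
        total.update(symbols)
        if len(symbols) == 1:
            # Only one holder, so the title is unique to that library.
            unique.update(symbols)
    return {
        lib: (unique[lib], total[lib], unique[lib] / total[lib])
        for lib in total
    }

report = uniqueness_by_library(holdings)
```

Here `report["AAA"]` shows 2 unique titles out of 3 held, while `report["BBB"]` shows none. The unique count and the unique share can rank libraries differently, so both are worth reporting.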
Shared Print: Find Overlap Holdings to Maximize Deselection
[Chart: vertical axis "Books also held by Claremont"]
Shared Print: Find Overlap as a % of Collection
[Chart: vertical axis "% of Collection held by Claremont", 0.0% to 70.0%]
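The two overlap views (raw count and percentage of collection) can be sketched with set intersections against one reference library. This is an illustrative sketch only: the symbols, the sample title IDs, and the function name are assumptions, with "REF" standing in for the reference collection.

```python
def overlap_with(reference, collections):
    """For each other library, return (shared_count, share_of_its_collection)."""
    ref_titles = collections[reference]
    result = {}
    for lib, titles in collections.items():
        if lib == reference:
            continue
        shared = len(titles & ref_titles)
        share = shared / len(titles) if titles else 0.0
        result[lib] = (shared, share)
    return result

# Hypothetical collections keyed by library symbol; values are sets of title IDs.
collections = {
    "REF": {1, 2, 3, 4, 5, 6},
    "AAA": {1, 2, 7, 8, 12, 13},
    "BBB": {5, 9},
    "CCC": {10, 11},
}

overlap = overlap_with("REF", collections)

# Rank by raw count, then by share of collection, to compare the two views.
by_count = sorted(overlap, key=lambda lib: overlap[lib][0], reverse=True)
by_share = sorted(overlap, key=lambda lib: overlap[lib][1], reverse=True)
```

In this toy data AAA leads by count (2 shared titles) but BBB leads by share (half of its collection), which is why the count and percentage views can point to different deselection partners.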
[Chart: scatter plot of libraries. Vertical axis: "Total Portion of Collection" (0% to 20%). Horizontal axis: "Unique from Claremont" (40% to 100%). Labeled points: LMU, USF, Santa Clara, 200-250K each; Fuller Theological Seminary, 230K; Loma Linda, 120K; Biola, 135K; Caltech, 150K]
Resource Sharing: Find Libraries Most Unlike Us
[Chart: bars per CAMINO library, scale 0 to 1,200,000 books. Libraries: CUC, LMU, Oxy, Pep, UOP, CST, Wstmt, CalArts, CBU, Dom, WJU, WUHS, AJU, HNU. Series: Books held only by library; Books held by BOTH library and the rest of Camino; Books held only by the rest of Camino]
Resource Sharing: CAMINO Collections
[Chart: Percentage of Each Library's books that are Unique to Camino, scale 0% to 100%. Libraries: CUC, LMU, Oxy, Pep, UOP, CST, Wstmt, CalArts, CBU, Dom, WJU, WUHS, AJU, HNU]
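The three series in the CAMINO charts (held only by the library, held by both, held only by the rest of the group) amount to set differences and an intersection between one library and the union of all the others. A minimal sketch, using hypothetical symbols and title IDs rather than real CAMINO data:

```python
def group_breakdown(collections):
    """Return {library: (only_library, both, only_rest, pct_unique)}."""
    breakdown = {}
    for lib, titles in collections.items():
        # Union of every other member's titles.
        rest = set().union(
            *(t for other, t in collections.items() if other != lib)
        )
        only_lib = len(titles - rest)   # held only by this library
        both = len(titles & rest)       # held by this library and the rest
        only_rest = len(rest - titles)  # held only by the rest of the group
        pct_unique = 100.0 * only_lib / len(titles) if titles else 0.0
        breakdown[lib] = (only_lib, both, only_rest, pct_unique)
    return breakdown

# Hypothetical group of three libraries; values are sets of title IDs.
collections = {
    "AAA": {1, 2, 3, 4},
    "BBB": {3, 4, 5},
    "CCC": {4, 6},
}

breakdown = group_breakdown(collections)
```

For AAA this yields 2 titles held only by AAA, 2 held by both, and 2 held only by the rest, so 50% of AAA's collection is unique within the group. The third number is what a member stands to gain from resource sharing; the first is what it contributes.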
[Chart: bars per library, scale 0 to 400,000, two series: Unique to Library and Total Books]
Library                Unique to Library    Total Books
Santa Clara                      119,012        394,706
USF                              115,071        377,476
Fuller                           149,023        270,988
Caltech                           96,352        226,332
Biola                             68,159        182,984
La Sierra                         49,731        150,355
Azusa Pacific                     47,403        148,658
Loma Linda                        73,435        136,616
St. Mary's                        32,632        129,135
La Verne                          33,712        116,530
Pacific Union                     35,796        101,624
Point Loma Nazarene               30,527        100,582
Cal Lutheran                      23,506         94,953
Resource Sharing: Prospective CAMINO Collections
Resource Sharing: Prospective CAMINO Collections
Caltech 60%, Both 5%, Loma Linda 35%
Fuller 54%, Both 14%, Biola 32%
Santa Clara 40%, Both 22%, USF 38%
Potential for this data
• Data has proven to be valuable in modeling potential for resource sharing, print preservation, and collaboration
• Additional areas of analysis:
▫ Overlap and uniqueness by publication year and subject area (LC Call Number)
▫ Paired and multiple modeled scenarios
• OCLC data is just a snapshot in time (and already outdated)
• OCLC is hard to work with and can be expensive
Potential Next Steps
• Need data from members directly
▫ Could include circulation
▫ Simple data extraction should be easy and can be supplemented by OCLC API
• Find appropriate permanent home for database
• Develop self-service tool with (close to) real-time data
• Determine if new OCLC Collection Analysis tool will provide the same or similar information