sir alister hardy foundation for ocean science history of the continuous plankton recorder database...
TRANSCRIPT
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
History of the Continuous Plankton Recorder Database
D. Stevens and P.C. Reid
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Introduction
• Continuous Plankton Recorder Survey
• The development of the CPR database
• Recent and future development
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
The CPR Survey
• Monitor the near-surface plankton of the North Atlantic and North Sea on a monthly basis
• Year to year changes in abundance
• Seasonal cycles of species
• Geographical distributions
• Effects of fisheries and climate change
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
How data are collected
• The CPR is a plankton sampling instrument designed to be towed from merchant ships on their normal sailings
• Alister Hardy used the first prototype to sample krill in the Antarctic on the Discovery cruises of 1925-27
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
How data are analysed
• Phytoplankton x450 magnification
~1/8000 of sample
• Zooplankton traverse (<2mm)x48 magnification
~1/40th of sample
• Zooplankton eye count (>2mm)removed and counted
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
How data are STORED
• Before computers• The first computerised database• Geographical effects• Updated computer languages• Current system
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Before Computers
• All calculations had to be carried out by hand
• Sample positions
• Cutting points
• Local time
Before1969
1970 1980 1990 2000
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Before Computers (2)• Data were stored on paper for each tow
• This data would then be transposed onto cards for each species
Sample No
Spe
cies
nam
e
Tow record
Sample NoT
ow n
ame
Species card
1970 1980 1990 2000
Before1969
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Before Computers (3)
• This task was very time consuming.
• The analysis team of more than 12 spent 1/3rd of their time entering data.
• Process was in need of speeding up.
1970 1980 1990 2000
Before1969
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
The first computerised database• Based at Edinburgh Regional Computing Centre (ERCC)• Prior to this Principle Component Analysis had been
carried out using a DEUCE electronic computer• Data processed using a KDF 9 (32Kb)
1969
1970 1980 1990 2000
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Card Reader
Operator's Console Tape Drives
Pictures from www.cs.bham.ac.uk/.../history/ photogallery/pages/1968-3.htm
1969
1970 1980 1990 2000
KDF 9
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
The first computerised database• Based at Edinburgh Regional Computing Centre
(ERCC)
• Prior to this Principle Component Analysis had been carried out using a DEUCE electronic computer
• Data processed using a KDF 9 (32Kb)
• ALGOrithmic Language (ALGOL)
1969
1970 1980 1990 2000
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
External Changes
• ALGOL no longer supported by the ERCC
• Programs transfer to IMP and Fortran (IV)G
1969
1970 1980 1990 2000
1971
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Relocation
• Move to Plymouth, UK
• PDP 11 (16-bit)
• Programs rewritten into BASIC
Image from: www.montagar.com/dfwcug/VMS_HTML/timeline/16-bit.htm
1970 1980 1990 2000
1969 1971 1977
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
PDP11
Source: http://www.telnet.hu/hamster/pdp-11/
1970 1980 1990 2000
1969 1971 1977
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Hardware changes• CPR database was moved to a GEC 41/90
1970 1980 1990 2000
1969 1971 1977 1982
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
1970 1980 1990 2000
1969 1971 1977 1982
Disc Drive (70Mb)
Tape Reader
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
1970 1980 1990 2000
1969 1971 1977 1982 1989
Further Changes
Out with the old...
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
1970 1980 1990 2000
1969 1971 1977 1982 1989
IBM 93/70
..in with the new
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Rapid Changes• The database moved home again
to RS 6000– This home was short lived due to
problems networking the RS 6000
• Now all data processing was programmed in Fortran 77
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Development using Fortran
• The database was stored on a Solaris 2.4
• Processing was carried out using an IBM PC
1970 1980 1990 2000
1969 1971 1977 1982 1989 1992
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
ORACLE
• First relational database contain CPR data
• Due to cost of maintaining ORACLE system was dropped
1970 1980 1990 2000
1969 1971 1977 1982 1989 1993
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
ACCESS - Current system
• Relational database developed in ACCESS
• User friendly interface
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
ACCESS - Current system
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Further development• Sample processing transferred to
Microsoft Access
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Consistency• Strategy to ensure all systems are stored on
current media.– Some programs can be lost because the media on
which that have been stored has become obsolete
• Ensure same algorithms are applied to data
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Quality Control
• Computers are not always a good thing.
• Basic QC programme used to check data and advice given.
• Analysis sheets are checked by experienced analysts.
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Data Policy
• SAHFOS operates an open access data policy
• Data requests made to the Data Manager.
• Policy statement available on the web site
– www.sahfos.org
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
CPR Plankton Atlas
1970 1980 1990 2000
2001
1973
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
DATA AVAILABLE• The CPR database contains information for
– 185902 samples (1946-2001)• 4.7 million miles towed
– 2198052 positive plankton entries.
• 90906078 plankton entries • CPR bibliography contains
– 775 references.
– http://www.sahfos.org/bibliography.htm
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Web Development
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 2001
http://www.sahfos.org/standard_areas.htm
S
ir A
liste
r H
ard
y F
ou
nd
atio
n f
or
Oce
an
Sci
en
ce
Future development
• Web based development– increase ease of access to the database
• Expansion to include further environmental variables.
• Plankton Atlas available on CD or the Web
• CPR Browser available on CD
1970 1980 1990 2000
1969 1971 1977 1982 1989 1991-93 1995 Future