Arctos: A multi-institution, multi-collection museum database


Upload: edwin-reed

Post on 18-Jan-2018



TRANSCRIPT

Arctos: A multi-institution, multi-collection museum database

Major repositories using the Arctos database: (34 collections of specimens or observations, 1.3M records)
In partnership with [name not captured], which is a member of TeraGrid (a nationwide network of 11 supercomputing facilities), which is sponsored by the U.S. National Science Foundation's Office of Cyberinfrastructure.

Arctos: A 15 year history
- MVZ: Hired Stan Blum to develop relational data model (following modeling by Assoc. Systematic Collections).
- MVZ: Hired John Wieczorek to implement model (desktop application) using Sybase and Versata. Partial implementation (e.g., no loans).
- UAM: John W. migrated mammal data to Oracle, set up Versata.
- UAM: Dusty McDonald replaced Versata with ColdFusion, implemented full model (first web-based instance, aka Arctos).
- MSB: 2003 Joined Arctos at UAM (first multi-hosting instance).
- MVZ and MCZ: Implemented separate instances of Arctos at Berkeley and Harvard (MVZ: first Postgres, then Oracle).
- MVZ: Moved hosting of data to Alaska (Virtual Private Database version).

Arctos is an ongoing effort to integrate access to museum data, collection management tools, and external resources on the Web.
[Diagram labels: Specimen Catalog: label data (and more); Projects contribute and/or use specimens; Accessions; Loans, usage; Publications cite specimens; GenBank; Federated portals; Arctos; BerkeleyMapper; Media in TeraGrid; The rest of Cyberspace; Citations]

Arctos Philosophy
- Maximize access to high quality information about collections on the web.
- Integrate different kinds of information (specimen data, DNA sequences, media, etc.) into easily and visually retrievable records.
- Facilitate management of data by curators and collection managers.
- Collaboration (development, curatorial) is key to success.
- Show value of specimens for scientific research to the public through web resources.

Breadth of Data in Arctos
- Fish, amphibians, reptiles, mammals, birds and bird eggs/nests, plants, arthropods, fossils, molluscs
- Specimens and observations
- Media (images, audio)
- Publications, fieldnotes
Arctos is constantly evolving to incorporate new kinds of data, e.g.:
- Better representation of non-publication documents (fieldnotes, correspondence)
- Cultural collections (art, anthropology...)
Nearly all that is known about an object (or observation) can be included in Arctos.

Linking specimen records to archival documentation

Access and Security
- Virtual Private Database (Row-Level Security) allows collections to maintain control over data while sharing certain nodes (taxonomy, agents, geography).
- Access to data is controlled by user privileges (collections, tables).
- Public can query without user account. More options if logged in.
- Business rules enforced at database (not application) level.

Arctos is a versatile online collections management system
- Cataloged Items (ID, attributes, parts, etc.; batch uploading, downloading, editing; encumbrances)
- Localities & Collecting Events (mapping, media, history)
- Transactions (loans, accessions, borrows, permits; reminders)
- Usage (publications, projects, sponsors, GenBank)
- Curatorial (object tracking, parts, condition, relations, etc.)
- Determination history (identification, georef, attributes)

Arctos is Modular and Pluggable
- Projects and Publications
- Media (local, external URI)
- External Services (GenBank, TACC, MorphBank, CalPhotos, uBio, etc.)
- Collaborative DiGIR (ORNIS, HerpNET, MaNIS, VertNet, etc.)

Arctos is Customizable
Specific features, e.g.:
- Attributes (A cataloged item can have any number of attributes, customized to collections. A generic solution.)
- Users (search, results, identifiers, download; save and [...] searches)
- Managers (users, headers, CSS, reports)

Arctos is Open
- Open source code
- Built upon community needs
- Compartmentalized (choose what you want to share)
- Community input valued (user group, code issue list)
- Responsive development model (bugs generally fixed within hours, requests within days to weeks unless major new feature)
- Community discussion drives development

Arctos is Economical
- $15,300/year hosting (currently split by 3 institutions):
  - secure environment
  - dedicated web server and database server
  - server and database administrators
  - Oracle 11G
  - offsite backups in Alaska, TACC, San Diego
- 1 full-time developer
- 1 part-time DBA
- Lots of input from users

Arctos Multihosting
- Shared Code
- Shared Cost
- Shared Expertise (Curatorial, Technical)
- Location not important (electrons are fast!)
- MCZ ?

Goal
Run Arctos as a single system from one or more mirrored locations, from which each participating institution can be represented to its users and operators as an exclusive instance through row-level security, but in which vocabulary, programming, and physical resources are shared.
(No technical issues to adding collections. Social issues in multihosting.)

Governance and Operating Support
Steering Committee and Advisory Group with representatives of each institution contributing to operational budget.
- Steering Committee (institutional Directors or designees): budget, proposals for funding, proposals to connect to external resources, addition of collections, etc.
- Advisory Group (curators): priorities for development
  - Chair of Advisory Group oversees activity of developer(s), reports to Steering Committee
- Operating costs shared among institutions, funded by outside grants
  - identify developments that require outside funding vs. in-house support

Summary
- Cost-effective access to a deeply relational, highly expandable database capable of recording all aspects of specimen curation, running on enterprise-caliber hardware and software, managed by professional IT staff.
- Community of experienced users provides input into data structure and new development.
- Burden of development and operation shared among institutions.
- All Arctos data are online. The public can query it; privileged users manage it through web browsers.
- Demonstrates and documents the scientific context of specimens (projects, documents, media, etc.), more than just label data.
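
The slides describe row-level security as the mechanism that lets each collection control its own records while sharing nodes such as taxonomy, and note that the public can query without an account. The following is a minimal Python sketch of that idea only. It is not Arctos code: Arctos enforces these rules inside Oracle, whereas this sketch checks them in application code for readability. Every name here (`User`, `CatalogedItem`, `search`, `set_attribute`, the collection codes, and the sample records) is hypothetical, and the assumption that public queries see rows from every collection is ours.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class User:
    """A logged-in operator; `collections` lists the collections they may manage."""
    name: str
    collections: frozenset = frozenset()


@dataclass
class CatalogedItem:
    """One row in the shared catalog (illustrative fields only)."""
    collection: str                 # owning collection (hypothetical code)
    catalog_number: str
    taxon: str                      # refers to the shared taxonomy
    attributes: dict = field(default_factory=dict)  # any number of attributes


# Shared node: a single taxonomy used by every collection.
SHARED_TAXONOMY = {"Sorex cinereus", "Rana aurora"}

# Hypothetical sample rows from three collections in one shared table.
ITEMS = [
    CatalogedItem("UAM:Mamm", "1001", "Sorex cinereus", {"weight": "4.1 g"}),
    CatalogedItem("MVZ:Mamm", "2002", "Sorex cinereus", {"sex": "female"}),
    CatalogedItem("MVZ:Herp", "3003", "Rana aurora"),
]


def search(taxon: str) -> list:
    """Public query: no account needed (assumed to see rows from all collections)."""
    return [item for item in ITEMS if item.taxon == taxon]


def set_attribute(user: Optional[User], item: CatalogedItem,
                  name: str, value: str) -> None:
    """Row-level check: only users with rights on the owning collection may edit."""
    if user is None or item.collection not in user.collections:
        raise PermissionError(f"no edit rights on {item.collection}")
    item.attributes[name] = value


uam_manager = User("uam_manager", frozenset({"UAM:Mamm"}))
shrews = search("Sorex cinereus")                             # UAM and MVZ rows
set_attribute(uam_manager, shrews[0], "age class", "adult")   # allowed: UAM row
```

In this sketch the UAM manager can add an attribute to the UAM record, while the same call on the MVZ record raises `PermissionError`. Both collections still share one taxonomy and one code base, which is the arrangement the Goal slide describes.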