franz et al. scan - southwest collections of arthropods network: leveraging filtered push technology...

21
Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications Nico Franz 1 , Edward Gilbert 1 , Neil Cobb 2 & Paul Morris 3 1 School of Life Sciences, Arizona State University 2 Merriam-Powell Center, Northern Arizona University 3 Museum of Comparative Zoology, Harvard University TDWD 2013 Annual Conference, Florence, Italy Biodiversity Data Quality Issues, Methods and Tools October 29, 2013

Upload: taxonbytes

Post on 07-Jul-2015

632 views

Category:

Education


0 download

DESCRIPTION

SCAN - the Southwest Collections of Arthropods Network (http://symbiota1.acis.ufl.edu /scan/portal/) - is the first regional arthropod biodiversity data network that utilizes the Symbiota software platform (http://symbiota.org/tiki/tiki-index.php). Since its origin in 2012 SCAN has unified and newly created specimen-level occurrence records on-line pertaining to nearly 15 south-western United States arthropod collections; including more than 515,000 records that represent some 18,000 species. However, due to the disproportionately mismatched diversity versus taxonomic expertise for the region and focal taxa, at least one third of the specimens are not identified (authoritatively or otherwise) to the level of species, with concomitant limitations for derivative taxonomic or evolutionary/ecological research. The member collections are typically separated from each other geographically by distances that prohibit frequent interactions with regional or global experts, except in virtual realm. SCAN has therefore implemented a Filtered Push (FP) based service (http://wiki.filteredpush.org /wiki/) whose primary purpose is to connect high-quality imaged of yet insufficiently identified specimens with suitable experts who can provide identifications remotely. This is achieved through the FP-server system which both records these contributions externally and pushes them back into the source Symbiota platform for review, acceptance, or rejection by the respective collection/node leaders. SCAN is therefore primed to utilize FP at a large scale and with a well circumscribed focal purpose that is relevant to the specific needs of this collections network. We illustrate the SCN/FP workflow, underlying concepts and technology, and current state of implementation and usage. FP allows experts to gradually accumulate credit and "reputations" for their identification contributions, and thus represents a promising means to improve data quality through transparent and distributed expert involvement and attribution.

TRANSCRIPT

Page 1: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Leveraging Filtered Push Technology to

Enhance Remote Taxonomic Identifications

Nico Franz1, Edward Gilbert1, Neil Cobb2 & Paul Morris3

1 School of Life Sciences, Arizona State University2 Merriam-Powell Center, Northern Arizona University3 Museum of Comparative Zoology, Harvard University

TDWD 2013 Annual Conference, Florence, Italy Biodiversity Data Quality – Issues, Methods and Tools

October 29, 2013

Page 2: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Currently 9 Thematic Collection Networks with 130 participating institutions

Advancing Digitization of Biodiversity Collections (NSF ADBC Program) Digitize 1 billion specimens in 10 years

Page 3: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

UNMTAMU

ASU

DMNS

UCB

NMSU

CSU

TTU

NAU

UA

SCAN member collectionsAverage ~ 480 miles apart

Page 4: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

SCAN digitization objectives

• Digitize 1 million records for southwestern ground-dwelling arthropods

• Produce 16,000 high-resolution images of species; promote identifications

• Leverage an interactive identification & annotation workflow via Symbiota

Gerstaeckeria porosa (LeConte, 1876) – ASUHIC0017017 Crotanius trivittatus (Champion, 1908) – ASUHIC0012067

Page 5: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

SCAN ADBCGround-Dwelling

Arthropod Records

SCAN ADBCNon-Target Taxa

Records

SCAN ADBC Collections

SCAN Non-ADBC Broader Impact

Records

SCAN Broader Impact Collections

September, 2013: 510,262 records in SCAN

• 510,262 specimens in Symbiota• 300,984 (59%) georeferenced• 338,836 (66%) identified to species• 1,016 families• 8,056 genera• 17,538 species

Primary need:remote IDs

Page 6: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Deployment diagram – Symbiota & Filtered Push interaction

Source: http://wiki.filteredpush.org/wiki/FP-Medium_deployment_for_SCAN

Filtere

dP

ush

3N

od

eh

ttp://fp

3.acis.u

fl.edu

/FPAn

no

tation

Pro

cessor-W

eb/

• New, remotely added identifications are grounded in the Annotation Ontology.

• FP team has developed Symbiota-integrated PHP Client Tools that record andpush new annotations to the external FP infrastructure where statistics are kept.

SCAN Symbiota Portalhttp://symbiota1.acis.ufl.edu/scan/portal/index.php

New FP Client Tools in Symbiota

Page 7: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Current workings & look in SCAN

Page 8: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Homepage – http://symbiota1.acis.ufl.edu/scan/portal/index.php

Images

Page 9: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

ID = Scarabaeidae

Image thumbnail gallery – some are insufficiently identified

ID = Epicaerus

Page 10: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

More information – occurrence records, images – is clicks away

Page 11: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

More information – occurrence records, images – is clicks away

Page 12: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Experts can log in and view a taxon-tailored IDs Needed tab

This is the scarabin need of an ID

Page 13: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Experts can log in and view a taxon-tailored IDs Needed tab

This is the scarabin need of an ID

Occurrence tab

Page 14: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Experts can log in and view a taxon-tailored IDs Needed tab

This is the scarabin need of an ID

Occurrence tab

Images tab

Page 15: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Adding a new identification in the Determination History tab

Scientific Name is linked to theSCAN Taxonomic Thesaurus.

Page 16: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

The fully integrated Symbiota tab for IDs is Filtered Push-enabled

• New = current ID• Image remapping• Submission to FP

Page 17: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Confirmationin SCAN

Simultaneous ID recording internally (SCAN) and externally (FP)

Page 18: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Confirmationin SCAN

Confirmationin FP3 node

Simultaneous ID recording internally (SCAN) and externally (FP)

Annotationslist view

AO translation

Page 19: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Simultaneous ID recording internally (SCAN) and externally (FP)

Confirmationin FP3 node

Confirmationin SCAN

Annotationsdetail view

RDF / XMLtranslation

AO translation

Page 20: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

Future work – 1st production-level Symbiota / FP implementation

• Optimization of SCAN "IDs Needed" user interface – thumbnail view

• Roll-out to the SCAN expert community, creation of expert profiles in FP

• Expansion beyond SCAN members, diversified notification systems

"Curculionidae" ("Calles" sp.) – ASUHIC0031695

Page 21: Franz Et Al. SCAN - Southwest Collections of Arthropods Network: Leveraging Filtered Push Technology to Enhance Remote Taxonomic Identifications

• TDWG 2013 Symposium organizers – Antonio Mauro Saraiva

• James Hanken, Maureen Kelly & David Lowery – http://wiki.filteredpush.org/wiki/

• ASUHIC digitization team – Sangmi Lee, David Fleming, Soon Flynn, Andrew Jansen,Catherine Mercado, Joshua Persson, Sarah Shirota, Michael Shillingburg.

• NSF Award EF-1207107. "Digitization TCN: Collaborative Research: Southwest Collections of Arthropods

Network (SCAN): a Model for Collections Digitization to Promote Taxonomic and Ecological Research."

Acknowledgments

http://symbiota1.acis.ufl.edu/scan/portal/index.php http://taxonbytes.org

https://sols.asu.edu http://symbiota.org/tiki/tiki-index.php