ontology of citizen science @ siena 2016 11 24

16
Workshop on “crowdsourced information & citizen science: critical aspects and the future” November 25th, 2016 Luigi Ceccaroni ENERGIC IC1203 COST action Towards an ontology of citizen science The representation of crowdsourced information Luigi Ceccaroni (1000001 Labs) Siena, November 24 th , 2016

Upload: luigi-ceccaroni

Post on 13-Apr-2017

74 views

Category:

Science


2 download

TRANSCRIPT

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

Towards an ontology of citizen science

The representationof crowdsourced information

Luigi Ceccaroni (1000001 Labs)

Siena, November 24th, 2016

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

Index

• An ontology of citizen science

– Projects

– Tools

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

An ontology of citizen science

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• Systems with no overall organizing rationale– Not incorporating any organizing principle

for data, information and knowledge

• Systems with inward organization– Incorporating an organizing principle (such

as standard-based metadata schema) to bolster categorization and processing capabilities

– However, imposing constraints on adopting organizations

Knowledge organization

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• Systems with outward organization– Based on standards that are already accepted or

in use

– Facilitating future interaction between diverse organizations by providing data to other participants in predictable and mutually agreed upon formats

– In some cases, based on the specifications of a single system (with inward organization) that became accepted over time

Knowledge organization

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• Influenced by the way that information is structured in the SciStarter database:

– US Federal Crowdsourcing and Citizen Science Catalog developed by the Wilson Center

– Atlas of Living Australia

• Data shared through a set of custom-designed APIs (at the most basic level)

Shared standards for project metadata

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• Readable by a computer

• Can enhance inter-organizational communication through a standard set of definitions based on a format like:

– RDF/XML OWL

– JavaScript object notation for Linked Data (JSON-LD, a method of encoding Linked Data using JSON)

Benefits of outward organization

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

An ontology of project metadata

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• An international WG

CSA’s Data and metadata working group

Description ALA - BioCollect SciStarter PPSR-CORE (CitSci.org) The Federal Crowdsourcing and Citizen Science Catalog

Dublin Core GBIF (IPT) POD v1.1 CKAN API DCAT Schema.org OGC CobWeb ADIwg Data Type Multiplicity ISOInstance (Citclops)

Database Name Type Mandatory/Optional Database Name Type Required Database Name Type Required Type Required Database Name Database Name 19115/19110

Value TypeIDENTIFIERS, DESCRIPTORS & VERSIONS

Globally unique identifier (GUID) for the project; system generated project:projectId text M id integer always ProjectGUID GUID Y cartodb_id integer y collectionID alternateIdentifier identifier id dct:identifier Citclops textType of identifier indicating the remote repository

The short name of the project that led to the creation of the datasetCitclops text

The title of the project that led to the creation of the dataset project:name text M title string always ProjectName text Y project.title title title dct:title name gmd:name text 0 .. 1

Citizens' observatory for coast and ocean optical monitoring text

A persistent identifier of the dataset in an external repository activity:projectActivityId text M alternateIdentifierCitclops text

Type of identifier indicating the remote repository

The edition or version number of the submitted dataset additionalMetadata.hierarchyLevel gmd:edition text 0 .. 1

2015_09_30 textThe activity status of the project (This automatically updates based on serverDate relative to project start/end dates.) project:status enumeration M

Derived from date range

expired boolean always ProjectStatus text / categorical Y project_status string Y temporal dct:temporal temporal

ended

enumeration: pending, active/ongoing, ended/complete, undefined

How often the project information or dataset is updated maintenance.maintenanceUpdateFrequency (controlled vocabulary) & maintenance.description.para (free text)

accrualPeriodicity dct:accrualPeriodicity

Short text name or title of the project; title used to identify the submission project:name + activity:name text M project_name string y

Citclops textThe unique ID for the submission gmd:identifier text 0 .. n

CitclopsThe Datacite DOI minted for the submission citation@identifier

Instructions on how the dataset may be reused intellectualRights.para

ID/Name(s) of datasets related to this one

The name of the dataset for citation purposes project:organisationName + activity:name text M title gmd:title text 1 .. 1

CitclopsAlternative or other name given to the dataset title@xml:lang (titles in other languages)

EyeOnWaterFree text description of the aim, objectives or expected/intended outcomes of the project; description of what the project should accomplish

project:aim text M goal string[64] always intended_outcomes string y abstract gmd:abstract

Natural-waters optical monitoring textProject outcomes

Suggested Dataset Objective purpose

Free text description of the project project:description text M description string always ProjectDescription text Y project_description string y project.designDescription description notes dct:description description gmd:description text 0 .. 1

The Citclops project developed systems to retrieve and use data on natural-waters colour, transparency and fluorescence, using low-cost sensors and contextual information combined with citizen participation. text

Short description of what needs to be done by the participant project:task text M task string[64] always participation_tasks string y

To retrieve and use data on natural-waters colour, transparency and fluorescence, using low-cost sensors text

Catch-all for any project-specific data administrators want to make available ProjectMetadata text N additionalInfo

Citclops is supported by the EC-FP7 Programme, grant agreement nº 308469

International Standard Book Number (ISBN) bibliography.citation.identifier gmd:ISBN text 0 .. 1

International Standard Serial Number (ISSN) bibliography.citation.identifier gmd:ISSN text 0 .. 1

DATE FIELDS

The date the submission was published into the receiving system pubDate

The date and time that the project was created in the database project:dateCreated ISODate M date datetime always created_at string y issued dct:issued datePublished

The date and time that project metadata was last updated project:lastUpdated ISODate M updated datetime always ProjectDateLastUpdated ISO 8601 DateTime (UTC)

Y updated_at string y additionalMetadata.dateStamp modified dct:modified dateModified gmd:date CI_Date 1 .. n

2015-09-30 "YYYY-MM-DD"The date that the project is planned to commence. The date on which the project began or will begin. project:plannedStartDate ISODate M begin_date date optional ProjectStartYear ISO 8601 Year

(UTC)N start_date string y

2012-10-01 "YYYY-MM-DD"The date that the project is planned to end. Applicable for projects operating over a defined period of time. The date on which the project ended or will end.

project:plannedEndDate ISODate O end_date date optional ProjectEndDate ISO 8601 Date (UTC)

N

2015-09-30 "YYYY-MM-DD"Actual start date for project project:startDate ISODate M

2012-10-01 "YYYY-MM-DD"Actual end date for project project:endDate ISODate O

2015-09-30 "YYYY-MM-DD"The date that the activity/survey is planned to commence. activity:startDate ISODate M

The date that the activity/survey is planned to end. activity:endDate ISODate O

CONTACTS, OWNERS, SUBMITTERS & PARTICIPANTS

Primary project coordinator: first and last name(s) of person, or name of organization project:manager O project_owner_name string[64] optional ProjectCoordinator Person Object/Construct

N project_contact string y contact contactPoint ? fn maintainer dcat:contactPoint ? vcard:fn provider ? Person:name

Luigi Ceccaroni

text

Primary dataset contact: first and last name(s) of person project:manager O ProjectContactName Person Object / Construct

Y gov_contact string y personnel.individualName

Luigi Ceccaroni

text

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

CSA’s Data and metadata working groupDescription ALA - BioCollect SciStarter

Database Name Type Mandatory/Optional

Database Name Type Required

IDENTIFIERS, DESCRIPTORS & VERSIONS

Globally unique identifier (GUID) for the project; system generated project:projectId text M id integer always

Type of identifier indicating the remote repository

The short name of the project that led to the creation of the dataset

The title of the project that led to the creation of the dataset project:name text M title string always

A persistent identifier of the dataset in an external repository activity:projectActivityId text M

Type of identifier indicating the remote repository

The edition or version number of the submitted dataset

The activity status of the project (This automatically updates based on serverDate relative to project start/end dates.)

project:status enumeration MDerived from date

range

expired boolean always

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

CSA’s Data and metadata working group

ALA - BioCollect

SciStarter

PPSR-CORE (CitSci.org)

The Federal Crowdsourcing and

Citizen Science Catalog

Dublin Core

GBIF (IPT)

POD v1.1

CKAN API

DCAT

Schema.org

OGC

CobWeb

ADIwg

ISO 19115/19110

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• [http://citizenscience.org/2015/11/12/introducing-the-data-and-metadata-working-group/]

• Contact people:

– Anne Bowser (co-chair), Woodrow Wilson International Center for Scholars

– Peter Brenton (ACSA liaison), Atlas of Living Australia

– Luigi Ceccaroni (ECSA liaison), 1000001 Labs

CSA’s Data and metadata working group

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

An ontology of tools

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

SciStarter’s tools databaseDefinition Format Values Notes

What about

Tool/Device

Accessories?

Tool Name Name of specific tool

give usable

examples to

define the

name

Description

Image

Manufacturer Maker/ producer of tool

Model

Vendor Provider of tool

Domain Field of study

Medium Sample type

Equipment/Sensor E.g., sensor, etc.

Function

measure, model,

analyze, observe,

support data collection,

recording

Measures

Cost Price

Availability

Manufacturer - Build,

Buy, Borrow

Free text

fields

Time to build

and ship, effort

to obtain the

thing,

recommender

system

Accesssibility

Ease of use for different

populations

Portability

Size/Weight (shipping vs

final)

Size

Weight

Total weight of assembled

tool

Technical

requirements/Add-ons

Response Time

Ideal Conditions

Range of Error

How long to set up

Expertise needed to

operate/Instructions

Training Required?

Skills Needed?

Calibration Needed?

Ages appropriate

Detection capability

Response time

Active

Frequency of useHow often do you need to

check the tool, data upload

Definition

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

• [http://scistarter.com/finder]

• Contact people:

– Darlene Cavalier, SciStarter, Arizona State University

– Anne Bowser, Woodrow Wilson International Center for Scholars

SciStarter’s tools database

Workshop on “crowdsourced information & citizen science: critical aspects and the future”

November 25th, 2016 – Luigi CeccaroniENERGIC IC1203 COST action

Towards an ontology of citizen science

Luigi Ceccaroni 1000001 Labs, Research lead

Citizen science COST action 15212, Interoperability WG chairECSA, Board of Directors

CSA, Data and metadata working group

[email protected]://www.1000001labs.org/