beyond a data portal: a collaborative environment for the deep carbon science communities han wang,...

1
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma, and Peter Fox Rensselaer Polytechnic Institute 110 8th St., Troy, NY, 12180 United States Semantic Representation of Information and Data in DCO Knowledge Base Deep Carbon Observatory (DCO) is a decade-long scientific endeavor to understand carbon in the complex deep Earth system. Thousands of DCO scientists from institutions across the globe are organized into communities representing four domains of exploration: Extreme Physics and Chemistry, Reservoirs and Fluxes, Deep Energy, and Deep Life. Cross-community and cross- disciplinary collaboration is one of the most distinctive features in DCO's flexible research framework. In order to expedite research collaboration between DCO scientists and communities, the DCO Data Science team developed an infrastructure that integrates multiple open source software including Drupal, CKAN, VIVO, Handle, etc. This infrastructure not only serves as a registry and a repository of DCO resources such as datasets, documents, images, and videos, it also formulates a rapidly growing knowledge base that interconnects people, organizations, publications, activities, locations, and other entities of research interest in DCO communities to enable better browsing, searching, visualizing, and reporting. DCO Community Global community of ‘Carbon scientists’ contributing to the Deep Earth Computer (data legacy) comprising: •Global Earth Mineral Laboratory •Global Inventory of Deep Fluids •Global Volcano Gas Emissions •Global Census of Deep Microbial Life •State of High Pressure and Temperature Carbon and Related Materials •Global Inventory of Diamonds with Inclusions DCO Statistics: •Over 4,800 people across 575 organizations. •Over 2,100 publications. •Over 210 projects including field studies. •Over 1,600 research topics. •Over 130 datasets. •Over 590 research locations. DCO Knowledge Base Infrastructure Abstract DCO Resources (e.g. datasets, documents, images, videos, etc.) stored using CKAN Repository (data.deepcarbon. net) All resources receive a unique handle, a DCO-ID (dx.deepcarbon. net) Semantic representation of information stored and maintained in VIVO, a Knowledge Graph (info.deepcarbon .net) Data Information Knowledge Producers Consumers Context Presentation Organization Integration Conversation Creation Gathering Experience We take heterogeneous data and information from multiple different science domains across different organizations and organize them into a knowledge graph with over 550,000 triples. We take the raw data, have users augment that with additional information and context, link the data together, then present it to the user from the knowledge base. Reports, Visualization, Search and Browse all Using DCO Knowledge Base Group data deposit and reporting Listings of group content Group management and messaging Listings of group documents VIVO - represents academic research communities Every person, organization, or other data entity in VIVO has a unique identifier VIVO enables the discovery of research and scholarship across disciplines at one institution or across many Records are both human-readable and machine-readable VIVO Extension - we’ve extended (yes, ontologies) VIVO to the science network – datasets, instruments, sites, etc. Semantic representations and ontologies User profile page Community portal Faceted browser Community network map Report dashboard Glossary: CKAN – DCO data repository, https : //data.deepcarbon.net Drupal –DCO Community Portal, https: //deepcarbon.net Handle – Resolution service for unique and persistent DCO identifiers, https: //dx.deepcarbon.net VIVO – DCO Data Portal, https :/ /info.deepcarbon.net Data publicat ion Get this poster

Upload: jocelin-shields

Post on 27-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,

Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science CommunitiesHan Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma, and Peter FoxRensselaer Polytechnic Institute 110 8th St., Troy, NY, 12180 United States

Semantic Representation of Information and Data in DCO Knowledge BaseDeep Carbon Observatory (DCO) is a decade-long scientific endeavor to understand carbon in the complex deep Earth system. Thousands of DCO scientists from institutions across the globe are organized into communities representing four domains of exploration: Extreme Physics and Chemistry, Reservoirs and Fluxes, Deep Energy, and Deep Life. Cross-community and cross-disciplinary collaboration is one of the most distinctive features in DCO's flexible research framework.

In order to expedite research collaboration between DCO scientists and communities, the DCO Data Science team developed an infrastructure that integrates multiple open source software including Drupal, CKAN, VIVO, Handle, etc. This infrastructure not only serves as a registry and a repository of DCO resources such as datasets, documents, images, and videos, it also formulates a rapidly growing knowledge base that interconnects people, organizations, publications, activities, locations, and other entities of research interest in DCO communities to enable better browsing, searching, visualizing, and reporting.

DCO CommunityGlobal community of ‘Carbon scientists’ contributing to theDeep Earth Computer (data legacy) comprising:• Global Earth Mineral Laboratory• Global Inventory of Deep Fluids• Global Volcano Gas Emissions• Global Census of Deep Microbial

Life• State of High Pressure and

Temperature Carbon and Related Materials

• Global Inventory of Diamonds with Inclusions

DCO Statistics:• Over 4,800 people across 575

organizations.• Over 2,100 publications.• Over 210 projects including field

studies.• Over 1,600 research topics.• Over 130 datasets.• Over 590 research locations.

DCO Knowledge Base Infrastructure

Abstract

DCO Resources (e.g. datasets, documents, images, videos, etc.) stored using CKAN Repository(data.deepcarbon.net)

All resources receive a unique handle, a DCO-ID(dx.deepcarbon.net)

Semantic representation of information stored and maintained in VIVO, a Knowledge Graph(info.deepcarbon.net)

Data Information Knowledge

Producers Consumers

Context

PresentationOrganization

IntegrationConversation

CreationGathering

Experience

We take heterogeneous data and information from multiple different science domains across different organizations and organize them into a knowledge graph with over 550,000 triples.

We take the raw data, have users augment that with additional information and context, link the data together, then present it to the user from the knowledge base.

Reports, Visualization, Search and Browse all Using DCO Knowledge Base

Group data deposit and

reporting

Listings of group content

Group management

and messaging

Listings of group

documents

VIVO - represents academic research communities• Every person, organization, or other data entity in

VIVO has a unique identifier• VIVO enables the discovery of research and

scholarship across disciplines at one institution or across many

• Records are both human-readable and machine-readable

• VIVO Extension - we’ve extended (yes, ontologies) VIVO to the science network – datasets, instruments, sites, etc.

Semantic representations and ontologies

User profile page

Community portal

Faceted browser

Community network map

Report dashboard

Glossary:CKAN – DCO data repository, https://data.deepcarbon.net Drupal –DCO Community Portal, https://deepcarbon.net Handle – Resolution service for unique and persistent DCO identifiers, https://dx.deepcarbon.netVIVO – DCO Data Portal, https://info.deepcarbon.net

Data publication

Get this poster