beyond a data portal: a collaborative environment for the deep carbon science communities han wang,...
TRANSCRIPT
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science CommunitiesHan Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma, and Peter FoxRensselaer Polytechnic Institute 110 8th St., Troy, NY, 12180 United States
Semantic Representation of Information and Data in DCO Knowledge BaseDeep Carbon Observatory (DCO) is a decade-long scientific endeavor to understand carbon in the complex deep Earth system. Thousands of DCO scientists from institutions across the globe are organized into communities representing four domains of exploration: Extreme Physics and Chemistry, Reservoirs and Fluxes, Deep Energy, and Deep Life. Cross-community and cross-disciplinary collaboration is one of the most distinctive features in DCO's flexible research framework.
In order to expedite research collaboration between DCO scientists and communities, the DCO Data Science team developed an infrastructure that integrates multiple open source software including Drupal, CKAN, VIVO, Handle, etc. This infrastructure not only serves as a registry and a repository of DCO resources such as datasets, documents, images, and videos, it also formulates a rapidly growing knowledge base that interconnects people, organizations, publications, activities, locations, and other entities of research interest in DCO communities to enable better browsing, searching, visualizing, and reporting.
DCO CommunityGlobal community of ‘Carbon scientists’ contributing to theDeep Earth Computer (data legacy) comprising:• Global Earth Mineral Laboratory• Global Inventory of Deep Fluids• Global Volcano Gas Emissions• Global Census of Deep Microbial
Life• State of High Pressure and
Temperature Carbon and Related Materials
• Global Inventory of Diamonds with Inclusions
DCO Statistics:• Over 4,800 people across 575
organizations.• Over 2,100 publications.• Over 210 projects including field
studies.• Over 1,600 research topics.• Over 130 datasets.• Over 590 research locations.
DCO Knowledge Base Infrastructure
Abstract
DCO Resources (e.g. datasets, documents, images, videos, etc.) stored using CKAN Repository(data.deepcarbon.net)
All resources receive a unique handle, a DCO-ID(dx.deepcarbon.net)
Semantic representation of information stored and maintained in VIVO, a Knowledge Graph(info.deepcarbon.net)
Data Information Knowledge
Producers Consumers
Context
PresentationOrganization
IntegrationConversation
CreationGathering
Experience
We take heterogeneous data and information from multiple different science domains across different organizations and organize them into a knowledge graph with over 550,000 triples.
We take the raw data, have users augment that with additional information and context, link the data together, then present it to the user from the knowledge base.
Reports, Visualization, Search and Browse all Using DCO Knowledge Base
Group data deposit and
reporting
Listings of group content
Group management
and messaging
Listings of group
documents
VIVO - represents academic research communities• Every person, organization, or other data entity in
VIVO has a unique identifier• VIVO enables the discovery of research and
scholarship across disciplines at one institution or across many
• Records are both human-readable and machine-readable
• VIVO Extension - we’ve extended (yes, ontologies) VIVO to the science network – datasets, instruments, sites, etc.
Semantic representations and ontologies
User profile page
Community portal
Faceted browser
Community network map
Report dashboard
Glossary:CKAN – DCO data repository, https://data.deepcarbon.net Drupal –DCO Community Portal, https://deepcarbon.net Handle – Resolution service for unique and persistent DCO identifiers, https://dx.deepcarbon.netVIVO – DCO Data Portal, https://info.deepcarbon.net
Data publication
Get this poster