stories of “glocality"—nations in a global infrastructure

29
Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License Stories of “Glocality"—Nations in a Global Infrastructure Mark A. Parsons 0000-0002-7723-0950 Secretary General Weaving The Internet Of Data Amsterdam, The Netherlands 6 April 2016

Upload: research-data-alliance

Post on 14-Apr-2017

41 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Stories of “Glocality"—Nations in a Global Infrastructure

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License

Stories of “Glocality"—Nations in a Global Infrastructure

Mark A. Parsons 0000-0002-7723-0950 Secretary General

Weaving The Internet Of Data Amsterdam, The Netherlands 6 April 2016

Page 2: Stories of “Glocality"—Nations in a Global Infrastructure

research infrastructure vs. e-infrastructure

Page 3: Stories of “Glocality"—Nations in a Global Infrastructure

research infrastructure vs. e-infrastructure

a false dichotomy

Page 4: Stories of “Glocality"—Nations in a Global Infrastructure

e-Infrastructure is research infrastructure.

Modern research infrastructure is (or at least requires) e-Infrastructure.

It’s about the data

Page 5: Stories of “Glocality"—Nations in a Global Infrastructure

Infrastructure is hard to conceive and describe because when it works, it’s transparent, ubiquitous, and embedded in our daily work.

Page 6: Stories of “Glocality"—Nations in a Global Infrastructure

Research Data Alliance

Vision Researchers and innovators openly share data across technologies, disciplines, and countries to address the grand challenges of society.

Mission RDA builds the social and technical bridges that enable open sharing of data.

Page 7: Stories of “Glocality"—Nations in a Global Infrastructure

Glocality—Bridging across scales

Glocalization “means the simultaneity—the co-presence—of both universalizing and and particularizing tendencies.”

— Roland Robertson

Glocalism is playing at multiple scales at once.

Page 8: Stories of “Glocality"—Nations in a Global Infrastructure

rd-alliance.org

SouthAmerica1%

NorthAmerica34%

Europe49%

Australasia4%

Asia9%

Africa3%

OrganizationalTypeMembers (Feb2016)

Press&Media 22Policy/FundingAgency 58LargeEnterprise 85ITConsultancy/Development 119SmallandMediumEnterprise 212Other 198Government/PublicServices 583Academia/Research 2447TOTAL 3724

TheRDACommunity:3700+membersfrom110countries

(February2016)

May-July Aug-Oct Nov-Jan Feb-Apr May-July Aug-Oct Nov-Jan Feb-Apr May-July Aug-Oct Nov-Jan Feb-Apr

392

9911274

16562048

24042636

28813126

34343698 3724

65+ Working and Interest Groups

Page 9: Stories of “Glocality"—Nations in a Global Infrastructure

RDA Organisational Members

RDA Affiliate Members

https://rd-alliance.org/organisation/rda-organisation-affiliate-members.html

RDA Organisational & Affiliate members

RepresenttheinterestsofRDA’sorganisationalmembersandensurethattheirinputandneedsplayaroleinguidingtheprogramsandactivitiesoftheRDA.

Page 10: Stories of “Glocality"—Nations in a Global Infrastructure

FranBerman,ResearchDataAlliance

“Create - Adopt - Use” (in 12-18 months)

Systems Interoperability

Adopted Policy

Sustainable Economics

Common Types, Standards, Metadata

TrafficImage:MikeGonzalez

Adopted Community Practice

Training, Education, Workforce

Page 11: Stories of “Glocality"—Nations in a Global Infrastructure

FranBerman,ResearchDataAlliance

RDA: Accelerate Data Sharing and Interoperability Across Cultures, Communities, Scales, Technologies

▪ Technicalpartsofthedataengine:▪ Datatyperegistriesreferencemodel▪ Wheatdatainteroperabilityframework

▪ Rulesoftheroad:▪ Commonagreementondatacitation▪ Commonpracticefordatarepositories▪ Principlesoflegalinteroperability

▪ Betterdrivers• Summerschoolsindatascienceandcloud

computinginthedevelopingworld(withCODATA)

• Activedatamanagementplandevelopmentandmonitoring

Policy and Practice

Systems Interoperability

Sustainable Economics

Common Types, Standards, Metadata

Training, Education, Workforce

Page 12: Stories of “Glocality"—Nations in a Global Infrastructure

‹#›An Area of Convergence and Agreement

Internet Domain

nodes with IP numbers

packages being exchanged

standardized protocols

Data Domain

objects with PID numbers

objects being exchanged

standardized protocols

Slide courtesy P. Wittenberg from L. Lannom from D. Clark

Page 13: Stories of “Glocality"—Nations in a Global Infrastructure

How to feed the world RDA Agriculture Interest Group

image courtesy National Farmers Union

Page 14: Stories of “Glocality"—Nations in a Global Infrastructure

The Wheat Data Interoperability WG

Active members: Alaux Michael (INRA, France), Aubin Sophie (INRA, France), Arnaud Elizabeth (Bioversity, France), Baumann Ute (Adelaide Uni, Australia), Buche Patrice (INRA, France), Cooper Laurel (Planteome, USA), Fulss Richard (CIMMYT, Mexico), Hologne Odile (INRA, France), Laporte Marie-Angélique (Bioversity, France), Larmand Pierre (IRD, France), Letellier Thomas (INRA, France), Lucas Hélène (INRA, France), Pommier Cyril (INRA, France), Protonotarios Vassilis (Agro-Know, Greece), Quesneville Hadi (INRA, France), Shrestha Rosemary (INRA, France), Subirats Imma (FAO of the United Nations, Italy), Aravind Venkatesan (IBC, France), Whan Alex (CSIRO, Australia) Co-chairs: Esther Dzalé Yeumo Kaboré (INRA, France), Richard Allan Fulss (CIMMYT, Mexico)

� Aims: contribute to the improvement of Wheat related data interoperability by � Building a common interoperability framework (metadata, data formats and vocabularies) � Providing guidelines for describing, representing and linking Wheat related data

Contributors

Sponsors

slide courtesy Esther Dzalé

Page 15: Stories of “Glocality"—Nations in a Global Infrastructure

� Guidelines (http://wheatis.org/DataStandards.php) � Data exchange formats

� Example: VCF (Variant Call Format) for sequence variation data, GFF3 for genome annotation data, etc.

� Data description best practices � Consistent use of ontologies, consistent use of external database cross references

� Data sharing best practices � Share data matrices along with relevant metadata (example: trait along with

method, units and scales or environmental ones) � Useful tools and use cases that highlight data formats and vocabularies issues

� A portal of wheat related ontologies and vocabularies (http://agroportal.lirmm.fr/ontologies?filter=WHEAT) � Allows the access to the ontologies and vocabularies through APIs.

� A prototype � Implementation of use cases of wheat data integration within the AgroLD

(Agronomic Linked Data) tool: http://volvestre.cirad.fr:8080/agrold/

The deliverables

slide courtesy Esther Dzalé

Page 16: Stories of “Glocality"—Nations in a Global Infrastructure

� For data managers, data providers � One stop shop for relevant information related to data management Æ

arise awareness, avoid duplicated efforts, foster adoption of common practices

� Facilitate the use of common data exchange formats Æ easy data sharing/submission to international repositories

� Foster a standardized description of datasets with consistent use of ontologies and metadata Æincrease the identification, the findability and the usability of the datasets

� For data scientists, bioinfomaticians � Facilitate the access, integration and analysis of data from various

sources � Access to data of higher quality

� For top management, researchers � Increase the chance to answer complex questions

Benefits for many target users

slide courtesy Esther Dzalé

Page 17: Stories of “Glocality"—Nations in a Global Infrastructure

Crisis of Confidence in Research Data Citation

Page 18: Stories of “Glocality"—Nations in a Global Infrastructure

JointDeclarationofDataCitationPrinciples(Overview)

TheNobleEight-FoldPathtoCitingData

1. Importance2. Creditandattribution3. Evidence4. UniqueIdentification5. Access6. Persistence7. Specificityandverifiability8. Interoperabilityandflexibility

Principlesaresupplementedwithaglossary,referencesandexampleshttp://force11.org/datacitation

Page 19: Stories of “Glocality"—Nations in a Global Infrastructure

‹#›Citing Dynamic Data

Data Citation: Data + Means-of-access

▪ Data à time-stamped & versioned (aka history)

Researcher creates working-set via some interface: ▪ Access à assign PID to QUERY, enhanced with − Time-stamping for re-execution against versioned DB − Re-writing for normalization, unique-sort, mapping to history − Hashing result-set: verifying identity/correctness

leading to landing page

S. Pröll, A. Rauber. Scalable Data Citation in Dynamic Large Databases: Model and Reference Implementation. In IEEE Intl. Conf. on Big Data 2013 (IEEE BigData2013), 2013http://www.ifs.tuwien.ac.at/~andi/publications/pdf/pro_ieeebigdata13.pdf

Page 20: Stories of “Glocality"—Nations in a Global Infrastructure

‹#›Output / Results

▪ 14 Recommendationsgrouped into 4 phases: - Preparing data and query store - Persistently identifying specific data

sets - Resolving PIDs - Upon modifications to the data

infrastructure ▪ 2-page flyer ▪ Technical Report to follow ▪ Reference implementations

(SQL, CSV, XML) ▪ Pilots

Page 21: Stories of “Glocality"—Nations in a Global Infrastructure

‹#›WG Pilots

▪ Pilots and implementations by ▪ LNEC: Critical Infrastructure Monitoring System ▪ Virtual Atomic and Molecular Data Centre ▪ NERC (UK Natural Environment Research Council

Data Centres) ▪ ARGO Buoy Network ▪ River Flow Dataset

▪ ESIP (Earth Science Information Partners) ▪ BCO-DMO

▪ DEXHELPP – Social Security Data ▪ ENVRIplus: Carbon Observation System ▪ Million Song Database, IR Benchmark DBs ▪ Several others under discussion…

Page 22: Stories of “Glocality"—Nations in a Global Infrastructure

More stories of interconnection and glocality in RDA

• Unplanned interconnection Data Registries, Data Publishing and OpenAire

• Two for the price of one in WUSTL Air Quality portal.

• Data types for carbon geology across continent (plus materials science, hydrology, more)

• EOSC?

Page 23: Stories of “Glocality"—Nations in a Global Infrastructure

Moving forward

• Future Directions—communication, engagement, and coordination

• Bridging the bridges—ever more interconnection, reference implementations, engagement with ESOC and national data services.

• Diversifying the funding portfolio

Page 24: Stories of “Glocality"—Nations in a Global Infrastructure

• The group of government and non-profit science funding organisations that support the data and science communities to participate in RDA activities:

• European Commission (DG CNCT)• US Government (NSF and NIST)• Australian Government• Japanese Government (JST)• UK Government (Jisc)• Sloan Foundation• MacArthur Foundation

• Allows agencies the opportunity to share policies and funding program plans that support data sharing across the globe, and thereby amplify their impact.

• Explore actual policy impacts.

• Related to but distinct from RDA. A parallel informal organisation.

RDA Funders Forum

Page 25: Stories of “Glocality"—Nations in a Global Infrastructure

The Yin and Yang of national and international support

Page 26: Stories of “Glocality"—Nations in a Global Infrastructure

• personnel—encourage your staff and grantees to participate in RDA

• projects and use cases

• make local concerns global concerns

• establish niches or areas of excellence

How the national can help the global (and thereby the national)

Page 27: Stories of “Glocality"—Nations in a Global Infrastructure

• networks and connections to organisations, countries, resources

• amplification of local effort

• access to diverse experts

• efficiency and cost savings

How the global can help the national (and thereby the global)

Page 28: Stories of “Glocality"—Nations in a Global Infrastructure

12-16 September 2016in

Denver, Colorado, USA