bigdataeurope - empowering communities with data technologies

21
BIG DATA EUROPE HTTP://WWW.BIG-DATA-EUROPE.EU/ Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges European Data Economy Workshop, Focus: Data Value Chain & Big Data & Open Data 15 September 2015, University of Economics Vienna

Upload: semantic-web-company

Post on 13-Apr-2017

324 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: BigDataEurope - Empowering Communities with Data Technologies

BIG DATA EUROPEHTTP://WWW.BIG-DATA-EUROPE.EU/Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges

                    

European Data Economy Workshop, Focus: Data Value Chain & Big Data & Open Data15 September 2015, University of Economics Vienna

Page 2: BigDataEurope - Empowering Communities with Data Technologies

Semantic Web Company (SWC)

SWC was founded 2001, head-quartered in Vienna

25 experts in linked data technologies

Product: PoolParty Semantic Suite (launched 2009)

Serving customers from all over the world

EU- & US-based consulting services

Page 3: BigDataEurope - Empowering Communities with Data Technologies

Semantic Web Company (SWC)

Some of our Customers● Credit Suisse● Boehringer Ingelheim● Roche● Wolters Kluwer● BMJ Publishing Group● Red Bull Media House● Canadian Broadcasting Corporation

(CBC)● Pearson● Council of the EU● DG Environment, EC● Healthdirect Australia● Ministry of Finance (Austria)● World Bank Group● Inter-American Development Bank

(IADB)● International Atomic Energy Agency

(IAEA)● Buildings Performance Institute

Europe (BPIE)● Renewable Energy & Energy Efficiency

P (REEEP)● Global Buildings Performance Network

(GBPN)● American Physical Society● Education Services Australia (ESA)● Norwegian Directorate of Immigration● Australian National Data Service

Finance / Automotive / Publisher / Health Care / Public Administration / Energy / Education

Selected Partners● EBCONT● EPAM Systems● iQuest● PwC● Tenforce● OpenLink Software● Ontotext● MarkLogic● Gravity Zero● Altotech● Wolters Kluwer● Taxonomy Strategies● Digirati● Fraunhofer (IAIS)● University of Leipzig

(INFAI)● The Open Data Instizute

(ODI)

We all have one goal in mind: Make machines smart enough so that they can help us to find those needles in the haystack, which are really relevant to us.

Page 4: BigDataEurope - Empowering Communities with Data Technologies

The Motivation – Big Data

Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone.

This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few.

This data is big data. Source: IBM

Page 5: BigDataEurope - Empowering Communities with Data Technologies

The Motivation – Big Data

BIG DATA

Open DataLinked

DataLinked Open Data

Data Repositories

DatabasesData

LibrariesCatalogues

Social Media

Page 6: BigDataEurope - Empowering Communities with Data Technologies

Big Data DimensionsVolume

Velocity

Variety

10001010101010101010101001010010101010101001010010101001010010101001010010100101010010101010010101000101010101001010101010101010100101010101010100101010111000101010101010101010100101001010101010100101001010100101001010100101001010010101001010101001010100010101010100101010101

10001010101010101010101001010010101010101001010010

…….………….……………..……..……………

1 0

1000101010101010101010100101001010101010100101oo11

Veracity!

Page 8: BigDataEurope - Empowering Communities with Data Technologies

Big Data in Europe: Challenges, Opportunities

HealthClimateEnergyTransportFoodSocietiesSecurity

Loremipsumdolors

KSDJOPSCKKSDKA

B

LKASJLLAWWD

S

wpweppepwpisio

we

10101001101010010101

0

Regional Data Repositories 10101

001101010010101

0

10101001101010010101

0

#2: Interlink, Centralise Access, Explore101010100101010101001011010001010101010010101010100001011010001010101010010101010100100101010100101010101001011010

001010

Data Eleme

nt

related relate

d

#3: Analyse, Discover, Visualize#4: Mashup, Cross-domain Exploitation

JournalistsCitizens Industry

Authorities

Page 9: BigDataEurope - Empowering Communities with Data Technologies

Big Data in Europe: Obstacles

2 mai 2023

#1 Big Data “Variety“ problem Multiple Data Sources Required: Integration, Harmonisation

#2 Opening-up Data concerns Loss of control, lack of tracking Reservations about large corporations

#3 Limited Skills, Training, Technology

Lack of Data Scientists Lack of Generic Architectures, components

Page 10: BigDataEurope - Empowering Communities with Data Technologies

Big Data in Europe: Obstacles

2 mai 2023

Extraction, Curation Quality, Linking, Integration

Publication, Visualization, Analysis

Extraction, Curation, Quality, Linking, Integration, Publication,

Visualization, Analysis

HealthTransport

Security

Extraction Curation Quality Linking Integration Publication Visualization Analysis

Data Repositories Linked Open Data Cloud

Stage 1

Stage 2

Stage 3

Food SocietiesClimate Energy

Page 11: BigDataEurope - Empowering Communities with Data Technologies

BDE Partners

Page 12: BigDataEurope - Empowering Communities with Data Technologies

Rationale Show societal value of Big Data Lower barrrier for using big data

technologieso Required effort and resourceso Limited data science skills

Help establishing cross-lingual/organizational/domain Data Value Chains 2 mai 2023

Page 13: BigDataEurope - Empowering Communities with Data Technologies

RationaleCOORDINATION

Stakeholder Engagement (Requirements

Elicitation)

SUPPORTDesign, Realise, Evaluate

Big Data Aggregator Platform

Create and Manage Societal Big Data Interest

Groups

Cloud-deployment ready Big Data Aggregator

Platform

CSA Measures

Results

Page 14: BigDataEurope - Empowering Communities with Data Technologies

SummaryTwo clearly defined coordination and support measures: Coordination: Engaging with a diverse range of stakeholder groups

representing particularly the Horizon 2020 societal challenges Health, Food & Agriculture, Energy, Transport, Climate, Social Sciences and Security; Collecting requirements for the ICT infrastructure needed by data-intensive science practitioners tackling a wide range of societal challenges; covering all aspects of publishing and consuming semantically interoperable, large-scale data and knowledge assets;

Support: Designing, realizing and evaluating a Big Data Aggregator platform infrastructure that meets requirements, minimises disruption to current workflows, and maximises the opportunities to take advantage of the latest European RTD developments (incl. multilingual data harvesting, data analytics & visualisation).

BigDataEurope will implement and apply two main instruments to successfully realize these measures: Build Societal Big Data Interest/Community Groups in the W3C interest group

scheme & involving a large number of stakeholders from the Horizon 2020 societal challenges as well as technical Big Data experts;

Design, integrate and deploy a cloud-deployment-ready Big Data aggregator platform comprising key open-source Big Data technologies for real-time and batch processing, such as Hadoop, Cassandra and Storm.

Page 15: BigDataEurope - Empowering Communities with Data Technologies

Orthogonal Dimensions of Big Data Ecosystems

Generic Big Data Enabling Technologies

Data Value Chain

Data Generation & Acquisition

Data Analysis & Processing

Data Storage & Curation

Data Visualization &

Usage

Data-driven Services

Socie

tal C

halle

nges

Dom

ain

Spec

ific D

ata

Asse

ts

& T

echn

olog

y

Healthcare

Food Security

Energy

Intelligent Transport

Climate & Environment

Inclusive & Reflective Societies

Secure Societies

Page 16: BigDataEurope - Empowering Communities with Data Technologies

BDE Stakeholder Engagement Approach & Activities

BDE Community Tools – JOIN IN NOW !

• Website: news, events, community, …• 7 x BDE W3C Community Groups• 7+1x Mailing Lists• 7 x SC Workshops/Year = 21 Workshops• Full set of communication tool-set…Future Outlook• BDE Aggregator Platform

• For download / internal use• Cloud Version

• Big Data Technology Support Tools

Page 17: BigDataEurope - Empowering Communities with Data Technologies

Domains, Focus Areas & Data Assets

Societal Domain Preliminary Big Data Focus area Selected Key Data assets

Life Sciences & Health

Heterogeneous data Linking & integration

Biomedical Semantic Indexing & QA

ACD Labs / ChemSpider, ChEBI, ChEMBL, Con-ceptWiki, DrugBank, EN-ZYME, Gene Ontology, GO Annotation, Swis-sProt, UniProt, Wik-iPathways, PubMed, MeSH, Disease Ontology (DO), Joint Chemical

Dic-tionary (Jochem), Bio-ASQ datasets Food &

AgricultureLarge-scale distributed data

integrationINFOODS, AQUASTAT Green Learning Network (GLN), Agricultural

Bibliography Network (ABN), AGRIS, AquaMaps, Fishbase

EnergyReal-time monitoring, stream

processing, data analytics, and decision support

European Energy Exchange Data, smart meter measurement data, gas/fuels/energy market/price data, consumption statistics,

equipment condition monitoring data)

Transport Streaming sensor network & geo-spatial data integration

GTFS data, OSM/ LinkedGeoData, MobilityMaps, Transport sensor data, ROSATTE Road safety attributes, European Road Data

Infrastructure - EuroRoadS

Climate Real-time monitoring, stream processing, and data analytics.

European Grid Infrastructure (EGI), Databases hosting atmospheric data. Several software frameworks for simulation, calibration and

reconstruction.

Social Sciences Statistical and research data linking & integration

Federated social sciences data catalogs, statistical data from public data portals and statistical offices (e.g. EuroStats, UNESCO,

WorldBank)

SecurityReal-time monitoring, stream

processing, and data analytics.Image data analysis

Earth Observation data (e.g. Very High Resolution Satellite Imagery acquired from commercial providers and governmental systems)

and collateral data for supporting CFSP/CSDP missions and operations, Databases hosting atmospheric Data. Experimental and

simulation data concerning dispersion of hazardous substances

Page 18: BigDataEurope - Empowering Communities with Data Technologies

Work Packages & Implementation Phases

Community Building

M1-M12 M13-M24 M25-M36

Enabling Technologies

Component Integration

Uptake

Integrator Deployment

Community Assessment

WP3 – Big Data Generic Enabling Technologies & Architecture

WP5 – Big Data Integrator Instances

WP7 – Dissemination & Communication

WP2 – Community Building & Requirements

WP4 – Big Data Integrator Platform

WP6 – Real-life Deployment & User Evaluation

Page 19: BigDataEurope - Empowering Communities with Data Technologies

Blueprint of the Data Aggregator Platform

Batch Layer

Speed Layer

Data Storage

Real-time data &

Transactions …

Batch View

Real-time View

mes

sage

pas

sing

message passing

Applications & ShowcasesReal-time dashboardsDomain-specific BDE apps

Big Data AnalyticsIn-stream Mining

BDE Platform &

Intelligence

Input dataStreamSpatialSocialStatistical TemporalTransactionalImagery

+ Semantic Layer (Retaining Semantics using LD approach )

Lambda Architecture

Page 20: BigDataEurope - Empowering Communities with Data Technologies

Announcements….

Workshop SC2 (Agriculture & Food): 22.9.2015, Paris, INFOSWorkshop SC7 (Secure Societies): 30.9.2015, Brussels, INFOWorkshop SC4 (Transport): .10.2015, Bordeaux, INFOWorkshop SC6 (Social Science): 18.11.2015, Luxembourg, INFO