user experience with knime for large industrial data sets ... · 3/16/2017  · text processing...

29
User Experience with KNIME for Large Industrial Data Sets and Applications Bernhard Lang at KNIME Summit in Berlin, March 2017 Siemens Corporate Technology

Upload: others

Post on 24-Jul-2020

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

User Experience with KNIMEfor Large Industrial Data Setsand ApplicationsBernhard Lang at KNIME Summit in Berlin, March 2017

Siemens Corporate Technology

Page 2: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 2 Berrnhard Lang - Corporate Technology

Outline – User Experience with KNIME ..

1 Data analytics applications for selected branches and domains

2 From data analytics applications to solution packages in KNIME

3 Application 1 of analytics framework: commodity price forecasting

4 Application 2 of analytics framework: semantic analytics

5 Key Takeaways

Page 3: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 3 Bernhard Lang - Corporate Technology

Siemens’ installed base of products and solution at customer sites –Tremendous amount of data that can be leveraged for new applications

The amount of data produced by Siemens products in one day

>80gigabytesper daySiemens controllersin particleaccelerator CERN

>50gigabytesper daySiemenscomputertomograph

>20gigabytesper daySiemens gasturbine

>10gigabytesper daySiemensEnergyIP smartgrid platform

>1terabytesper daySiemens trafficmanagementsystem(one city)

>160gigabytesper daySiemens windturbines

>30gigabytesper day17.000Siemens trainunits

Source: own rough estimations 2016

Page 4: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 4 Berrnhard Lang - Corporate Technology

20 years of experience in industrial data analytics applications:Selected examples for data-driven value generation

Siemens factory in Amberg

Energy Analytics

CERN Large Hadron Collider > 20 high-speed trains at Renfe Spain

Predictive Maintenance

Power plants

Condition Monitoring & Wear Prediction

Smart Grid, Seestadt Aspern, Vienna

Asset Performance Management

Deployed in > 30 steel plants

> 20 years online learning neural networks

Root Cause Analysis & Failure Prediction

Page 5: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 5 Berrnhard Lang - Corporate Technology

Outline – User Experience with KNIME ..

1 Data analytics applications for selected branches and domains

2 From data analytics applications to solution packages in KNIME

3 Application 1 of analytics framework: commodity price forecasting

4 Application 2 of analytics framework: semantic analytics

5 Key Takeaways

Page 6: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 6 Berrnhard Lang - Corporate Technology

Data Analytics Solution Packages – Let’s extract multi-purpose analyticsmodules (software) out of successful analytics applications in products

Center of competence – PortfolioSiemens Data-Driven Services require smart data analytics tools

Solution Packages for Smart Data Analytics

Purpose:

Reusable data analytics elements that can be applied for different branches and divisionsUsing open source KNIME as analytics integration environment

Benefits:

•Time to market•Cost efficiency•Standardization

•Key learnings acrossSiemens’ divisionsand branches

DescriptiveAnalytics

What happened?

DiagnosticAnalytics

Why did it happen?

PredictiveAnalytics

What will happen?

PrescriptiveAnalytics

What shall I do?

VisualAnalytics

Service Intelligence

Enterprise Search Condition Monitoring PredictiveMaintenance

Forecasting Services

Operation Planning

Diagnostic Advice Product Configuration

Autonomous Learning

Page 7: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 7 Berrnhard Lang - Corporate Technology

KNIME as a Data Analytics Workflow Editor –Integration of third party analytics tools (also Siemens’ solution packages)

Hadoop Nodes

Nodes that interact with Hadoop. The actualanalytic code is written in R and executes asMap Reduce (approx. 25 nodes alreadyavailable)

Python/C++/Java are also supported

K-means clustering

Knime workflow that executes K-Means on data residing inHadoop using Map Reduce

K-means clustering

Knime workflow that executes K-Means on data residing inTeradata using FuzzyLogix

More info about the KNIMEAnalytics Platform:

http://www.knime.org/

Page 8: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 8 Berrnhard Lang - Corporate Technology

Outline – User Experience with KNIME ..

1 Data analytics applications for selected branches and domains

2 From data analytics applications to solution packages in KNIME

3 Application 1 of analytics framework: commodity price forecasting

4 Application 2 of analytics framework: semantic analytics

5 Key Takeaways

Page 9: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 9 Berrnhard Lang - Corporate Technology

83%

74% 73% 71%67%

57%

28%

20%

30%

40%

50%

60%

70%

80%

90%

Solution Package Forecasting: Commodity Prices with Neural Networks

54'

55'15'44'

657'

[Mio. EUR]

Purchasing Volume of Exchange Traded CommoditiesFiscal Year 2014: 825’ EUR

Copper Aluminum

*Others: Tin, Zinc, Lead, Molybdenum, Gold, Platinum, Palladium

NickelOthers

Si lver

§ LME Copper: 79% of total purchasing volume§ How to cope with the increased market volatility?§ Optimal procurement decisions on the basis of accurate market

forecasts and research§ Benchmarking of different vendors for market intelligence§ Siemens internal forecast is based on Neural Networks forecast

and selected other sources§ Extension to other metals and energy

1000

2000

3000

4000

5000

6000

7000

8000

9000

10000

2000Q

1

2000Q

3

2001Q

1

2001Q

3

2002Q

1

2002Q

3

2003Q

1

2003Q

3

2004Q

1

2004Q

3

2005Q

1

2005Q

3

2006Q

1

2006Q

3

2007Q

1

2007Q

3

2008Q

1

2008Q

3

2009Q

1

2009Q

3

2010Q

1

2010Q

3

2011Q

1

2011Q

3

2012Q

1

2012Q

3

2013Q

1

2013Q

3

2014Q

1

2014Q

3

2015Q

1

LME Copper Market(Price development from 2000-2015)

USD

perM

T

Time

Market research sourcesPr

ice

Tren

dH

itR

ate

in%

6 Months Forecast of Copper Market(Evaluation from 2000-2015)

Siemensinternalforecast

NeuralNetwork(SENN)

Leading analyst and consulting bureaus providing fundamentalresearch of the copper market price development

part of

Page 10: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 10 Berrnhard Lang - Corporate Technology

§ Our neural networks model coherent markets as interacting dynamical systems§ Given the subset of the observables we reconstruct the hidden variables§ Different market structures and multiple time scale dynamics can be addressed§ From open to closed dynamical systems: The model is dynamical consistent,

symmetric in all variables and present time does not play any special role

3-ts A2-ts A

1-ts Ats

ty

A1+ts

1+ty

A2+ts

2+ty1-ty3-ty 2-ty

]0,[Id]0,[Id]0,[Id]0,[Id]0,[Id

4ty -

A]0,[Id

)tanh(1 tt ss =+ tt sy =A ]0,[Id

FX

Stocks

Metals

Energy

Bonds

hiddenvariables

4ts -

Market Price Predictions with Large Neural Networks

Zimmermann; Tietz; Grothmann: Forecasting with Recurrent Neural Networks, In: Neural Networks: Tricks of the Trade, 2nd ed.; Springer, 2012

Page 11: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 11 Berrnhard Lang - Corporate Technology

TopicRadar: A Market Mood Indicator for Commodity Price Forecasting

§ Extract named entities or events

§ query entities,

§ company names,

§ price trend indicators

Named entity and event recognition

§ Detect, locate & display relation between queryentities and events

Relation extraction

Enrich and analyze internal andexternal unstructured data

Price trend indicator

Available as structured data RelationsTime line

Page 12: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 12 Berrnhard Lang - Corporate Technology

Modular Framework of Data Analytics Solution Packages applied forCommodity Price Forecasting

Task: Forecasting of market trends supports hedging strategies and optimal procurement

Business Impact: - Identification of potential market scenarios and estimation of related market price risks- Optimal procurement decisions and decision support for competitive advantages- Forecasts are provided as a service. Performance based contracting

Page 13: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 13 Berrnhard Lang - Corporate Technology

Outline – User Experience with KNIME ..

1 Data analytics applications for selected branches and domains

2 From data analytics applications to solution packages in KNIME

3 Application 1 of analytics framework: commodity price forecasting

4 Application 2 of analytics framework: semantic analytics

5 Key Takeaways

Page 14: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Semantic Analytics.The Why and the How.Dr.-Ing. Sebastian Brandt

Siemens Corporate TechnologyUnrestricted @ Siemens AG 2017

Page 15: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 15 Bernhard Lang - Corporate Technology

The AI renaissance:the science and engineering of making intelligent machines

Memory(knowledge representation)

Text Processing

Speech recognition

Image Processing

Sensor Processing Reasoningà Draw conclusions

Learningàadapt & improve

Decision making(also in uncertainty)

Perception Cognition Decision

Creativityàgeneratehypotheses

ActionEnvironment

AB

C

Semantic knowledge fusion and reasoningfor integrated diagnosticsAutomated planning of maintenanceservice and activities

Deep learning for object recognition andlabeling in service reports

BC

Remote diagnosticcenters

Global datacenters

• Monitoring• Root cause analysis• Predictive maintenance• Reports• Global service products

Example Device Service Automation

A

Page 16: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 16 Bernhard Lang - Corporate Technology

Connecting industrial knowledge (sources)

Relational Learning (e.g. via Tensor Factorization)

Pattern Sequence Mining (e.g. via PrefixSpan)

A

ROOT

ABAA

AAA AAB AAC ABA ABB

Not frequent

Not frequent

Build in-depth prefixes

Data Sources Industrial Knowledge Graph

R&Ddata

Engineeringdata

Plantdata

Servicedata

Monitoringdata

Static aspects

Dynamic aspects

Examples for automated graph construction

Information extraction(e.g. Natural Language Processing)

Tresp, Nickel. Tensor Factorization for Multi-Relational Learning, ECML, 2013

Knowledge fusion into one coherent semantic model

B

Page 17: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 17 Bernhard Lang - Corporate Technology

Enable intuitive end-user access to industrial data

“Industrial Knowledge Graph”

R&Ddata

Engineeringdata

Plantdata

Servicedata

Monitoringdata

Static aspects

Dynamic aspects

Redesign necessary(Copy from internet)

Industrial Knowledge GraphData Sources

Query

Analytics

Normalstart?

Ontology-based Data Access

B

Page 18: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 18 Bernhard Lang - Corporate Technology

Gas turbine crash course

Rotor

Compressor Burners Power turbine

Burner

Air

Gas

Exhaust

38 MW

1 000 000 light bulbs

500 cars :-)

Page 19: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 19 Bernhard Lang - Corporate Technology

Aircraft engine(subsonic)

Aircraft engine crash course

Gas turbine

Aircraft engine(subsonic)

Page 20: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 20 Bernhard Lang - Corporate Technology

Turbine data: time series + events

Signal data• Tag• Timestamp• Number

Event messages• Category• Timestamp• Text

BSX-TC3562-XE01, 12:34:56,21.09.2015; 560

BSX-TMP12A-XE01, 12:34:57,21.09.2015; 0

BSX-TICCFB1-XE01, 12:34:55,21.09.2015; 1

BSX-TIFB2-XE02, 12:34:56,21.09.2015; 9564

Status, 12:34:56,21.09.2015, “Ignitor on”

Warning, 12:38:56,21.09.2015, “Overspeed”

up to 3000 sensors per turbine

B

Page 21: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 21 Bernhard Lang - Corporate Technology

Semantic knowledge fusion and reasoning for integrated diagnostics

BSX-TC3562-XE01

BSX-TMP12A-XE01

BSX-TICCFB1-XE01

MS-XC255-X12

BSX-TC3562-XE01

BSX-TC3562-XE01

MRR-T8901-8462

CRR-M8393-9272

“Ignitor on”

DC1, DB X1, TY2

DC2, DB X2, TY2

DC2, DB X2, TY2

DC2, DB X2, TY4

Query1

Query2

Query3

Normalstart ?

Normalstart ?

Normalstart ?

hypothetical identifiers

hypothetical identifiers

hypothetical identifiers

B

Page 22: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 22 Bernhard Lang - Corporate Technology

Query

Abstraction enables uniform solutions

BSX-TC3562-XE01

BSX-TMP12A-XE01

BSX-TICCFB1-XE01

MS-XC255-X12

BSX-TC3562-XE01

BSX-TC3562-XE01

MRR-T8901-8462

CRR-M8393-9272

“Ignitor on”

Analytics

Normalstart?Sensor types,

turbinestructure,

site con-figurations,

measurable

quantities,

Processes

Dom

ain

onto

logy

Sem

antic

map

ping

* http://optique-project.eu/

B

Page 23: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 23 Bernhard Lang - Corporate Technology

Query

Remote monitoring example:Abstraction enables uniform solutions (Optique)

Analytics

Normalstart?

BSX-TC3562-XE01

BSX-TMP12A-XE01

BSX-TICCFB1-XE01

MS-XC255-X12

BSX-TC3562-XE01

BSX-TC3562-XE01

MRR-T8901-8462

CRR-M8393-9272

“Ignitor on”

Abstraction enables uniform solutions

Sensor types,

turbinestructure,

site con-figurations,

measurable

quantities,

Processes

Dom

ain

onto

logy

Sem

antic

map

ping

* http://optique-project.eu/

B

Page 24: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 24 Bernhard Lang - Corporate Technology

Remote monitoring example:Semantics in the wider ecosystem

* http://optique-project.eu/

Images + Text

Inspections Observation sheet(hypothetical)

Digipen

Free-text Docs

NLPOCR Indexing

NLP &Deep

LearningIndexing

Query

Analytics

Normalstart?Sensor types,

turbinestructure,

site con-figurations,

measurable

quantities,

Processes

Dom

ain

onto

logy

Sem

antic

map

ping

Abstraction enables uniform solutions

B

Page 25: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 25 Bernhard Lang - Corporate Technology

Data Access & (Virtual) Integration

Controlledvocabulary

Thesaurus

Terminology /Taxonomy

Ontology

Behind theknowledgegraph…

Semantic mappings (R2RML)

Analytics servicesDashboards Query

R&Ddata

Engineeringdata

Plantdata

Servicedata

Monitoringdata

Dynamic aspects Web data Data streams

• connect knowledge graph to conventional (Big-)data sources• utilises Knowledge Graph vocabulary• supports relational database• supports Big-Data infrastructures• supports web-services, APIs, document repositories• supports federation• supports data streams• botstrapping

Static aspects

?{plantName} : sie:Plant andsie:hasCustomer ?{customerName}

select Plants.plantName, Customers.customerNamefrom Plants, Customers, Plant2CustMapwhere Plants.plantId=Plant2CustMap.plantIdand Customers.customerId=Plant2CustMap.customerId

• SPARQL• based on Knowledge Graph• no joins, columns, tables, etc.

JOIN

• lightweight: Semantic Media Wikis• proprietary: via Web Services,ETL, ….

• cross-fleet, cross-platformanalytics simplified

• ensure soundness of combined services

B

Page 26: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 26 Bernhard Lang - Corporate Technology

Remote monitoring example:Combining semantics and analytics in KNIME

Automatedreporting

Monitoring,predictive

maintenance, …

Generic KNIMEanalytics workflows

Model-based accessas KNIME building block

Page 27: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 27 Berrnhard Lang - Corporate Technology

Outline – User Experience with KNIME ..

1 Data analytics applications for selected branches and domains

2 From data analytics applications to solution packages in KNIME

3 Application 1 of analytics framework: commodity price forecasting

4 Application 2 of analytics framework: semantic analytics

5 Key Takeaways

Page 28: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 28 Berrnhard Lang - Corporate Technology

Key Takeaways

Challenge of big data is not so much the sizeor the right tool but the inconsisteny of data.

Big data analytics without domain know-howand product/context know-how often fails.

For the last years KNIME is developing fasttowards the needs of productive use J.

Page 29: User Experience with KNIME for Large Industrial Data Sets ... · 3/16/2017  · Text Processing Speech recognition Image Processing Sensor Processing Reasoning àDraw conclusions

Unrestricted © Siemens AG 2017March 2017Page 29 Berrnhard Lang - Corporate Technology

Contact Information

Bernhard LangSiemens AGCorporate Technology

Project ManagerCTI Smart Data to Business

[email protected]

+49 (173) 7337 894

Youtube ”Siemens Smart Data”:https://www.youtube.com/watch?v=ZxoO-DvHQRw

Example Semantic Analytics:

Dr.- Ing. Sebastian Brandt