introduction to knime

13
Introduction to January 23, 2017 Ali Arabi – Bernie Najlis

Upload: bernardo-najlis

Post on 03-Mar-2017

208 views

Category:

Technology


31 download

TRANSCRIPT

Page 1: Introduction to knime

Introduction to

January 23, 2017Ali Arabi – Bernie Najlis

Page 2: Introduction to knime

Agenda• What is Knime• Where to get it and online resources• What can I do with Knime• How does it compare with similar tools• Knime Lingo• Knime Workbench• How to build a Workflow• Samples and Questions

Page 3: Introduction to knime

What is Knime?• KNIME stands for Konstanz Information Miner• It is an Open Source Data Analytics, Reporting and Integration platform• Use a GUI to assembly ‘nodes’ for data preprocessing (ETL), modelling and

data analysis and visualization• Modules for:• Data Mining• Data Analysis• Data Manipulation• More modules and extensions can be added!

• Written in Java and based on Eclipse

Page 4: Introduction to knime

Where to get it and other online resources• http://knime.org/downloads/overview

• Skip the registration form, go straight to step (2) and download the version with all free extensions (~2Gb)

• Community Forum and Online Self Training• Books

KNIME EssentialsBy: Gábor BakosPublisher: Packt PublishingPub. Date: October 16, 2013Print ISBN-13: 978-1-84969-921-1Web ISBN-13: 978-1-84969-922-8Pages in Print Edition: 148

• VideosIntroduction to Data Analytics with KNIMEBy: Rosaria SilipoPublisher: Infinite SkillsPublication Date: 20-SEP-2016Insert Date: 26-SEP-2016

Page 5: Introduction to knime

What can I do with Knime?• Data Access

• File• Database I/O

• Transformation• Filtering, Grouping, Joining

• Analyze and Data mining• Weka• R• Python• Mathlab

• Visualization• Different types of charts

• Deployment• Text mining

Page 6: Introduction to knime

How does Knime compare with others?• Gartner’s Magic Quadrant for

Advance Analytics Platforms• Leaders quadrant in 2016 with SAS, IBM

and Dell• Strong Performer / Contender in

Forrester’s Wave

Page 7: Introduction to knime

Knime Lingo• Store your work in a workspace• Workspace can contain workflow groups built using the workflow editor • Workflows can contain nodes, meta nodes, connections, workflow variables, workflow

credentials and annotations• Each node has a type, which identifies the algorithm associated with it• Nodes have parameters, inports and

outports, and can have any of these states:• Misconfigured• Configured• Queued for Execution• Running• Executed

Page 8: Introduction to knime

Knime Workbench• Workflow Projects• Favorite Nodes• Node Repository• Workflow Editor• Outline• Node Description• Console

Page 9: Introduction to knime

How to Build a Knime Workflow• Search in Node Repository• Dragging nodes into Workflow Editor• Connecting Nodes• Configuring Nodes• Executing (per node or one-shot)

=> Configure => => Execute =>

Page 10: Introduction to knime

Simple Model Training for Classification

Page 11: Introduction to knime

Performing k-means Clustering

Page 12: Introduction to knime

Example for Data Preprocessing

Page 13: Introduction to knime

Example of R Snippet