The Power of Collaboration for Data Science Teams

Upload: rapidminer

Post on 17-Jan-2017

Page 1: The Power of Collaboration for Data Science Teams

©2016 RapidMiner, Inc. All rights reserved. - 1 -

The Power of Collaboration for Data Science Teams

A RapidMiner Server Showcase

Tom Ott, Marketing Data Scientist

RapidMiner

Page 2: The Power of Collaboration for Data Science Teams

Today’s Agenda
• Introduction
• Overview of RapidMiner Server
• Demo
• Q&A

Page 3: The Power of Collaboration for Data Science Teams

INTRODUCTION

Page 4: The Power of Collaboration for Data Science Teams

[Diagram: Unified Data Science Platform]
Products: RapidMiner Studio, RapidMiner Server, RapidMiner Radoop
Stages: Data Prep, Model & Validate, Operationalize
Inputs: Data, BIG Data
Diagram labels: Visual Workflow Designer; Guided Analytics; 1500+ Functions; Process Execution Engine; Process Scheduler; Shared Repository; User Management; Web App Portal; Web Services; Compile + Execute Workflows in Hadoop & Spark; Run in multiple Compute Engines; R / Python / SQL Scripting; In-Memory/H2O/Weka; DESIGN; COLLABORATE

Page 8: The Power of Collaboration for Data Science Teams

Introduction
What is RapidMiner Server?
• It’s a collaboration engine
• It’s a compute engine
• It’s a deployment engine

Page 12: The Power of Collaboration for Data Science Teams

Introduction
Why Collaboration?
• Ability to share data sets / connections
• Ability to create/share processes and models
• Create and work in groups
• Build libraries

Page 13: The Power of Collaboration for Data Science Teams

OVERVIEW

Page 18: The Power of Collaboration for Data Science Teams

RapidMiner Server Overview
General
• Can be installed on Linux/Windows
• Needs a database as a backend
• Supports LDAP/AD
• Lets you “offload” RapidMiner Studio and Radoop processes
• Native dashboarding capability (HTML5-based)

Page 22: The Power of Collaboration for Data Science Teams

RapidMiner Server Overview
Technical
• Java 8 / Runs on JBoss
• Database backend can be open source or proprietary (needs JDBC driver)
• Can be installed anywhere on your network (needs IP address)
• Simple, step-by-step installation process

Page 23: The Power of Collaboration for Data Science Teams

DEMO

Page 25: The Power of Collaboration for Data Science Teams

Next Steps
• Download RapidMiner Server here
• Follow installation instructions here
• Installing it headless? Read the instructions here
• Installing LDAP/AD? Read the instructions here

Tip: RapidMiner Studio and Server must be the same version. RM Studio 5 can’t talk to RM Server 7!
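
The version tip can be turned into a quick pre-flight check before connecting Studio to Server. This is only a minimal sketch in Python. It assumes versions are dotted strings and that matching on the major version is what matters, which the slide does not spell out. The helper names and the version numbers are hypothetical and are not part of any RapidMiner API.

```python
def major_version(version: str) -> int:
    """Return the leading (major) component of a dotted version string."""
    return int(version.strip().split(".")[0])


def same_major_version(studio_version: str, server_version: str) -> bool:
    """Hypothetical check: True when Studio and Server share a major version.

    Assumption: comparing major versions is enough; the slide only says
    the two products must be "the same version".
    """
    return major_version(studio_version) == major_version(server_version)


if __name__ == "__main__":
    # Illustrative version strings, mirroring the slide's Studio 5 vs. Server 7 case.
    print(same_major_version("5.3", "7.3"))
    print(same_major_version("7.3.1", "7.3.0"))
```

If your environment requires an exact match, compare the full version strings instead of only the major component.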

Page 26: The Power of Collaboration for Data Science Teams

[email protected]

+1 (617 41-7708

Q&A

Thomas Ott

@neuralmarket

Contact Us
Email: [email protected]
Visit: http://rapidminer.com/contact-sales-request-demo/