microsoft machine learning smackdown

21
Machine Learning Smackdown Lynn Langit May 7-9, 2014 | San Jose, CA

Upload: lynn-langit

Post on 14-May-2015

3.728 views

Category:

Technology


2 download

DESCRIPTION

comparison of Excel add-ins and other solutions for implementing data mining or machine learning solutions on the Microsoft stack - includes coverage of XLMiner, Analysis Services Data Mining and PredixionSoftware

TRANSCRIPT

Page 1: Microsoft Machine Learning Smackdown

Machine Learning Smackdown

Lynn Langit

May 7-9, 2014 | San Jose, CA

Page 2: Microsoft Machine Learning Smackdown

Agenda

Goal: Survey ML tools/methods that you can actually use on the Microsoft stack

• Definitions• Tools I – Understanding 3rd party Excel Machine Learning Add-ins• Tools II – Using the Microsoft SQL Server SSAS & Data Mining Add-

ins• Tools III – Using Predixion Software • Recap and Call To Action

3

Page 3: Microsoft Machine Learning Smackdown

TermsGoal: Create common definitions of key terms

• Business Analytics• Query • Aggregation

• Predictive Analytics• Machine Learning

• Statistics• Unsupervised Data Mining• Supervised Data Mining• Other

4

Page 4: Microsoft Machine Learning Smackdown

What does the market look like now?

5

Regular AnalyticsUnsupervised DMSupervised DMMachine Learning

Page 5: Microsoft Machine Learning Smackdown

CRISP DM Lifecycle applied to ML

6

Page 6: Microsoft Machine Learning Smackdown

7

Machine Learning – an Example

Page 7: Microsoft Machine Learning Smackdown

About 3rd party Excel Machine Learning Add-insWhat are they? Toolbars in Excel – many different offerings

• XLMiner• StatsMiner• XLStat• RExcel

8

Important: All of these tools assume expert statistical knowledge

Page 8: Microsoft Machine Learning Smackdown

9

An aside…about R Language

Page 9: Microsoft Machine Learning Smackdown

Viewing 3rd Party Add-ins XLMiner

Page 10: Microsoft Machine Learning Smackdown

About the Data Mining Add-ins For ExcelWhat is it? Free add-ins which add menus to use SSAS Analysis Services Data Mining

• Table Analysis Tools for Excel• Use mining models with Excel data or external data

• Data Mining Client for Excel• Create/test/explore/manage Mining Models

• Data Mining Templates for Visio• Render/share mining models as Visio Drawings

11

Important: Use requires connection to SQL Server 2012 SSAS

Page 11: Microsoft Machine Learning Smackdown

Using the Data Mining Add-ins

for Excel

DEMO

Page 12: Microsoft Machine Learning Smackdown

Checking Understanding…

Data Mining Structures• Containers for cleansed source data

Data Mining Models• Child containers for source data

plus one mining algorithm• SSAS Algorithms - Clustering, Time

Series Prediction, Market-Basket Analysis, Text Mining and Neural Networks

Model Verification, Processing and Usage Tools• Model query, Model processing

13

Page 13: Microsoft Machine Learning Smackdown

About Predixion SoftwareWhat is it? Suite of tools for predictive analytics

• Insight Now• Use mining models with Excel data or external data

• Insight Analytics• Create/test/explore/manage Mining Models

• Insight Workbench• Prepare data for model creation

• Web-based Viewers and Tools

14

Important: Runs as EITHER connected to SSAS on premise OR Connected to Predixion’s cloud-based servers

Page 14: Microsoft Machine Learning Smackdown

Using Predixion Software

DEMO

Page 15: Microsoft Machine Learning Smackdown

16

Page 16: Microsoft Machine Learning Smackdown

Understanding options…

17

Add-inServer Required

Complexity of install

OtherCost of Add-in

Cost of Solution

XLMiner none easy Assumes stats expertise

$$ $$

RExcel none easy Assumes R expertise $ $

Data Mining Add-ins

SQL Server SSAS

medium Designed for single user

0 $$$

Predixion on premise

SQL Express easy Requires local R install 0 $$-$$$

Predixion on premise

SQL Server SSAS

medium Your data is stored locally

0 $$$$

Predixion cloud none easy Supports SSAS Data Mining AND R Language

0 $$-$$$

Page 17: Microsoft Machine Learning Smackdown

18

Machine Learning Skills

Data Scientist

Store

Clean

Aggregate

ML Engineers

Selects Libraries

Applies Algorithms

Creates Solutions

ML ResearcherCreates Algorithms

Page 18: Microsoft Machine Learning Smackdown

19

Learning Paths – ML Developers

Learn a language… DMX, PAX, R, Mahout, JuliaPick your IDE, tools… SSAS, Predixion, R-Studio, WekkaPick a problem space… Marketing, Health, FinancialFind (purchase)/gather/prepare some data…

GO!

(Visualize results)

Page 19: Microsoft Machine Learning Smackdown

20

Call to Action – ML Decision Makers

• Pick one or more solutions

• Gather source data

• Prepare source data

• Try out some data mining algorithms

Evaluate it Understand it• Understand tooling costs

• Understand learning costs

• Understand data gathering costs

• Understand data preparation costs

• Understand data cleansing costs

• Understand value of results

Page 20: Microsoft Machine Learning Smackdown

JOIN US to stay ahead of the curve in the changing world of analytics with:

• 70+ sessions by the world’s top BI and BA experts• 20 hours of networking opportunities with 1,000 professionals in the analytics

community• Real-world insights into analytics strategies from leading companies 

Get $300 off using discount code: BACSV

Calling All Data Professionals

Page 21: Microsoft Machine Learning Smackdown

ThankYou SoCalDevGal on