finding hidden intelligence with predictive analysis of data mining rafal lukawiecki strategic...

30
Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd [email protected]

Upload: john-hunt

Post on 22-Dec-2015

222 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

Finding Hidden Intelligence with Predictive Analysis of Data MiningRafal LukawieckiStrategic Consultant, Project Botticelli [email protected]

Page 2: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

2

Objectives

• Show use of Microsoft SQL Server 2008 Analysis Services Data Mining

• Tantalise you with the power of DM

This seminar is based on a number of sources including a few dozen of Microsoft-owned presentations, used with permission. Thank you to Marin Bezic, Kathy Sabourin, Aydin Gencler, Bryan Bredehoeft, and Chris Dial for all the support. Thank you to Maciej Pilecki for assistance with demos.

The information herein is for informational purposes only and represents the opinions and views of Project Botticelli and/or Rafal Lukawiecki. The material presented is not certain and may vary based on several factors. Microsoft makes no warranties, express, implied or statutory, as to the information in this presentation.

Portions © 2009 Project Botticelli Ltd & entire material © 2009 Microsoft Corp. Some slides contain quotations from copyrighted materials by other authors, as individually attributed or as already covered by Microsoft Copyright ownerships. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Project Botticelli Ltd as of the date of this presentation.  Because Project Botticelli & Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft and Project Botticelli cannot guarantee the accuracy of any information provided after the date of this presentation. Project Botticelli makes no warranties, express, implied or statutory, as to the information in this presentation. E&OE.

Page 3: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

3

Agenda

• Data Mining and Predictive Analytics• Server and Process Considerations• Scenarios & Demos

Page 4: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

4

What does Data Mining Do?

Explores Your Data

Finds Patterns

Performs Prediction

s

Page 5: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

5

Typical Uses

Data Mining

Seek Profitable Customers

Understand Customer

Needs

Anticipate Customer

ChurnPredict Sales &

Inventory

Build Effective Marketing Campaigns

Detect and Prevent Fraud

Correct Data

During ETL

Page 6: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

6

Analysis ServicesServer

Mining Model

Data Mining Algorithm DataSource

Server Mining Architecture

Excel/Visio/SSRS/Your App

OLE DB/ADOMD/XMLA

Deploy

BIDSExcelVisioSSMS

AppData

Page 7: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

7

Mining Model Mining ModelMining Model

Mining Process

DM EngineDM Engine

Training data

Data to be predictedMining Model

With predictions

Page 8: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

8

SCENARIO: CUSTOMER CLASSIFICATION & SEGMENTATION

Who are our customers? Are there any relationships between their demographics and their buying power?

Page 9: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

9

Microsoft Decision Trees

• Use for:• Classification:

churn and risk analysis

• Regression: predict profit or income

• Association analysis based on multiple predictable variable

• Builds one tree for each predictable attribute

• Fast

Page 10: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

10

Decision Trees for Classification of Customers’ Buying Potential

Demo

Page 11: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

11

SCENARIO: PROFITABILITY AND RISK

Who are our most profitable customers? Can I predict profit of a future customer based on demographics? Are they creditworthy? How much should I charge them to give a good loan and protect against losses?

Page 12: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

12

Profitability and Risk

• Finding what makes a customer profitable is also classification or regression

• Typically solved with:• Decision Trees (Regression), Linear Regression,• and Neural Networks or Logistic Regression

• Often used for prediction• Important to predict probability of the predicted,

or expected profit• Risk scoring

• Logistic Regression and Neural Networks

Page 13: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

13

Neural Network & Logistic Regression• Applied to

• Classification• Regression

• Great for finding complicated relationship among attributes• Difficult to interpret

results• Gradient Descent

method• LR is NNet with no

hidden layers

Age Education Sex Income

Input Layer

Hidden Layers

Output Layer

Loyalty

Page 14: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

14

1. Neural Networks for Profitability Analysis2. Predicting Lending Risk with Neural Networks

Demo

Page 15: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

15

SCENARIO: CUSTOMER NEEDS ANALYSIS

How do they behave? What are they likely to do once they bought that really expensive car? Should I intervene?

Page 16: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

16

Sequence Clustering

• Analysis of:• Customer behaviour• Transaction patterns• Click stream• Customer

segmentation• Sequence prediction

• Mix of clustering and sequence technologies• Groups individuals

based on their profiles including sequence data

Page 17: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

17

Analysis Customer Behaviour with Sequence Clustering

Demo

Page 18: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

18

SCENARIO: FORECASTING

What are my sales going to be like in the next few months? Will I have credit problems? Will my server need an upgrade in the next 3 months?

Page 19: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

19

Time Series

• Uses:• Forecast sales• Inventory

prediction• Web hits

prediction• Stock value

estimation• Regression trees

with extras

Page 20: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

20

Forecasting Using Time Series

Demo

Page 21: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

23

Summary

• Data Mining is a powerful, predictive technology• Turns data into valuable, decision-making

knowledge• SQL Server 2008 Analysis Services support

Predictive Analytics

• Mine your mountains of data for gems of intelligence today!

Page 22: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

Summary and Q&ARafal LukawieckiStrategic Consultant, Project Botticelli [email protected]

Page 23: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

25

BI & PM in an Enterprise

Data Sources

Staging Area

Manual Cleansing

Data Marts

Data Warehouse

Client Access

Client Access

1: Clients need access to data 2: Clients may access data sources directly 3: Data sources can be mirrored/replicated to reduce contention 4: The data warehouse manages data for analyzing and reporting 5: Data warehouse is periodically populated from data sources 6: Staging areas may simplify the data warehouse population 7: Manual cleansing may be required to cleanse dirty data 8: Clients use various tools to query the data warehouse 9: Delivering BI enables a process of continuous business improvement

Page 24: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

26

Want Powerful BI Applications?

• You need a well designed Data Warehouse!

• Want BI Apps quickly with self-service abilities?• Ensure good dimensional design:

• Easy to understand for a knowledge worker• Flexible• Correct and aligned

Page 25: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

27

Three Contexts of BI Use

Personal BIBuilt by me, for me, used only by me

Team BIBuilt by someone on the team, for the team’s use

Organizational BI Built and maintained by IT, for use across company

1

2

3

Page 26: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

28

Integrated BI Platform

Page 27: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

29

Resources

• Project Botticelli at your service!• Training, mentoring, “do-it-with-you” on-the-job assistance

with all BI and SQL needs• Email me at [email protected]

• Home: www.microsoft.com/bi

• Demos on www.sqlserveranalysisservices.com, www.sqlserverdatamining.com, www.codeplex.com

• More demos and sessions at www.microsoft.com/technetspotlight

Page 28: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

30

Q&A

Page 29: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

31

Thank You!Please email your comments or requests to

[email protected]

Page 30: Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com

32

© 2009 Microsoft Corporation & Project Botticelli Ltd. All rights reserved.

The information herein is for informational purposes only and represents the opinions and views of Project Botticelli and/or Rafal Lukawiecki. The material presented is not certain and may vary based on several factors. Microsoft makes no warranties, express, implied or statutory, as to the information in this presentation.

Portions © 2009 Project Botticelli Ltd & entire material © 2009 Microsoft Corp. Some slides contain quotations from copyrighted materials by other authors, as individually attributed or as already covered by Microsoft Copyright ownerships. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Project Botticelli Ltd as of the date of this presentation.  Because Project Botticelli & Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft and Project Botticelli cannot guarantee the accuracy of any information provided after the date of this presentation. Project Botticelli makes no warranties, express, implied or statutory, as to the information in this presentation. E&OE.