rm world 2014: design and implementation of data mining case studies

17
Dr. Matthew North Professor of Business & Information Systems The College of Idaho RapidMinerWorld 2014 Boston, USA Design and Implementation of Data Mining Case Studies in RapidMiner

Upload: rapidminer

Post on 18-Nov-2014

117 views

Category:

Documents


2 download

DESCRIPTION

 

TRANSCRIPT

Page 1: RM World 2014: Design and implementation of data mining case studies

Dr. Matthew North

Professor of Business & Information Systems

The College of Idaho

RapidMinerWorld 2014

Boston, USA

Design and Implementation of Data

Mining Case Studies in RapidMiner

Page 2: RM World 2014: Design and implementation of data mining case studies

W&J/College of Idaho

Page 3: RM World 2014: Design and implementation of data mining case studies

A Focus on Teaching & Learning

Lots of folks do data mining

Lots of those use RapidMiner

Data mining education

Younger than the discipline

Strange collection of options

Science? Business? Math?

Page 4: RM World 2014: Design and implementation of data mining case studies

A Focus on Teaching & Learning

2005 – Present:

Books!

Tools!

Weka, Alphaminer, Clementine, more…

Education

Master’s, Certificates, Boot Camps

Data Mining for the Masses (2012)

Data Mining Cases in RapidMiner (2013)

Page 5: RM World 2014: Design and implementation of data mining case studies

A Focus on Teaching & Learning

Page 6: RM World 2014: Design and implementation of data mining case studies

The Case Method

Cases give context

My first Clementine class

Cases build on prior knowledge

Central Tendency > k-Means Clustering

Cases use Learning Theory

Concept Attainment

Page 7: RM World 2014: Design and implementation of data mining case studies

The Anatomy of a Data Mining Case

ActivationStimulate prior knowledge/learningRelevant to the data mining task

AdditionIntroduce the new conceptK-Means Clustering

ComparisonGood/poor examples

Conclusions

Page 8: RM World 2014: Design and implementation of data mining case studies

RapidMiner World/Boston Example

Activation: Welcome to Boston!

There’s a lot to do here

Lots of cool/smart people

After hours connections can be valuable

Can data mining help make an effective

fun/work connection?

Maybe so, if we rate options and then

build option clusters

Page 9: RM World 2014: Design and implementation of data mining case studies

RapidMiner World/Boston Example

Addition: Options + Data = Choice

List our options, then rate from 0-3

across various types of fun

Page 10: RM World 2014: Design and implementation of data mining case studies

RapidMiner World/Boston Example

Addition: Modeling the data

Page 11: RM World 2014: Design and implementation of data mining case studies

RapidMiner World/Boston Example

Comparison: What do you see?

Page 12: RM World 2014: Design and implementation of data mining case studies

RapidMiner World/Boston Example

Conclusions: So what?

Does this help you make a decision?

How can you fine tune your model?

To what other problems/datasets could

you apply what you’ve learned?

Page 13: RM World 2014: Design and implementation of data mining case studies

Response to Reviewers

Use of a toy example

Transfer of knowledge to other

scenarios is ideal

Sometimes a little help is good…

Page 14: RM World 2014: Design and implementation of data mining case studies

Loan Analyst Example

Activation:

You review loans looking for red flags

You know how to spot anomalies

Your work is time-consuming

Addition:

Problem loans don’t look like average ones

K-Means Clustering uses averages

Averages help create different groups

Page 15: RM World 2014: Design and implementation of data mining case studies

Loan Analyst Example

Comparison:Build a k-Means model with your loan data

You’re the expert, what do you see?Compare your standard method results to the

data mining results

Conclusions:Is the model useful?

Can it speed up your identification of problem loans?

Page 16: RM World 2014: Design and implementation of data mining case studies

Conclusions

Cases are fun/interesting

Cases are accessible to area experts

Learning data mining is often the hurdle

RapidMiner makes data mining

accessible to non-experts

Now…..

Page 17: RM World 2014: Design and implementation of data mining case studies

Who’s Ready to Hit the Town?!?