Lecture Slides for
INTRODUCTION TO MACHINE LEARNING, 2nd Edition
CHAPTER 3: Bayesian Decision Theory

ETHEM ALPAYDIN © The MIT Press, 2010
alpaydin@boun.edu.tr
http://www.cmpe.boun.edu.tr/~ethem/i2ml2e
Probability and Inference

Result of tossing a coin is in {Heads, Tails}.
Random variable $X \in \{1, 0\}$
Bernoulli: $P\{X = x\} = p_o^x\,(1 - p_o)^{1 - x}$
Sample: $\mathcal{X} = \{x^t\}_{t=1}^{N}$
Estimation: $\hat{p}_o = \#\{\text{Heads}\} / \#\{\text{Tosses}\} = \sum_t x^t / N$
Prediction of next toss: Heads if $\hat{p}_o > 1/2$, Tails otherwise
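A minimal Python sketch of this estimate; the sample of tosses is made-up illustrative data:

```python
# Maximum likelihood estimate of p_o: the fraction of heads.
tosses = [1, 0, 1, 1, 0, 1, 0, 1]        # 1 = Heads, 0 = Tails (assumed data)

p_hat = sum(tosses) / len(tosses)         # p_o estimate = sum_t x^t / N
print("p_o estimate:", p_hat)             # 0.625
print("predict", "Heads" if p_hat > 0.5 else "Tails")
```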
Classification

Credit scoring: Inputs are income and savings; output is low-risk vs. high-risk.
Input: $\mathbf{x} = [x_1, x_2]^T$, Output: $C \in \{0, 1\}$
Prediction:

choose $C = 1$ if $P(C = 1 \mid x_1, x_2) > 0.5$, and $C = 0$ otherwise

or equivalently

choose $C = 1$ if $P(C = 1 \mid x_1, x_2) > P(C = 0 \mid x_1, x_2)$, and $C = 0$ otherwise
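As a quick illustration, a minimal Python sketch of this threshold rule; the posterior value is an assumed number, since computing it from data is what Bayes' rule (next slide) provides:

```python
# Threshold the posterior P(C = 1 | x1, x2) at 0.5.
p_c1 = 0.62                       # P(C = 1 | x1, x2), assumed given
C = 1 if p_c1 > 0.5 else 0        # same as P(C=1|x) > P(C=0|x)
print("choose C =", C)
```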
Bayes' Rule

$$P(C \mid \mathbf{x}) = \frac{p(\mathbf{x} \mid C)\,P(C)}{p(\mathbf{x})}
\qquad \text{posterior} = \frac{\text{likelihood} \times \text{prior}}{\text{evidence}}$$

$$P(C = 0) + P(C = 1) = 1$$
$$p(\mathbf{x}) = p(\mathbf{x} \mid C = 1)\,P(C = 1) + p(\mathbf{x} \mid C = 0)\,P(C = 0)$$
$$P(C = 0 \mid \mathbf{x}) + P(C = 1 \mid \mathbf{x}) = 1$$
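A minimal numeric sketch of the rule in Python; the prior and the two likelihood values at the observed x are assumed numbers, not taken from any dataset:

```python
prior = {0: 0.6, 1: 0.4}             # P(C), assumed
likelihood = {0: 0.05, 1: 0.20}      # p(x | C) at the observed x, assumed

evidence = sum(likelihood[c] * prior[c] for c in (0, 1))   # p(x)
posterior = {c: likelihood[c] * prior[c] / evidence for c in (0, 1)}

print(posterior)                     # P(C=0|x) + P(C=1|x) == 1
print("choose C =", max(posterior, key=posterior.get))
```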
Bayes' Rule: K > 2 Classes

$$P(C_i \mid \mathbf{x}) = \frac{p(\mathbf{x} \mid C_i)\,P(C_i)}{p(\mathbf{x})}
 = \frac{p(\mathbf{x} \mid C_i)\,P(C_i)}{\sum_{k=1}^{K} p(\mathbf{x} \mid C_k)\,P(C_k)}$$

with $P(C_i) \ge 0$ and $\sum_{i=1}^{K} P(C_i) = 1$

choose $C_i$ if $P(C_i \mid \mathbf{x}) = \max_k P(C_k \mid \mathbf{x})$
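The same computation generalizes directly to K classes; the priors and likelihoods below are again illustrative assumptions:

```python
priors = [0.5, 0.3, 0.2]                 # P(C_i), sums to 1 (assumed)
likelihoods = [0.10, 0.40, 0.25]         # p(x | C_i) at the observed x

joint = [l * p for l, p in zip(likelihoods, priors)]
evidence = sum(joint)                    # p(x) = sum_k p(x|C_k) P(C_k)
posteriors = [j / evidence for j in joint]

i_star = max(range(len(posteriors)), key=posteriors.__getitem__)
print(f"choose C_{i_star} with P(C_{i_star}|x) = {posteriors[i_star]:.3f}")
```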
Losses and Risks

Actions: $\alpha_i$
Loss of $\alpha_i$ when the state is $C_k$: $\lambda_{ik}$
Expected risk (Duda and Hart, 1973):

$$R(\alpha_i \mid \mathbf{x}) = \sum_{k=1}^{K} \lambda_{ik}\,P(C_k \mid \mathbf{x})$$

choose $\alpha_i$ if $R(\alpha_i \mid \mathbf{x}) = \min_k R(\alpha_k \mid \mathbf{x})$
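A sketch of the risk computation in Python; the loss matrix and the posteriors are illustrative assumptions:

```python
posteriors = [0.2, 0.5, 0.3]             # P(C_k | x), assumed
loss = [[0, 1, 2],                       # lambda_ik: loss of action a_i
        [1, 0, 1],                       # when the true state is C_k
        [2, 1, 0]]                       # (assumed values)

risks = [sum(l * p for l, p in zip(row, posteriors)) for row in loss]
i_star = min(range(len(risks)), key=risks.__getitem__)
print(f"choose alpha_{i_star} (risk {risks[i_star]:.2f})")
```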
Losses and Risks: 0/1 Loss

$$\lambda_{ik} = \begin{cases} 0 & \text{if } i = k \\ 1 & \text{if } i \ne k \end{cases}$$

$$R(\alpha_i \mid \mathbf{x}) = \sum_{k=1}^{K} \lambda_{ik}\,P(C_k \mid \mathbf{x})
 = \sum_{k \ne i} P(C_k \mid \mathbf{x}) = 1 - P(C_i \mid \mathbf{x})$$

For minimum risk, choose the most probable class.
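A quick numeric check that 0/1 loss reduces risk minimization to picking the most probable class; the posteriors are assumed values:

```python
posteriors = [0.2, 0.5, 0.3]             # P(C_k | x), assumed
K = len(posteriors)

loss01 = [[0 if i == k else 1 for k in range(K)] for i in range(K)]
risks = [sum(loss01[i][k] * posteriors[k] for k in range(K)) for i in range(K)]

# R(a_i | x) equals 1 - P(C_i | x) for every i
assert all(abs(r - (1 - p)) < 1e-12 for r, p in zip(risks, posteriors))
print("choose C_%d" % max(range(K), key=posteriors.__getitem__))
```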
Losses and Risks: Reject

$$\lambda_{ik} = \begin{cases} 0 & \text{if } i = k \\ \lambda & \text{if } i = K + 1 \\ 1 & \text{otherwise} \end{cases}
\qquad 0 < \lambda < 1$$

$$R(\alpha_{K+1} \mid \mathbf{x}) = \sum_{k=1}^{K} \lambda\,P(C_k \mid \mathbf{x}) = \lambda$$
$$R(\alpha_i \mid \mathbf{x}) = \sum_{k \ne i} P(C_k \mid \mathbf{x}) = 1 - P(C_i \mid \mathbf{x})$$

choose $C_i$ if $P(C_i \mid \mathbf{x}) > P(C_k \mid \mathbf{x})$ for all $k \ne i$ and $P(C_i \mid \mathbf{x}) > 1 - \lambda$;
reject otherwise
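A sketch of the resulting accept/reject rule, with an assumed λ and assumed posteriors:

```python
lam = 0.2                                # reject loss, 0 < lambda < 1 (assumed)
posteriors = [0.4, 0.35, 0.25]           # P(C_k | x), assumed

i_star = max(range(len(posteriors)), key=posteriors.__getitem__)
if posteriors[i_star] > 1 - lam:         # accept only a confident winner
    print(f"choose C_{i_star}")
else:
    print("reject")                      # here 0.4 <= 0.8, so reject
```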
Discriminant Functions

$g_i(\mathbf{x})$, $i = 1, \ldots, K$; choose $C_i$ if $g_i(\mathbf{x}) = \max_k g_k(\mathbf{x})$

$$g_i(\mathbf{x}) = \begin{cases} -R(\alpha_i \mid \mathbf{x}) \\ P(C_i \mid \mathbf{x}) \\ p(\mathbf{x} \mid C_i)\,P(C_i) \end{cases}$$

K decision regions $\mathcal{R}_1, \ldots, \mathcal{R}_K$:
$$\mathcal{R}_i = \{\mathbf{x} \mid g_i(\mathbf{x}) = \max_k g_k(\mathbf{x})\}$$
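A sketch using the third form, $g_i(\mathbf{x}) = p(\mathbf{x} \mid C_i)\,P(C_i)$, which gives the same argmax while avoiding the evidence term; all values are assumed:

```python
priors = [0.2, 0.5, 0.3]                 # P(C_i), assumed
likelihoods = [0.40, 0.10, 0.20]         # p(x | C_i) at the observed x

g = [l * p for l, p in zip(likelihoods, priors)]   # g_i(x) = p(x|C_i) P(C_i)
print("x falls in region R_%d" % max(range(len(g)), key=g.__getitem__))
```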
K = 2 Classes

Dichotomizer (K = 2) vs. polychotomizer (K > 2)

$$g(\mathbf{x}) = g_1(\mathbf{x}) - g_2(\mathbf{x})$$

choose $C_1$ if $g(\mathbf{x}) > 0$, and $C_2$ otherwise

Log odds: $\log \dfrac{P(C_1 \mid \mathbf{x})}{P(C_2 \mid \mathbf{x})}$
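A sketch of the log-odds dichotomizer with assumed posteriors:

```python
from math import log

p_c1, p_c2 = 0.7, 0.3                    # P(C_1|x), P(C_2|x), assumed
g = log(p_c1 / p_c2)                     # log odds; > 0 iff P(C_1|x) > P(C_2|x)
print("choose C_1" if g > 0 else "choose C_2")
```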
Utility Theory

Probability of state $S_k$ given evidence $\mathbf{x}$: $P(S_k \mid \mathbf{x})$
Utility of $\alpha_i$ when the state is $S_k$: $U_{ik}$
Expected utility:

$$EU(\alpha_i \mid \mathbf{x}) = \sum_k U_{ik}\,P(S_k \mid \mathbf{x})$$

choose $\alpha_i$ if $EU(\alpha_i \mid \mathbf{x}) = \max_j EU(\alpha_j \mid \mathbf{x})$
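A sketch of expected-utility maximization; the utility matrix and state posteriors are illustrative assumptions:

```python
posteriors = [0.3, 0.7]                  # P(S_k | x), assumed
utility = [[100, -50],                   # U_ik: utility of action a_i
           [0, 0]]                       # when the true state is S_k (assumed)

eu = [sum(u * p for u, p in zip(row, posteriors)) for row in utility]
i_star = max(range(len(eu)), key=eu.__getitem__)
print(f"choose alpha_{i_star} (EU {eu[i_star]:.1f})")
```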
Association Rules

Association rule: X → Y
People who buy/click/visit/enjoy X are also likely to buy/click/visit/enjoy Y.
A rule implies association, not necessarily causation.
Association Measures

Support (X → Y):
$$P(X, Y) = \frac{\#\{\text{customers who bought } X \text{ and } Y\}}{\#\{\text{customers}\}}$$

Confidence (X → Y):
$$P(Y \mid X) = \frac{P(X, Y)}{P(X)}
 = \frac{\#\{\text{customers who bought } X \text{ and } Y\}}{\#\{\text{customers who bought } X\}}$$

Lift (X → Y):
$$\frac{P(X, Y)}{P(X)\,P(Y)} = \frac{P(Y \mid X)}{P(Y)}$$
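The three measures computed over a toy transaction list; the baskets are made-up data and X, Y stand in for two items:

```python
baskets = [{"X", "Y"}, {"X"}, {"X", "Y"}, {"Y"}, {"X", "Y", "Z"}]
n = len(baskets)

n_x  = sum("X" in b for b in baskets)
n_y  = sum("Y" in b for b in baskets)
n_xy = sum("X" in b and "Y" in b for b in baskets)

support    = n_xy / n                               # P(X, Y)
confidence = n_xy / n_x                             # P(Y | X)
lift       = (n_xy / n) / ((n_x / n) * (n_y / n))   # P(X,Y) / P(X) P(Y)
print(f"support={support:.2f} confidence={confidence:.2f} lift={lift:.2f}")
```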
Apriori Algorithm (Agrawal et al., 1996)

For (X, Y, Z), a 3-item set, to be frequent (have enough support), (X, Y), (X, Z), and (Y, Z) must all be frequent.
If (X, Y) is not frequent, none of its supersets can be frequent.
Once we find the frequent k-item sets, we convert them to rules: X, Y → Z, ... and X → Y, Z, ...
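A compact Python sketch of the level-wise search these properties make possible; the baskets and the support threshold are made-up values:

```python
from itertools import combinations

baskets = [{"X", "Y"}, {"X", "Z"}, {"X", "Y", "Z"}, {"Y", "Z"}]
min_support = 0.5                        # assumed threshold
items = sorted(set().union(*baskets))

def support(itemset):
    return sum(itemset <= b for b in baskets) / len(baskets)

# Level 1: frequent single items.
frequent = [frozenset([i]) for i in items
            if support(frozenset([i])) >= min_support]
k = 2
while frequent:
    print(k - 1, [sorted(s) for s in frequent])
    prev = set(frequent)
    # Candidate k-sets: unions of frequent (k-1)-sets, kept only when
    # every (k-1)-subset is frequent (the Apriori pruning property).
    candidates = {a | b for a in prev for b in prev if len(a | b) == k}
    candidates = {c for c in candidates
                  if all(frozenset(s) in prev for s in combinations(c, k - 1))}
    frequent = [c for c in candidates if support(c) >= min_support]
    k += 1
```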