a new informative generic base of association rules

20
A new Informative Generic Base of Association Rules Gh. Gasmi, S. Ben Yahia, E. Mephu Ngui fo, and Y. Slimani PAKDD’05 Advisor Jia-Ling Koh Speaker Tsui-Feng Yen

Upload: wyanet

Post on 16-Jan-2016

20 views

Category:

Documents


0 download

DESCRIPTION

A new Informative Generic Base of Association Rules. Gh. Gasmi, S. Ben Yahia, E. Mephu Nguifo, and Y. Slimani PAKDD’05 Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen. Outline. Introduction Mathematical background - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A new Informative Generic Base of Association Rules

A new Informative Generic Base of Association Rules

Gh. Gasmi, S. Ben Yahia, E. Mephu Nguifo, and Y. Slimani

PAKDD’05

Advisor: Jia-Ling Koh Speaker: Tsui-Feng Yen

Page 2: A new Informative Generic Base of Association Rules

Outline

IntroductionMathematical backgroundRelated work on generic bases of asso

ciation rules -Work of Bastide et al. -Work of PhanNew generic baseConclusion

Page 3: A new Informative Generic Base of Association Rules

Introduction The problem of the relevance and the usefulness

of extracted association rules is becoming of primary importance, since an overwhelming number of association rules may be derived

In this paper, we introduce a novel generic base of association rules, The novel generic base is sound and informative

Page 4: A new Informative Generic Base of Association Rules

Mathematical backgroundBasic notions -Formal context

A formal context is a triplet K = (O,A,R), where O represents a finite set of objects (or transactions), A is a finite set of attributes and R is a binary relation (i.e., R ⊆ O×A).

Each couple (o, a) ∈R expresses that the

transaction o ∈ O contains the attribute a ∈ A, objects are denoted by numbers and attributes

by letters.

Page 5: A new Informative Generic Base of Association Rules

Mathematical background(conti)Basic notions -Formal context

define two functions: note: 找出現在這幾個 transaction 中 , 所有共同的出現的 ite

m

note: 找有出現 A 這個 itemset 的所有的 transactions -both compound operators of Φand Ψ are closure operators, i

n particular ω = Φ 。 Ψ is a closure operator.

Page 6: A new Informative Generic Base of Association Rules

Mathematical backgroundBasic notions -Frequent closed itemset :

Closed itemset note :沒有其他的 itemset 跟他的 support 一樣且又包含他的

- Formal concept:

- Minimal generator:

Page 7: A new Informative Generic Base of Association Rules

Mathematical background Basic notions

frequent closed itemset

C 、 AC 、 BE 、 BCE 、 ABCE

ex : AC=Φ(Ψ(AC))= Φ(1 、 3 、 5)=AC ,且 sup=3/5 minimal generator

A 、 B 、 C 、 E 、 AB 、 AE 、 BC 、 CE

Page 8: A new Informative Generic Base of Association Rules

Mathematical background Derivation of association rules : -Association rule R is a relation between itemsets of the for

m R : X⇒ (Y-X), in which X and Y are frequent itemsets, and X⊂ Y

- The valid association rules are those whose the strength metric conf(R)= , is greater than or equal to

the minimal threshold of confidence minconf.

-If conf(R)=1 then R is called exact association rule (ER), otherwise it is called approximative association rule (AR).

Page 9: A new Informative Generic Base of Association Rules

Related work The concept of rule redundancy can be consid

ered as follows :

Ex:R1 : A=>CD R: AB=>C

(X1) (Y1) (X) (Y)

Page 10: A new Informative Generic Base of Association Rules

Related work Work of Bastide et al.

In one paper, the couple (GBE,GBA) of generic bases form a sound and informative generic base.

In another ,the authors presented sound inference axioms for both (GBE and GBA) bases, permitting to derive all association rules from generic bases of association rules.

Page 11: A new Informative Generic Base of Association Rules

Related work Work of Bastide et al.

Page 12: A new Informative Generic Base of Association Rules

Work of Phan

Work of Phan

Page 13: A new Informative Generic Base of Association Rules

Work of Phan

For example:

∅ => AC, {A => AC, C=> AC, and A => C}

Page 14: A new Informative Generic Base of Association Rules

Drawbacks of Work of Phan the presented generic base is not informative, i.e., it

may exist a derivable rule for which it is impossible to determine exactly both its support and confidence

For example : the association rule BE=>C is derivable from the generic

rule E=>ABCE. However, it is impossible to derive the exact confidence and support of the derivable rule from the GBPhan generic base.

Page 15: A new Informative Generic Base of Association Rules

New Generic Base : IGB

Page 16: A new Informative Generic Base of Association Rules
Page 17: A new Informative Generic Base of Association Rules
Page 18: A new Informative Generic Base of Association Rules

New Generic Base : IGB

Proposition 3. The IGB generic base is informative.

proof:

-sufficient to show that it contains all the necessary information to determine the support of an itemset in a derived rule.

-means that we have to be able to reconstitute all closed itemset by concatenation of the premise and the conclusion of a generic rule

.

Page 19: A new Informative Generic Base of Association Rules

New Generic Base : IGB

-The algorithm considers all the discovered frequent closed itemsets. Hence for a given frequent

closed itemset, say c, it tries to find the smallest minimal generator, say gs, associated to frequent closed itemsets subsumed by c and fulfilling the minsup constraint

-Therefore, the algorithm generates the following ge

neric rule gs => c. Since gs⊂ c, then gs∪c=c.

Conclusion: The IGB generic base is informative.

Page 20: A new Informative Generic Base of Association Rules

Conclusion

presented a critical survey of the reported approaches for defining generic bases of association rules.

introduced a novel generic base, which is sound and informative.

provided a set of sound inference axioms for deriving all association rules from the introduced generic base of association rules.