formal concept analysis - tu dresden · what is a concept? formal concept analysis models concepts...

44
Formal Concept Analysis I Contexts, Concepts, and Concept Lattices Sebastian Rudolph Computational Logic Group Technische Universit¨ at Dresden slides based on a lecture by Prof. Gerd Stumme Sebastian Rudolph (TUD) Formal Concept Analysis 1 / 41

Upload: others

Post on 29-Aug-2019

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Formal Concept AnalysisI Contexts, Concepts, and Concept Lattices

Sebastian Rudolph

Computational Logic GroupTechnische Universitat Dresden

slides based on a lecture by Prof. Gerd Stumme

Sebastian Rudolph (TUD) Formal Concept Analysis 1 / 41

Page 2: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Agenda

1 Concept LatticesWhat is a concept?Formal ContextDerivation OperatorsFormal ConceptConcept LatticeComputing All ConceptsDrawing Concept LatticesClarifying and Reducing a Formal ContextInterlude: ConExpAdditive Line DiagramsNested Line Diagrams

Sebastian Rudolph (TUD) Formal Concept Analysis 2 / 41

Page 3: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

What is a concept?

Formal Concept Analysis models concepts as units of thought that consistof two parts:

The concept extent comprises all objects that belong to the concept.

The concept intent contains all attributes that all of the objects havein common.

FCA is used, amongst others, data analysis, information retrieval, datamining and software engineering.

Sebastian Rudolph (TUD) Formal Concept Analysis 3 / 41

Page 4: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

What is a concept?DIN 2330/ISO 704: Concepts and their Denomination

FCA is working on the conceptual layer. The representational layer playsonly a minor role.representational layer Denomination Definition

concept layer

conceptattribute aattribute battribute c

object layer

object 1

property Aproperty Bproperty Cproperty D

object 2

property Aproperty Bproperty Cproperty D

object 3

property Aproperty Bproperty Cproperty D

Sebastian Rudolph (TUD) Formal Concept Analysis 4 / 41

Page 5: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Formal Context

Def.: A formalcontext is a triplepG,M, Iq. where

G is a set ofobjects,

M is a set ofattributes, and

I is a relationbetween G andM .

We read pg,mq P I as“object g hasattribute m”.

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 5 / 41

Page 6: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Derivation Operators

For A Ď G we defineA1 :“ tm PM | @g PA : pg,mq P Iu.

For B ĎM we defineB1 :“ tg P G | @m P

B : pg,mq P Iu.

(X 1 is spoken“X prime”)

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 6 / 41

A

A1

Page 7: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Derivation Operators

For A Ď G we defineA1 :“ tm PM | @g PA : pg,mq P Iu.

For B ĎM we defineB1 :“ tg P G | @m P

B : pg,mq P Iu.

(X 1 is spoken“X prime”)

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 6 / 41

A

A1

Page 8: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Derivation Operators: Properties

For A,A1, A2 Ď G

A1 Ď A2 ñ

A12 Ď A11A Ď A2

A1 “ A3

holds.

For B,B1, B2 ĎM

B1 Ď B2 ñ

B12 Ď B11B Ď B2

B1 “ B3

holds.

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 7 / 41

A

A1

Page 9: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Formal Concept

Def.: A formalconcept is a pairpA,Bq with

A Ď G andB ĎM

A1 “ B

B1 “ A

A is the extent andB the intent of theconcept.

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 8 / 41

exte

nt

intent

Page 10: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Formal Concept

Lemma: pA,Bq is aformal concept iffA Ď G, B ĎM andA and B are bothmaximal with respectto AˆB Ď I.I.e., every conceptcorresponds to amaximal rectangle inthe relation I.

Def.: The set of allconcepts of pG,M, Iqis depicted asBpG,M, Iq.

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 9 / 41

exte

nt

intent

Page 11: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Formal Concept: Subconcept and Superconcept

The blue concept is asubconcept of theyellow conceptbecause

the blue extent iscontained in theyellow extent

(ô the yellowintent iscontained in theblue intent)

Def.:pA1, B1q ď pA2, B2q

:ô A1 Ď A2

pô B1 Ě B2q

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Sebastian Rudolph (TUD) Formal Concept Analysis 10 / 41

Page 12: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice(Recapitulation: Partial Order)

Def. (recap.): pA1, B1q ď pA2, B2q :ô A1 Ď A2pô B1 Ě B2q

Def.: The set of all concepts BpG,M, Iq together with the partial orderď is the concept lattice of the context pG,M, Iq and is depicted withBpG,M, Iq.

On the blackboard:

definition of partial order

definition of total order

examples

Sebastian Rudolph (TUD) Formal Concept Analysis 11 / 41

Page 13: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice: as Line Diagram

The conceptlattice for thenational parkcontext.

National Parksin California

NP

SG

uid

edT

ours

Hik

ing

Hor

seb

ack

Rid

ing

Sw

imm

ing

Boa

tin

g

Fis

hin

g

Bic

ycle

Tra

il

Cro

ssC

oun

try

Tra

il

Cabrillo Natl. Mon. ˆ ˆ

Channel Islands Natl. Park ˆ ˆ ˆ

Death Valley Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Devils Postpile Natl. Mon. ˆ ˆ ˆ ˆ ˆ

Fort Point Natl. Historic Site ˆ ˆ

Golden Gate Natl. Recreation Area ˆ ˆ ˆ ˆ ˆ ˆ

John Muir Natl. Historic Site ˆ

Joshua Tree Natl. Mon. ˆ ˆ ˆ

Kings Canyon Natl. Park ˆ ˆ ˆ ˆ ˆ

Lassen Volcanic Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Lava Beds Natl. Mon. ˆ ˆ

Muir Woods Natl. Mon. ˆ

Pinnacles Natl. Mon. ˆ

Point Reyes Natl. Seashore ˆ ˆ ˆ ˆ ˆ ˆ

Redwood Natl. Park ˆ ˆ ˆ ˆ ˆ

Santa Monica Mts. Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Sequoia Natl. Park ˆ ˆ ˆ ˆ ˆ

Whiskeytown-Shasta-Trinity Natl. Recr. Area ˆ ˆ ˆ ˆ ˆ ˆ

Yosemite Natl. Park ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ

Devils Postpile

Death Valley

Fort Point

JohnMuir

Cabrillo

ChannelIslands

Golden Gate

Kings Canyon

Joshuas Tree

Lassen Volcanic

Cross Country

Ski Trail

Boating

Fishing

NPS Guided Tours

Hiking

Point Rayes

Sequoia

Yosemite

Horseback Riding

Lava Beds

PinnaclesMuir Woods

Whiskeytown-Shasta-Trinity

Santa Monica Mountains

Bicycle Trail

Swimming

Redwood

Sebastian Rudolph (TUD) Formal Concept Analysis 12 / 41

Page 14: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice: Implications (Preview)

Def.: An implication X Ñ Yholds in a context, if everyobject that has all attributesfrom X also has all attributesfrom Y .

Examples:

Devils Postpile

Death Valley

Fort Point

JohnMuir

Cabrillo

ChannelIslands

Golden Gate

Kings Canyon

Joshuas Tree

Lassen Volcanic

Cross Country

Ski Trail

Boating

Fishing

NPS Guided Tours

Hiking

Point Rayes

Sequoia

Yosemite

Horseback Riding

Lava Beds

PinnaclesMuir Woods

Whiskeytown-Shasta-Trinity

Santa Monica Mountains

Bicycle Trail

Swimming

Redwood

{Swimming} Ñ {Hiking}{Boating} Ñ {Swimming, Hiking, NPS Guided Tours, Fishing, Horseback Riding}{Bicycle Trail, NPS Guided Tours} Ñ {Swimming, Hiking, Horseback Riding}

Sebastian Rudolph (TUD) Formal Concept Analysis 13 / 41

Page 15: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice: Dual Context

Def.: Let pG,M, Iq be acontext. Then pM,G, I´1qwithpm, gq P I´1 ðñ pg,mq P Iis the dual context ofpG,M, Iq.

Theorem: Its concept latticeis isomorphic topBpG,M, Iq,ďq.

Remark: In general, G andM need not be disjunct, theycan even be identical.

Devils Postpile

Death Valley

Fort Point

JohnMuir

Cabrillo

ChannelIslands

Golden Gate

Kings Canyon

Joshuas Tree

Lassen Volcanic

Cross Country

Ski Trail

Boating

Fishing

NPS Guided Tours

Hiking

Point Rayes

Sequoia

Yosemite

Horseback Riding

Lava Beds

PinnaclesMuir Woods

Whiskeytown-Shasta-Trinity

Santa Monica Mountains

Bicycle Trail

Swimming

Redwood

Sebastian Rudolph (TUD) Formal Concept Analysis 14 / 41

Page 16: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept LatticeRecapitulation: Lattices

On the blackboard:

lower bound, upper bound

infimum (meet,Ź

), supremum (join,Ž

)

Lemma: For two formal concepts pA1, B1q, pA2, B2q we get§ the infimum pA1, B1q ^ pA2, B2q as pA1 XA2, pB1 YB2q

2q

§ the supremum pA1, B1q _ pA2, B2q as ppA1 YA2q2, B1 XB2q

Def. (complete) lattice pV,ďq

0V , 1V

Sebastian Rudolph (TUD) Formal Concept Analysis 15 / 41

Page 17: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice: The Basic Theorem on Concept Lattices

On the blackboard:

supremum/infimumreducible, irreducible,dense

isomorphisms oflattices

Basic TheoremDevils Postpile

Death Valley

Fort Point

JohnMuir

Cabrillo

ChannelIslands

Golden Gate

Kings Canyon

Joshuas Tree

Lassen Volcanic

Cross Country

Ski Trail

Boating

Fishing

NPS Guided Tours

Hiking

Point Rayes

Sequoia

Yosemite

Horseback Riding

Lava Beds

PinnaclesMuir Woods

Whiskeytown-Shasta-Trinity

Santa Monica Mountains

Bicycle Trail

Swimming

Redwood

Sebastian Rudolph (TUD) Formal Concept Analysis 16 / 41

Page 18: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Concept Lattice: The Duality Principle

Let pV,ďq be a (complete) lattice. Then pV,ěq is also a (complete)lattice.

(cf. with the definition of the dual context)

If a theorem holds for (complete) lattices, then the ‘dual theorem’also holds, i.e., the theorem where all occurrences ofď,X,Y,^,_,1V ,0V , etc. have been replaced byě,Y,X,_,^,0V ,1V , etc.

Sebastian Rudolph (TUD) Formal Concept Analysis 17 / 41

Page 19: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts

There are several algorithms to compute all concepts:

naive approach

intersection method

Next-Closure (Ganter 1984) Þ Chapter 3

Titanic (Stumme et al. 2001) Þ Chapter 3

and several incremental algorithms

Sebastian Rudolph (TUD) Formal Concept Analysis 18 / 41

Page 20: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts: Naive Approach

Theorem

Each concept of a context pG,M, Iq has the form pX2, X 1q for somesubset X Ď G and pY 1, Y 2q for some subset Y ĎM .Conversely, all such pairs are concepts.

Algorithm

Determine for every subset Y of M the pair pY 1, Y 2q.

Inefficient! (Too) many concepts are generated multiple times.

Sebastian Rudolph (TUD) Formal Concept Analysis 19 / 41

Page 21: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts: Naive Approach

Theorem

Each concept of a context pG,M, Iq has the form pX2, X 1q for somesubset X Ď G and pY 1, Y 2q for some subset Y ĎM .Conversely, all such pairs are concepts.

Algorithm

Determine for every subset Y of M the pair pY 1, Y 2q.

Inefficient! (Too) many concepts are generated multiple times.

Sebastian Rudolph (TUD) Formal Concept Analysis 19 / 41

Page 22: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts: Intersection Method

Suitable for manual computation (Wille 1982)

Best worst-case time complexity (Nourine, Raynoud 1999)

Based on the following

Theorem

Every extent is the intersection of attribute extents. (I.e., the closuresystem of all extents is generated by the attribute extents.)

Which intersections of attribute extents should we take?

On the blackboard: attribute extent tmu1, closure system

Sebastian Rudolph (TUD) Formal Concept Analysis 20 / 41

Page 23: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts: Intersection Method

How to determine all formal concepts of a formal context:

1 For each attribute m PM compute the attribute extent tmu1.

2 For any two sets in this list, compute their intersection. If it is not yetcontained in the list, add it.

3 Repeat until no new extents are generated.

4 If G is not yet contained in the list, add it.

5 For every extent A in the list compute the corresponding intent A1.

Sebastian Rudolph (TUD) Formal Concept Analysis 21 / 41

Page 24: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Computing All Concepts: Intersection Method

On the blackboard: “triangle” example

Triangles

abbreviation coordinates diagram

T1 (0,0) (6,0) (3,1)

T2 (0,0) (1,0) (1,1)

T3 (0,0) (4,0) (1,2)

T4 (0,0) (2,0) (1,√3)

T5 (0,0) (2,0) (5,1)

T6 (0,0) (2,0) (1,3)

T7 (0,0) (2,0) (0,1)

Attributes

symbol property

a equilateralb isocelesc acute angledd obtuse anglede right angled

a b c d e

T1 × ×T2 × ×T3 ×T4 × × ×T5 ×T6 × ×T7 ×

Sebastian Rudolph (TUD) Formal Concept Analysis 22 / 41

Page 25: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Drawing Concept Lattices

How to draw a concept lattice by hand:

1 Draw a small circle for the extent G at the top.

2 For every extent (starting with the one’s with the most elements)draw a small circle and connect it with the lowest circle(s) whoseextent contains the current extent.

3 Every attribute is written slightly above the circle of its attributeextent.

4 Every object is written slightly below the circle that is exactly belowthe circles that are labeled with the attributes of the object.

Sebastian Rudolph (TUD) Formal Concept Analysis 23 / 41

Page 26: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Drawing Concept Lattices

How you can check the drawn diagram:

1 Is it really a lattice? (that’s often skipped)

2Is every concept with exactly one upper neighborlabeled with at least one attribute?

3Is every concept with exactly one lower neighborlabeled with at least one object?

4

Is for every g P G and m P M the label of theobject g below the label of the attribute m iffpg,mq P I holds?

Sebastian Rudolph (TUD) Formal Concept Analysis 24 / 41

Page 27: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Clarifying and Reducing a Formal Context

On the blackboard:

proper subconcept (ă)

lower neighbor (ă)

reducible objects/attributes

clarifying and reducing

reduced context, standard context

Theorem

A finite context and its reduced context have isomorphic concept lattices.For every finite lattice L there is (up to isomorphism) exactly one reducedcontext, the concept lattice of which is isomorphic to L, namely itsstandard context.

Sebastian Rudolph (TUD) Formal Concept Analysis 25 / 41

Page 28: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Interlude: ConExp

Sebastian Rudolph (TUD) Formal Concept Analysis 26 / 41

Page 29: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Interlude: ConExp

Sebastian Rudolph (TUD) Formal Concept Analysis 26 / 41

Page 30: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Additive Line Diagrams

Def.: An attribute m PM is called irreducible, if there is no set X ofattributes with m R X such that tmu1 “

Ş

xPXtxu1.

The set of irreducible attributes is depicted as Mirr.

We define the map irr : BpG,M, Iq Ñ PpMirrq as

irrpA,Bq :“ tm P B | m irreducibleu.

Let vec : Mirr Ñ Rˆ Ră0. Then

pos : BpG,M, Iq Ñ R2 with pospA,Bq :“ÿ

mPirrpA,Bq

vecpmq

is an additive line diagram of the concept lattice BpG,M, Iq.

Sebastian Rudolph (TUD) Formal Concept Analysis 27 / 41

Page 31: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Additive Line Diagrams

An additive line diagram ofthe triangles context.

The position of the attributeconcepts defines the positionof all remaining concepts. Ifwe consider the distancebetween 1B and the attributeextents as vectors, then theposition of any concept isequal to the sum of thevectors that belong to itsconcept intent.

equilateral

isoceles

acute angled

obtuse angled

right angled

Sebastian Rudolph (TUD) Formal Concept Analysis 28 / 41

Page 32: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Additive Line Diagrams

Sebastian Rudolph (TUD) Formal Concept Analysis 29 / 41

Page 33: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Motivation and Idea

readability of line diagrams oftenlost for many concepts (Ç 50)

nested line diagrams allow us togo further

and: support the visualization ofchanges caused by the additionof further attributes

basic idea: cluster parts of anordinary diagram and replacebundles of parallel lines betweenthese parts by one line each

Sebastian Rudolph (TUD) Formal Concept Analysis 30 / 41

Page 34: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Example

The previous concept lattice as ordinary line diagram and as nested linediagram. (For simplification, object and attribute labels have beenomitted.)

Sebastian Rudolph (TUD) Formal Concept Analysis 31 / 41

Page 35: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams

a nested line diagram consists ofboxes which contain parts of theordinary diagram and which areconnected by lines

simplest case: two boxes thatare connected by a line arecongruent Þ correspondingcircles are direct neighbors

double lines between two boxes:every element of the upper boxis larger than every element ofthe lower box

Sebastian Rudolph (TUD) Formal Concept Analysis 32 / 41

Page 36: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams

two boxes connected by a singleline need not be congruent butcontain a part of two congruentfigures

the two congruent figures aredrawn as “backgroundstructure” into the boxes

elements are drawn as boldcircles if they are part of therespective substructure

the line connecting both boxesindicates that the respectivepairs of elements of thebackground shall be connectedwith each other

Sebastian Rudolph (TUD) Formal Concept Analysis 33 / 41

Page 37: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Drawing ExampleDie Ducks. Psychogramm einer Sippe.

generation sex financial statusolder middle younger D B rich carefree indebted

Tick ˆ ˆ ˆ

Trick ˆ ˆ ˆ

Track ˆ ˆ ˆ

Donald ˆ ˆ ˆ

Daisy ˆ ˆ ˆ

Gustav ˆ ˆ ˆ

Dagobert ˆ ˆ ˆ

Annette ˆ ˆ ˆ

Primusv. Quack

ˆ ˆ ˆ

Taken from: Grobian Gans: Die Ducks. Psychogramm einer Sippe.Rowohlt, Reinbek bei Hamburg 1972, ISBN 3-499-11481-X

Sebastian Rudolph (TUD) Formal Concept Analysis 34 / 41

Page 38: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Construction

1 split the attribute set: M “M1 YM2

(needs not be disjoint, more important: both sets bear meaning)

2 draw the line diagrams of the subcontexts

Ki :“ pG,Mi, I XGˆMiq, i P t1, 2u

and label them with with objects and attributes, as usual

On the blackboard: Theorem 2 (script, p. 35)

3 sketch a nested diagram of the product of the concept lattices BpKiq

1 draw a large diagram of BpK1q where the concepts are large boxes2 draw a copy of BpK2q into each box

Sebastian Rudolph (TUD) Formal Concept Analysis 35 / 41

Page 39: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Labeling

by Theorem 2, BpG,M, Iq is embedded in this product asŽ

-semilattice

if a list of elements of BpG,M, Iq exists, enter them according totheir intents

otherwise, enter the object concepts (whose intents can be read offdirectly from the context) and form all suprema

This gives us another method for determining a concept lattice by hand:

split up the attribute set as appropriate

determine the (small) concept lattices of the subcontexts

draw their product as nested line diagram

enter the object concepts and close against suprema

This is particularly advisable in order to arrive at a useful diagram quickly.

Sebastian Rudolph (TUD) Formal Concept Analysis 36 / 41

Page 40: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: ExampleBaurecht in Nordrhein-Westfalen

Taken from: D. Eschenfelder, W.Kollewe, M. Skorsky, R. Wille: EinErkundungssystem zum Baurecht:Methoden der Entwicklung einesTOSCANA-Systems. In: G. Stumme,R. Wille (Eds.): BegrifflicheWissensverarbeitung – Methodenund Anwendungen. Springer 2000

Sebastian Rudolph (TUD) Formal Concept Analysis 37 / 41

Page 41: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: ExampleBaurecht in Nordrhein-Westfalen

Sebastian Rudolph (TUD) Formal Concept Analysis 38 / 41

Page 42: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: ExampleBaurecht in Nordrhein-Westfalen

Sebastian Rudolph (TUD) Formal Concept Analysis 39 / 41

Page 43: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: ExampleBaurecht in Nordrhein-Westfalen

Sebastian Rudolph (TUD) Formal Concept Analysis 40 / 41

Page 44: Formal Concept Analysis - TU Dresden · What is a concept? Formal Concept Analysis models concepts as units of thought that consist of two parts: The concept extent comprises all

Nested Line Diagrams: Reading off Implications

implications within the inner scale can be read off at the top concept:

tTreppenu ñ tTreppenraumu

implications within the outer scale can be read off at it:

tWandu ñ tBrandwandu

tDecke, Brandwandu ñ tWand, Brandwandu

tDecke, Fundamentu ñ t?u

implications between the inner and the outer scale are shown by “notrealized” concepts: premise “ intent of the not-realized concept,conclusion “ intent of the largest realized subconcept:

tDecke, Kellerfußbodenu ñ tTreppenraumu

tTreppenraum, Schornsteinu ñ tDecke, Wand, Brandwand, Dachu

tFundamentu ñ t?u

tWand, Dach, Schornsteinu ñ t?u

Sebastian Rudolph (TUD) Formal Concept Analysis 41 / 41