data visualisation using topographic mappings

53
Data Visualisation using Topographic Mappings Colin Fyfe The University of Paisley, Scotland

Upload: catherine-mccall

Post on 02-Jan-2016

40 views

Category:

Documents


4 download

DESCRIPTION

Data Visualisation using Topographic Mappings. Colin Fyfe The University of Paisley, Scotland. Outline. Topographic clustering. Topographic Product of Experts, ToPoE Simulations Products and mixtures of experts. Harmonic Topographic Mapping, HaToM 2 Varieties of HaToM. Trunk. Head. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Data Visualisation using Topographic Mappings

Data Visualisation using Topographic Mappings

Colin Fyfe

The University of Paisley,

Scotland

Page 2: Data Visualisation using Topographic Mappings

Outline

• Topographic clustering.

• Topographic Product of Experts, ToPoE

• Simulations

• Products and mixtures of experts.

• Harmonic Topographic Mapping, HaToM

• 2 Varieties of HaToM

Page 3: Data Visualisation using Topographic Mappings
Page 4: Data Visualisation using Topographic Mappings

The somatosensory homunculus

• Larger area of cortex for more sensitive body parts.

Lower lip

Upper lipFace

Eye

Thumb

GenitalsToesFoot

LegTrunkHeadArmHand

LittleRing

MiddleIndex

Thumb

Pharynx

Page 5: Data Visualisation using Topographic Mappings

Kohonen’s Self-organizing Map

• camtasia\som.html

Page 6: Data Visualisation using Topographic Mappings

Orientation selectivity

Page 7: Data Visualisation using Topographic Mappings

Topology preservation

• Data space Feature Space

• Nearby Nearby

• Distant Distant (SOM)

• Nearby Nearby (SOM)

• Distant Distant

Page 8: Data Visualisation using Topographic Mappings

The Model

• K latent points in a latent space with some structure.

• Each mapped through M basis functions to feature space.

• Then mapped to data space to K points in data space using W matrix (M by D)

• Aim is to fit model to data to make data as likely as possible by adjusting W

Page 9: Data Visualisation using Topographic Mappings

Mental Model

• 1 2 3 4 5 6 ……

Page 10: Data Visualisation using Topographic Mappings

Details

• t1, t2, t3, … ,tK (points in latent space)

• f1(), f2(), …, fM() (basis functions creating feature space)

• Matrix Φ (K by M), where φkm = fm(tk), projections of latent points to feature space.

• Matrix W (M by D) so that ΦW maps latent points to data space. tk mk

Page 11: Data Visualisation using Topographic Mappings

Products of Gaussian Experts

)||||2

exp()(

))||||(2

exp()(

2

1

2

1

nk

K

kn

nk

K

kn

xmxp

xmxp

Page 12: Data Visualisation using Topographic Mappings

Maximise the likelihood of the data under the model

M

mkmmd

kd

kd

K

k

ndkmmdn

wm

mxw

1

)(

)(

1

)( )(

K

kknn mxxp

1

2))(log(

Page 13: Data Visualisation using Topographic Mappings

Using Responsibilities

M

mkmmd

kd

knkd

K

k

ndkmmdn

wm

rmxw

1

)(

)(

1

)( )(

||||||||

)exp(

)exp(

1

2

2

jnjm

M

mmnjn

jjn

knkn

mxwxd

d

dr

Page 14: Data Visualisation using Topographic Mappings
Page 15: Data Visualisation using Topographic Mappings
Page 16: Data Visualisation using Topographic Mappings

Comparison with GTM

)||||2

exp()2(

1)|()()( 22

kn

D

kknn mx

Kkxpkpxp

))||||(2

exp()2()( 2

1

2knnk

K

k

D

n rxmxp

t nt

nkknxxk d

dr

n )exp(

)exp(2

2

|

Page 17: Data Visualisation using Topographic Mappings

Growing ToPoEs

Page 18: Data Visualisation using Topographic Mappings

Demonstration

Page 19: Data Visualisation using Topographic Mappings
Page 20: Data Visualisation using Topographic Mappings

Advantages

• Growing : need only change Φ which goes from K by M to (K+1) by M.

• W is approximately correct and just refines its learning.

• Pruning uses the responsibility: if a latent point is never the most responsible point for any data point, remove it.

• Keep all other points at their positions in latent space and keep training.

Page 21: Data Visualisation using Topographic Mappings

)||(2

exp(1

)( 11

knnk

K

kn rxm

Zxp

Page 22: Data Visualisation using Topographic Mappings

Twinned ToPoEs

• Single underlying latent space

• Single Φ

• Two sets of weights

• Single responsibility – distance between current projections and both data points.

Page 23: Data Visualisation using Topographic Mappings

Twinned ToPoEs - 2

Page 24: Data Visualisation using Topographic Mappings

Using φkm=tanh(mtk)

Page 25: Data Visualisation using Topographic Mappings

Products of Experts

Page 26: Data Visualisation using Topographic Mappings
Page 27: Data Visualisation using Topographic Mappings
Page 28: Data Visualisation using Topographic Mappings

Responsibilities with Tanh()

1

19

37

55

S1

S1

1

0

0.2

0.4

0.6

0.8

1

Page 29: Data Visualisation using Topographic Mappings
Page 30: Data Visualisation using Topographic Mappings
Page 31: Data Visualisation using Topographic Mappings

A close up

Page 32: Data Visualisation using Topographic Mappings
Page 33: Data Visualisation using Topographic Mappings
Page 34: Data Visualisation using Topographic Mappings
Page 35: Data Visualisation using Topographic Mappings
Page 36: Data Visualisation using Topographic Mappings
Page 37: Data Visualisation using Topographic Mappings

Harmonic Averages

• Walk d km at 5 km/h, then d km at 10 km/h• Total time = d/5 +d/10• Average Speed = 2d/(d/5+d/10)=

• Harmonic Average =

10

1

5

12

K

k ka

K

1

1

Page 38: Data Visualisation using Topographic Mappings

K-Harmonic Means beats K-Means and MoG using EM

• Perf =

N

iK

kki mx

K

1

12

1

N

iK

l ilik

ki

k

dd

mxK

m

Perf

1

1

22

4 )1

(

)(4

N

iK

l ilik

N

iiK

l ilik

k

dd

x

dd

m

1

1

22

4

1

1

22

4

)1

(

1

)1

(

1

Page 39: Data Visualisation using Topographic Mappings

Growing Harmony Topology Preservation

• Initialise K to 2. Init W randomly.

1. Init K latent points and M basis functions.

2. Calculate mk=φkW, k=1,…,K.

1. Calculate dik, i=1,…,N, k=1,…,K

2. Re-calculate mk, k=1,…,K. (Harmonic alg.)

3. If more, go back to 1.

3. Re-calculate W=(ΦTΦ+γI)-1ΦTШ

4. K= K + 1. If more, go back to 1.

Page 40: Data Visualisation using Topographic Mappings

Disadvantages-Advantages ?

• Don’t have special rules for points for which no latent point takes responsibility.

• But must grow otherwise twists.

• Independent of initialisation ?

• Computational cost ?

Page 41: Data Visualisation using Topographic Mappings

Generalised K-Harmonic Meansfor Automatic Boosting

• Perf =

N

iK

kp

ki mx

K

1

1

1

N

iK

lpil

pik

ki

k

dd

mxpK

m

Perf

1

1

21 )1

(

)(2

N

iK

lpil

pik

N

iiK

lpil

pik

k

dd

x

dd

m

1

1

22

1

1

22

)1

(

1

)1

(

1

Page 42: Data Visualisation using Topographic Mappings
Page 43: Data Visualisation using Topographic Mappings
Page 44: Data Visualisation using Topographic Mappings
Page 45: Data Visualisation using Topographic Mappings
Page 46: Data Visualisation using Topographic Mappings
Page 47: Data Visualisation using Topographic Mappings
Page 48: Data Visualisation using Topographic Mappings
Page 49: Data Visualisation using Topographic Mappings
Page 50: Data Visualisation using Topographic Mappings

Two versions of HaToM

• D-HaToM (Data driven HaToM) :– W and m change only when adding a new

latent point– Allows the data to influence more the

clustering

• M-HaToM (Model driven HaToM) :– W and m change in every iteration– The data is continually constrained by the

model

Page 51: Data Visualisation using Topographic Mappings

Simulations(1): 1D dataset

K=2 K=4 K=8

K=20

M-HaToMD-HaToM

Page 52: Data Visualisation using Topographic Mappings

Simulations(3): algae dataset

D-HaToM

M-HaToM

Wider rij Narrower rij

Page 53: Data Visualisation using Topographic Mappings

Conclusion

• New forms of topographic mapping.

• Based on latent space concept but– free from probabilistic constraints.

• Product Mixture of experts.– automatic setting of local variances.

• Two types based on K-harmonic Means

• Very sensitive to data, or not, as required