Transcript
Page 1: Data Visualisation using Topographic Mappings

Data Visualisation using Topographic Mappings

Colin Fyfe

The University of Paisley,

Scotland

Page 2: Data Visualisation using Topographic Mappings

Outline

• Topographic clustering.

• Topographic Product of Experts, ToPoE

• Simulations

• Products and mixtures of experts.

• Harmonic Topographic Mapping, HaToM

• 2 Varieties of HaToM

Page 3: Data Visualisation using Topographic Mappings
Page 4: Data Visualisation using Topographic Mappings

The somatosensory homunculus

• Larger area of cortex for more sensitive body parts.

Lower lip

Upper lipFace

Eye

Thumb

GenitalsToesFoot

LegTrunkHeadArmHand

LittleRing

MiddleIndex

Thumb

Pharynx

Page 5: Data Visualisation using Topographic Mappings

Kohonen’s Self-organizing Map

• camtasia\som.html

Page 6: Data Visualisation using Topographic Mappings

Orientation selectivity

Page 7: Data Visualisation using Topographic Mappings

Topology preservation

• Data space Feature Space

• Nearby Nearby

• Distant Distant (SOM)

• Nearby Nearby (SOM)

• Distant Distant

Page 8: Data Visualisation using Topographic Mappings

The Model

• K latent points in a latent space with some structure.

• Each mapped through M basis functions to feature space.

• Then mapped to data space to K points in data space using W matrix (M by D)

• Aim is to fit model to data to make data as likely as possible by adjusting W

Page 9: Data Visualisation using Topographic Mappings

Mental Model

• 1 2 3 4 5 6 ……

Page 10: Data Visualisation using Topographic Mappings

Details

• t1, t2, t3, … ,tK (points in latent space)

• f1(), f2(), …, fM() (basis functions creating feature space)

• Matrix Φ (K by M), where φkm = fm(tk), projections of latent points to feature space.

• Matrix W (M by D) so that ΦW maps latent points to data space. tk mk

Page 11: Data Visualisation using Topographic Mappings

Products of Gaussian Experts

)||||2

exp()(

))||||(2

exp()(

2

1

2

1

nk

K

kn

nk

K

kn

xmxp

xmxp

Page 12: Data Visualisation using Topographic Mappings

Maximise the likelihood of the data under the model

M

mkmmd

kd

kd

K

k

ndkmmdn

wm

mxw

1

)(

)(

1

)( )(

K

kknn mxxp

1

2))(log(

Page 13: Data Visualisation using Topographic Mappings

Using Responsibilities

M

mkmmd

kd

knkd

K

k

ndkmmdn

wm

rmxw

1

)(

)(

1

)( )(

||||||||

)exp(

)exp(

1

2

2

jnjm

M

mmnjn

jjn

knkn

mxwxd

d

dr

Page 14: Data Visualisation using Topographic Mappings
Page 15: Data Visualisation using Topographic Mappings
Page 16: Data Visualisation using Topographic Mappings

Comparison with GTM

)||||2

exp()2(

1)|()()( 22

kn

D

kknn mx

Kkxpkpxp

))||||(2

exp()2()( 2

1

2knnk

K

k

D

n rxmxp

t nt

nkknxxk d

dr

n )exp(

)exp(2

2

|

Page 17: Data Visualisation using Topographic Mappings

Growing ToPoEs

Page 18: Data Visualisation using Topographic Mappings

Demonstration

Page 19: Data Visualisation using Topographic Mappings
Page 20: Data Visualisation using Topographic Mappings

Advantages

• Growing : need only change Φ which goes from K by M to (K+1) by M.

• W is approximately correct and just refines its learning.

• Pruning uses the responsibility: if a latent point is never the most responsible point for any data point, remove it.

• Keep all other points at their positions in latent space and keep training.

Page 21: Data Visualisation using Topographic Mappings

)||(2

exp(1

)( 11

knnk

K

kn rxm

Zxp

Page 22: Data Visualisation using Topographic Mappings

Twinned ToPoEs

• Single underlying latent space

• Single Φ

• Two sets of weights

• Single responsibility – distance between current projections and both data points.

Page 23: Data Visualisation using Topographic Mappings

Twinned ToPoEs - 2

Page 24: Data Visualisation using Topographic Mappings

Using φkm=tanh(mtk)

Page 25: Data Visualisation using Topographic Mappings

Products of Experts

Page 26: Data Visualisation using Topographic Mappings
Page 27: Data Visualisation using Topographic Mappings
Page 28: Data Visualisation using Topographic Mappings

Responsibilities with Tanh()

1

19

37

55

S1

S1

1

0

0.2

0.4

0.6

0.8

1

Page 29: Data Visualisation using Topographic Mappings
Page 30: Data Visualisation using Topographic Mappings
Page 31: Data Visualisation using Topographic Mappings

A close up

Page 32: Data Visualisation using Topographic Mappings
Page 33: Data Visualisation using Topographic Mappings
Page 34: Data Visualisation using Topographic Mappings
Page 35: Data Visualisation using Topographic Mappings
Page 36: Data Visualisation using Topographic Mappings
Page 37: Data Visualisation using Topographic Mappings

Harmonic Averages

• Walk d km at 5 km/h, then d km at 10 km/h• Total time = d/5 +d/10• Average Speed = 2d/(d/5+d/10)=

• Harmonic Average =

10

1

5

12

K

k ka

K

1

1

Page 38: Data Visualisation using Topographic Mappings

K-Harmonic Means beats K-Means and MoG using EM

• Perf =

N

iK

kki mx

K

1

12

1

N

iK

l ilik

ki

k

dd

mxK

m

Perf

1

1

22

4 )1

(

)(4

N

iK

l ilik

N

iiK

l ilik

k

dd

x

dd

m

1

1

22

4

1

1

22

4

)1

(

1

)1

(

1

Page 39: Data Visualisation using Topographic Mappings

Growing Harmony Topology Preservation

• Initialise K to 2. Init W randomly.

1. Init K latent points and M basis functions.

2. Calculate mk=φkW, k=1,…,K.

1. Calculate dik, i=1,…,N, k=1,…,K

2. Re-calculate mk, k=1,…,K. (Harmonic alg.)

3. If more, go back to 1.

3. Re-calculate W=(ΦTΦ+γI)-1ΦTШ

4. K= K + 1. If more, go back to 1.

Page 40: Data Visualisation using Topographic Mappings

Disadvantages-Advantages ?

• Don’t have special rules for points for which no latent point takes responsibility.

• But must grow otherwise twists.

• Independent of initialisation ?

• Computational cost ?

Page 41: Data Visualisation using Topographic Mappings

Generalised K-Harmonic Meansfor Automatic Boosting

• Perf =

N

iK

kp

ki mx

K

1

1

1

N

iK

lpil

pik

ki

k

dd

mxpK

m

Perf

1

1

21 )1

(

)(2

N

iK

lpil

pik

N

iiK

lpil

pik

k

dd

x

dd

m

1

1

22

1

1

22

)1

(

1

)1

(

1

Page 42: Data Visualisation using Topographic Mappings
Page 43: Data Visualisation using Topographic Mappings
Page 44: Data Visualisation using Topographic Mappings
Page 45: Data Visualisation using Topographic Mappings
Page 46: Data Visualisation using Topographic Mappings
Page 47: Data Visualisation using Topographic Mappings
Page 48: Data Visualisation using Topographic Mappings
Page 49: Data Visualisation using Topographic Mappings
Page 50: Data Visualisation using Topographic Mappings

Two versions of HaToM

• D-HaToM (Data driven HaToM) :– W and m change only when adding a new

latent point– Allows the data to influence more the

clustering

• M-HaToM (Model driven HaToM) :– W and m change in every iteration– The data is continually constrained by the

model

Page 51: Data Visualisation using Topographic Mappings

Simulations(1): 1D dataset

K=2 K=4 K=8

K=20

M-HaToMD-HaToM

Page 52: Data Visualisation using Topographic Mappings

Simulations(3): algae dataset

D-HaToM

M-HaToM

Wider rij Narrower rij

Page 53: Data Visualisation using Topographic Mappings

Conclusion

• New forms of topographic mapping.

• Based on latent space concept but– free from probabilistic constraints.

• Product Mixture of experts.– automatic setting of local variances.

• Two types based on K-harmonic Means

• Very sensitive to data, or not, as required


Top Related