cluster1 291013

51
Cluster Analysis What is cluster analysis? Why use cluster analysis? Nomenclature Strengths and weaknesses 1

Upload: ddahly

Post on 17-May-2015

193 views

Category:

Business


2 download

TRANSCRIPT

Page 1: Cluster1 291013

1

Cluster Analysis

What is cluster analysis?

Why use cluster analysis?

Nomenclature

Strengths and weaknesses

Page 2: Cluster1 291013
Page 3: Cluster1 291013
Page 4: Cluster1 291013
Page 5: Cluster1 291013

5

What cluster analysis does

• Identifies subgroups in a population based on some set of characteristics

Page 6: Cluster1 291013

6

Some ~ Synonyms

• Cluster analysis• Unsupervised learning• Unsupervised classification• Automatic classification• Numerical taxonomy• Typological analysis

Page 7: Cluster1 291013

7

Types of cluster analysis

• Connectivity models: Hierarchical clustering • Centroid models: K-means algorithm

• Distribution models: Latent Class Models

• Density models

• Subspace models

• Graph-based models

Page 8: Cluster1 291013

8

Heirarchical

Page 9: Cluster1 291013

9

Page 10: Cluster1 291013

10

K Means

Page 11: Cluster1 291013

11

Finite mixture model

Page 12: Cluster1 291013

12

Density

Page 13: Cluster1 291013

13

Model based cluster analysis

• Distributional• Latent class• Latent profile• Finite mixture model• Gaussian mixture model

Page 14: Cluster1 291013

14

Hard vs. Fuzzy Clustering

Page 15: Cluster1 291013

15

Strengths

• Empirical (Data driven)• Handles complexity better than linear model• Better suited for some theoretical tasks

Page 16: Cluster1 291013

16

Limitations

• Subjective• Largely data-driven (empirical)• Computationally intensive• Variable selection • Model selection (how many clusters?)• Limited options for validation

Page 17: Cluster1 291013

17

RESULTS

Page 18: Cluster1 291013

18

Page 19: Cluster1 291013

19

SIMPLE STUNTING MODELS/ss

These models only include binary measures of stunting at each of 4 time points.

Page 20: Cluster1 291013

20The prevalence of stunting at each age.

Page 21: Cluster1 291013

21

Page 22: Cluster1 291013

22These make the most sense. They also suggest that stunting at 24 and 249 will tell most of the

story.

Page 23: Cluster1 291013

23These make sense, with group 3 as a “leftovers” group.

Page 24: Cluster1 291013

24Nonsensical group

Page 25: Cluster1 291013

25Nonsensical groups and/or trivial differences.

Page 26: Cluster1 291013

26Nonsensical groups and/or trivial differences.

Page 27: Cluster1 291013

27Nonsensical groups and/or trivial differences.

Page 28: Cluster1 291013

28Nonsensical groups and/or trivial differences.

Page 29: Cluster1 291013

29Dramatic improvement in model fit from the 2 to 3-class model. Minima for all three fit

measures found in the 4-class model.

Page 30: Cluster1 291013

30Dramatic improvement in model fit from the 2 to 3-class model. Best fit for the 4-class model.

Page 31: Cluster1 291013

31Best in the 2-class solution. Marginal differences between the 3 to 6 class solutions.

Page 32: Cluster1 291013

32Model >5 have classes with no modal assignments. 2 and 3 look good; 4 slightly less so.

Page 33: Cluster1 291013

33

Page 34: Cluster1 291013

34

Page 35: Cluster1 291013

35

SIMPLE BODYSIZE MODELS/bs

These models only include IOTF measures of bodysize at each of 4 time points.

Page 36: Cluster1 291013

36

Page 37: Cluster1 291013

37

Page 38: Cluster1 291013

38

Page 39: Cluster1 291013

39

Page 40: Cluster1 291013

40

Page 41: Cluster1 291013

41

Page 42: Cluster1 291013

42

Page 43: Cluster1 291013

43

Page 44: Cluster1 291013

44

Page 45: Cluster1 291013

45Dramatic improvement in model fit from the 2 to 3-class model. Minima for all three fit

measures found in the 6-class model.

Page 46: Cluster1 291013

46

Page 47: Cluster1 291013

47Not great overall. Best for 5 an 6.

Page 48: Cluster1 291013

48

Page 49: Cluster1 291013

49

Page 50: Cluster1 291013

50

Page 51: Cluster1 291013

51