ft a lifactor analysis - university of pittsburgh 2011 [compatibility mode].pdfpatients with chronic...

30
F t A l i Factor Analysis MERMAID Series 12/11 Galen E. Switzer, PhD Rachel Hess, MD, MS

Upload: others

Post on 07-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

F t A l iFactor Analysis

MERMAID Series 12/11

Galen E. Switzer, PhDRachel Hess, MD, MS

Page 2: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Ways to Examine Groups of Things

Groups of People Groups of Indicators

Ways to Examine Groups of Things

Groups of People Groups of Indicators

Cluster Analysis ExploratoryFactor Analysis

Latent ClassAnalysis

ConfirmatoryFactor AnalysisAnalysis Factor Analysis

(Structural Equation Modeling)

Page 3: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Groups of People vs Groups of Indicators/Items

Groups of People Groups of Indicators

Group Individual Learning MentoringpLearners Learners

gStyle

gQuality

MixedLearners

LifeStressorsLearners Stressors

Which set of learners is most successfulin a traditional medical training environment?

Which is the strongest predictor of medicalschool success?

Page 4: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Latent Structure and Factor AnalysisLatent Structure and Factor Analysis

I I III

I

III II

I I

I IIII

II Expertise • Expertise is an unobserved,

latent construct/conceptI

III II

III

IIII

II II

• Expertise is measured with multiple indicators

• Expertise could be a single construct or a construct with subdomains

I5I1I3I2 I

• These subdomains are also referred to as “factors”

I3I2 I4

ExperienceKnowledge

Page 5: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

SF-12 ExampleSF 12 Example

Physical Dimensiony• General health• Physical limits on daily activities• Physical limits on strenuous activities• Physical limits on ability to accomplish things• Physical limits on work activities• Pain

Emotional Health Dimension• Emotional limits on ability to accomplish things• Emotional limits on work and other activities• Calm and peaceful• Energy

F li d• Feeling down• Interference with social activities

Page 6: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Factor Analysis in Medical EducationFactor Analysis Methods and Validity Evidence: A Systematic Review ofInstrument Development Across the Continuum of Medical Education

Angela Wetzel; May, 2011

Systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. A comprehensive review of exploratory factor analysis in instrument development across the continuum of medical education had not been ppreviously identified. This study is a critical review of instrument development articles employing exploratory factor or principal component analysis published in medical education. Data extraction of 64 articles published in the peer-reviewed medical education literature indicates significant errors in the translation of exploratory factor analysis best practices to current practice. p y y p pInstruments reviewed for this study lacked supporting evidence based on relationships with other variables and response process, and evidence based on consequences of testing was not evident.Findings suggest a need for further professional development within the medical education researcher community related to 1) appropriate factor analysis methodology and reporting and 2) the importance of pursuing multiple sources of reliability and validity evidence to construct a well-supported argument for the inferences made from the instrument. Medical education researchers and educators should be cautious in adopting instruments from the literature and carefully review available evidence. Finally, editors and reviewers are encouraged to recognize this gap i b t ti d b tl t t i t t d l t h th t i i t tin best practices and subsequently to promote instrument development research that is more consistent through the peer-review process.

Page 7: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Two Types of Factor Analysisy y

Exploratory (EFA) Confirmatory (CFA)link between observed & latentvariables unknown

• Principal Components

link between observed & latent variables known/hypothesized

• Factor Analysis Modelsp p

• Factor Analysis

• max. likelihood factoring

y

• structural equation methods

• alpha factoring

• unweighted least squares

• generalized least squares

• Hybrids of FA/PCA

• principal factors• principal factors

• image factoring

Page 8: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Exploratory Factor Analysis

Specific GoalsSpecific Goals

• reduce the number of indicators/items required to describe a construct

• assess and improve a measure’s propertiesassess and improve a measure s properties

• provide an operational definition for an underlying constructunderlying construct

• generate hypotheses

Page 9: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Exploratory Factor AnalysisPublished Examples

• Differential behavioral patterns related to alcohol use in rodents: A factor analysis. (Salimov, 1999)

• Factor analysis of laboratory and clinical measurements of dyspnea in patients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary disease. (Nguyen et al., 2003)

• Morphological abstraction of thyroid tumor cell nuclei using p g y gmorphometry with factor analysis. (Murata et al., 2003)

• Factor analysis of the interrelationships between clinical variables in h ith li (Th f t l 2001)horses with colic. (Thoefner, et al., 2001)

• Factor analysis shows that female rat behavior is characterized primarily by activity male rats are driven by sex and anxietyprimarily by activity, male rats are driven by sex and anxiety. (Fernandes et al., 1999)

Page 10: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA Terminology• Observed variable – a variable measured directly• Latent variable – an unobserved variable that “causes” observed

variables• Correlation matrix – a table showing all pairs of correlations between

observed variables/indicators/items • Factor analysis – a set of statistical procedures and judgments used to

uncover and examine latent variables• Factor – measurement term for a latent variable• Factor loading – correlation between an indicator and an underlying

factor – higher loading means the indicator better represents the factor• Eigenvalue – a numeric value statistically assigned to each factor thatEigenvalue a numeric value statistically assigned to each factor that

corresponds to how much explanatory power that factor has• Scree plot – a visual depiction of eigenvalues plotted by factor• Rotation a statistical procedure designed to find the “cleanest”• Rotation – a statistical procedure designed to find the cleanest

possible factor solution – orthogonal or oblique

Page 11: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Observed vs Latent Variables

Observed Variables• measured directly• almost always contain error• examples: body mass, lung volume, blood pressure, cell morphology

Latent Variables• cannot be measured directly (unobserved)• cause or produce observed variables• examples: student satisfaction, academic success, family stress, burnout

latent observeditem 1itb t item 2item 3

burnout

Page 12: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA: Steps & Issues

1. Examine descriptive statistics-- indicators on 1-5 scale

Mean Std. Dev. Analysis N

Ind 1 2 5 0 40 309Ind 1 2.5 0.40 309Ind 2 5.1 0.38 318 Ind 3 4.2 3.90 317

. . . .

.

. . .

.

.

.

.

• Data accurate?• Indicators normally distributed?

?• Missing data?• Adequate sample size? (good=300, very good=500, excellent=1,000)

Page 13: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA: Steps & Issues2. Examine correlation matrix of variables

Var 1 Var 2 Var 3 Var 4 Ind 1 1.00 Ind 2 0 40 1 00Ind 2 0.40 1.00 Ind 3 0.05 0.15 1.00 Ind 4 0.27 0.30 0.68 . . .

..

.

.

• Variability in sizes of correlations?• Matrix factorable? (several > .30)• Outliers? (indicators that don’t correlate with any others)

Page 14: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA: Steps & Issues3. Examine eigenvalues to decide how many factors to include

Initial EigenvaluesInitial EigenvaluesFactor

Total % of Variance Cumulative % 1 6.211 43.48 43.48 2 1.784 12.49 55.973 0.856 5.99 61.97

.

. . .

.

. . .

. .N = 7 items

. .100.00

• eigenvalue indicates the % of variance accounted for by the factor

• provides first evidence of whether Factor Analysis is useful

• generally identify factors with eigenvalues > 1.0 as important

Page 15: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Scree Plot Examples4. Examine scree plot to decide how many factors to extract

6

5

4

6

5

4

6

5

4

3

2

1

3

2

1

3

2

1

1 2 3 4 5 6 7 8 9 10No. of factors

Plot

1 2 3 4 5 6 7 8 9 10No. of factors

Plotb

1 2 3 4 5 6 7 8 9 10No. of factors

Plota b c

• factors are plotted by eigenvalue• important factors are to the left of the scree• important factors are to the left of the scree• tendency to over-extract using eigenvalues and under-extract using

scree plot

Page 16: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA: Steps & Issues

5. Rotate the factor solutionGoal: clarify interpretation by maximizing high and minimizing lowGoal: clarify interpretation by maximizing high and minimizing low factor loadings

Factor/Component p 1 2 3 . . .

Ind 1 .45 .30 .12 Ind 2 .60 .14 .02 I d 3 01 20 20Ind 3 .01 .20 .20Ind 4

.

.

.17 .20 .30

.

l di l ti b t i di t d i f t• loading = correlation between indicator and a given factor• the matrix above is difficult to interpret

Page 17: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Pain Example

Victor, et al., (2008) Clinical Journal of Pain, Jul-Aug;24(6):550-5.

Page 18: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

EFA: Steps & Issues

7. Determine if the factor solution is adequate

• patterns should be evident in correlation matrix

• two or more indicators per factor

indicator’s loading on its factor sho ld e ceed 0 30• indicator’s loading on its factor should exceed 0.30

• “simple structure” has been achieved (i.e., fewer factors than indicators)indicators)

• examine factors’ internal consistency reliability in separate alpha analysisalpha analysis

Page 19: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Summary: EFA Steps and Issues

Exploratory Factor Analysis

• examine descriptive statistics• examine descriptive statistics

• examine correlation matrix

• conduct EFA and examine eigenvalues

• examine scree plot

• select rotation type and rotate solution

• evaluate the adequacy of the final solutionevaluate the adequacy of the final solution

Page 20: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Summary: Exploratory Factor Analysisy p y y

E l t F t A l iExploratory Factor Analysis

• can be useful for empirically summarizing &can be useful for empirically summarizing & reducing data

d i l i i l i ifi i• does not involve statistical significance testing

• relies on guidelines rather than rigid rulesrelies on guidelines rather than rigid rules

Page 21: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Confirmatory Factor Analysis

G l G lGeneral Goals

• to test a hypothesized set of relationships amongto test a hypothesized set of relationships among observed and unobserved variables

f l• to compare factor structure across samples

• to test construct/factor validityto test construct/factor validity

Page 22: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

CFA: Steps & IssuesCFA: Steps & Issues

1. Generate a visual model

• depict relationships among observed and unobserved variables

• depict relationships among unobserved variables

Page 23: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

stabbing e11

1

1

sharp e21

frozen e31

1

throbbing e41

hot e51

2

sore e61

tender e711

dull e811

sore e6

3 achy e91

stiff e101

Page 24: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

CFA: Steps & IssuesCFA: Steps & Issues

2. Evaluate Goodness of Fit Indices

• select appropriate indices

GFI AGFI CFI t > 90• GFI, AGFI, CFI etc. > .90

• RMSEA < .08

Page 25: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

CFA: Steps & IssuesCFA: Steps & Issues

3. Evaluate alternate models• alternate models from EFA• alternate models from EFA

• alternate models suggested by CFA

• compare goodness of fit indices

Page 26: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

CFA Fit Indices for Five ModelsCFA Fit Indices for Five Models

Model GFI AGFI CFI RMSEA

Single Factor 937 901 859 076 Single Factor .937 .901 .859 .076

2 Factor Uncorrelated .917 .870 .753 .101

2 Factor Correlated .944 .909 .873 .073

3 Factor Uncorrelated .862 .795 .540 .134

3 Factor Correlated .966 .947 .960 .041

Page 27: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Three Correlated-Factor Solution forThree Correlated Factor Solution for Subgroups

Group GFI AGFI CFI RMSEA Group GFI AGFI CFI RMSEA

Total .966 .947 .960 .041

White .934 .896 .913 .061

African American .953 .926 .999 .000

Page 28: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Next StepsNext Steps

• evaluate internal consistency of items belonging to a single factor (alpha)

• create subscales by averaging or summing scores on indicators belonging to a single factorscores on indicators belonging to a single factor

• name the factors/subscales

• use the subscales as predictors or outcomes in other analysesother analyses

Page 29: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Factor Subscales as Outcomes

Demographics- race/ethnicity- age- SES

Acute PainPsycholosocial

mental health Acute Pain- mental health- social support- perceived access

Clinical- symptomsy p- diagnosis

Page 30: Ft A liFactor Analysis - University of Pittsburgh 2011 [Compatibility Mode].pdfpatients with chronic obstructive pulmonary disease (Nguyen et alpatients with chronic obstructive pulmonary

Factor Analysis ReferencesFactor Analysis References

Comrey AL, Lee HB. A first course in factor analysis, 2nd ed. Hillsdale, NJ: Erlbaum, 1992.

Gorsuch, RL. Factor analysis, 2nd ed. Hillsdale, NJ: Erlbaum, 1983.

Tabachnick BG, Fidell LS. Using multivariate statistics, 5th

ed. NY: HarperCollins, 2007.

Kline P. An easy guide to factor analysis. London: Routledge, 1994.Routledge, 1994.