efa manual.doc
TRANSCRIPT
-
7/28/2019 EFA MANUAL.doc
1/11
Lab 9a -- Factor Analysis
Factor Analysis
SPSS Steps
Output -- Principal Components
Output -- Common Factor Analysis Interpretation
The area of Factor Analysis is, in many ways, diffuse and large. We will consider
only a portion of the subject. Factor Analysis is generally used to find parsimony
among several variables. The quest is usually for underlying constructs
orfactors that each explain the variation among several variables. We will consider
two mathematical models in this lab, though there are many more, and variants of
these.
SPSS Steps
Enter the variable names in an SPSS spreadsheet as usual in the "Variable View."Although it is possible, under some circumstances to use categorical variables in
factor analysis, you should use only ordinal, interval, or ratio scaled variables in this
lab. In the "Data View" enter the values for the variables for each case. Remember a
case is a row and a variable is a column. Also remember to save your data.
For the factor analysis lab, you will need a minimum of 6 variables with about 30
subjects. This is not enough for a serious factor analysis, but will be enough for your
lab. The example herein is one in which we use eight biometric
measures: height, armspan, forearem, lowerleg, weight, diameter, girth,
and width. The purpose is to attempt to represent these eight variables by a smallernumber of dimensions, or factors. This file, "PCPADEMO.sav" is included in this
folder.
To run the factor analysis lab go to the top of the spreadsheet and click on
Analyze/Data Reduction/Factor. A dialogue box will appear. Highlight the
variables in the variable list on the left that you wish to factor analyze and move them
out of the variable list on the left into the Variables: box or the right by clicking on
the right arrow between the boxes.
Now click on the Descriptives button. In the Descriptives dialog box that appears
check Univariate descriptives, Coefficients, Significance levels, and KMO
and Bartletts test of sphericity." Using the example data set, your screen should look
something like this:
http://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Factor%20Analysishttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#SPSS%20Stepshttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Output%20--%20Principal%20Componentshttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Output%20--%20Common%20Factor%20Analysishttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Interpretationhttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#SPSS%20Stepshttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Output%20--%20Principal%20Componentshttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Output%20--%20Common%20Factor%20Analysishttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Interpretationhttp://www.coe.fau.edu/faculty/morris/sta7114%20files/lab%209/9a%20instructions/factor_analysis_page.htm#Factor%20Analysis -
7/28/2019 EFA MANUAL.doc
2/11
-
7/28/2019 EFA MANUAL.doc
3/11
Now click the Extraction button in the "Factor Analysis" box. In the Extraction
dialog box (below), all you need to do is to click Scree plot. Before you click
Continue however, note that in this dialog box you can set the minimum eigenvalue
to retain (SPSS uses "Kaiser's Rule" of larger than 1.00 if you do not select another
minimum), and you can also specify the exact number of factors to retain regardless
of the eigenvalues. These are features that I leave for your experimentation. In
addition, note that there is the possibility of selecting different "Methods:" of factor
analysis here. The default method, highlighted in the image below is "Principal
components." As that is, by far, the most frequently used method, start with it.
When you click on the down arrow, you will see that there are many possible
methods. I also include the Principal Axis results (obtained by exactly the same steps
as the Principal Component results, except that the Principal Axis method was
selected at this step) which is a Common Factor method for contrast. Now click
"Continue" in the "Extraction" box and it disappears.
Now click the Rotation button in the Factor Analysis box. In this Rotation
dialog box (below) click Varimax and "Loading plot(s)." Again, before you
click the "Continue" button, note that a variety of rotations are offered. Now click
the Continue button in the Rotation box and it disappears..
-
7/28/2019 EFA MANUAL.doc
4/11
Now you are ready to click the OK button in the Factor Analysis box, and the
analysis will run.
The output file will appear, and for this example both the Principal Components
output and Principal Axis (Common Factor) output are below.
Principal Components -- Factor Analysis
Descriptive Statistics
Mean Std. Deviation Analysis N
HEIGHT 72.0000 20.03285 305
ARMSPAN 60.0000 15.02465 305
FOREARM 14.0000 5.00822 305
LOWERLEG 18.0000 10.01647 305
WEIGHT 180.0000 30.04926 305
DIAMETER 18.0000 7.01152 305
GIRTH 18.0000 7.01151 305
WIDTH 20.0000 7.01152 305
Correlation Matrix
HEIGH ARMSPA FOREAR LOWERLE WEIGH DIAMETE GIRT WIDT
-
7/28/2019 EFA MANUAL.doc
5/11
T N M G T R H H
Correlatio
n
HEIGHT 1.000 .846 .805 .859 .473 .398 .301 .382
ARMSPAN .846 1.000 .881 .826 .376 .326 .277 .415
FOREARM .805 .881 1.000 .801 .380 .319 .237 .345
LOWERLE
G.859 .826 .801 1.000 .436 .329 .327 .365
WEIGHT .473 .376 .380 .436 1.000 .762 .730 .629
DIAMETE
R.398 .326 .319 .329 .762 1.000 .583 .577
GIRTH .301 .277 .237 .327 .730 .583 1.000 .539
WIDTH .382 .415 .345 .365 .629 .577 .539 1.000
Sig. (1-
tailed)
HEIGHT .000 .000 .000 .000 .000 .000 .000
ARMSPAN .000 .000 .000 .000 .000 .000 .000
FOREARM .000 .000 .000 .000 .000 .000 .000
LOWERLE
G.000 .000 .000 .000 .000 .000 .000
WEIGHT .000 .000 .000 .000 .000 .000 .000
DIAMETE
R.000 .000 .000 .000 .000 .000 .000
GIRTH .000 .000 .000 .000 .000 .000 .000
WIDTH .000 .000 .000 .000 .000 .000 .000
KMO and Bartlett's Test
Kaiser-Meyer-Olkin Measure of Sampling Adequacy. .845
Bartlett's Test of Sphericity
Approx. Chi-Square 2085.738
df 28
Sig. .000
Communalities
Initial Extraction
HEIGHT 1.000 .877
-
7/28/2019 EFA MANUAL.doc
6/11
ARMSPAN 1.000 .903
FOREARM 1.000 .872
LOWERLEG 1.000 .861
WEIGHT 1.000 .850
DIAMETER 1.000 .739
GIRTH 1.000 .717
WIDTH 1.000 .625
Extraction Method: Principal Component Analysis.
Total Variance Explained
Initial Eigenvalues Extraction Sums of SquaredLoadings Rotation Sums of SquaredLoadings
Component Total% of
Variance
Cumulative
%Total
% of
Variance
Cumulative
%Total
% of
Variance
Cumulative
%
1 4.673 58.411 58.411 4.673 58.411 58.411 3.497 43.717 43.717
2 1.771 22.137 80.548 1.771 22.137 80.548 2.947 36.832 80.548
3 .481 6.013 86.561
4 .421 5.268 91.829
5 .233 2.915 94.744
6 .187 2.333 97.078
7 .137 1.716 98.794
89.646E-
021.206 100.000
Extraction Method: Principal Component Analysis.
-
7/28/2019 EFA MANUAL.doc
7/11
Component Matrix(a)
Component
1 2
HEIGHT .859 -.372
ARMSPAN .842 -.441
FOREARM .813 -.459
LOWERLEG .840 -.395
WEIGHT .758 .525
DIAMETER .674 .533
GIRTH .617 .580
WIDTH .671 .418
Extraction Method: Principal Component Analysis.
a 2 components extracted.
Rotated Component Matrix(a)
-
7/28/2019 EFA MANUAL.doc
8/11
Component
1 2
HEIGHT .900 .260
ARMSPAN .930 .195
FOREARM .919 .164
LOWERLEG .899 .229
WEIGHT .251 .887
DIAMETER .181 .840
GIRTH .107 .840
WIDTH .251 .750
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.
a Rotation converged in 3 iterations.
Component Transformation Matrix
Component 1 2
1 .771 .636
2 -.636 .771
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.
_____________________________________________________________________
_________________________
Common Factor Analysis
Communalities
Initial Extraction
-
7/28/2019 EFA MANUAL.doc
9/11
HEIGHT .816 .838
ARMSPAN .849 .889
FOREARM .801 .821
LOWERLEG .788 .808
WEIGHT .749 .888
DIAMETER .604 .640
GIRTH .562 .583
WIDTH .478 .492
Extraction Method: Principal Axis Factoring.
Total Variance Explained
Initial EigenvaluesExtraction Sums of Squared
Loadings
Rotation Sums of Squared
Loadings
Factor Total% of
Variance
Cumulative
%Total
% of
Variance
Cumulative
%Total
% of
Variance
Cumulative
%
1 4.673 58.411 58.411 4.449 55.611 55.611 3.315 41.438 41.438
2 1.771 22.137 80.548 1.510 18.875 74.486 2.644 33.049 74.486
3 .481 6.013 86.561
4 .421 5.268 91.829
5 .233 2.915 94.744
6 .187 2.333 97.078
7 .137 1.716 98.794
89.646E-
021.206 100.000
Extraction Method: Principal Axis Factoring.
Factor Matrix(a)
Factor
1 2
HEIGHT .856 -.324
ARMSPAN .848 -.411
FOREARM .808 -.409
-
7/28/2019 EFA MANUAL.doc
10/11
LOWERLEG .831 -.342
WEIGHT .750 .571
DIAMETER .631 .492
GIRTH .569 .510
WIDTH .607 .351
Extraction Method: Principal Axis Factoring.
a 2 factors extracted. 9 iterations required.
Rotated Factor Matrix(a)
Factor
1 2HEIGHT .872 .278
ARMSPAN .920 .204
FOREARM .887 .182
LOWERLEG .864 .248
WEIGHT .233 .913
DIAMETER .188 .778
GIRTH .129 .753
WIDTH .258 .652
Extraction Method: Principal Axis Factoring.
Rotation Method: Varimax with Kaiser Normalization.
a Rotation converged in 3 iterations.
Interpretation
The descriptive information shows the means and standard deviations for all of the
eight variables, as well as all possible bivariate correlations and their p values. We
note that all of the correlations are positive and significant as might be expected of
these variables.
Barlett's test of spericity is significant, thus the hypothesis that the intercorrelation
matrix involving these eight variables is an identity matrix is rejected. Thus from the
perspective of Bartlett's test, factor analysis is feasible. As Bartlett's test is almost
always significant, a more discriminating index of factor analyzability is the KMO.
-
7/28/2019 EFA MANUAL.doc
11/11
For this data set, it is .845, which is very large, so the KMO also supports factor
analysis.
Kaiser's rule of retaining factors with eigenvalues larger than 1.00 was used in this
analysis as the default. As the eigenvalues for the first two principal components (no
distinction is made in deciding dimensionality by SPSS in the principal component
and common factor analysis) with eigenvalues of 4.673 and 1.771 were retained.
The Principal Component communalities (Extraction, as the Initial are always 1.00)
range from .625 to .903, thus most of the variance of these variables was accounted
for by this two dimensional factor solution. One can see that the corresponding
Extraction communalities for the Common Factor analysis were a bit smaller (as
would be expected) but still show the majority of the variance of all variables
represented in the two factor solution. Note that the "Initial" communality estimates
for the SPSS version of a Principal Axis Common Factor Analysis are the R2 s
predicting each of the variables from all other variables -- a usual choice.
Also note the Scree Plot in the Principal Components output (the same thing is
produced in the Common Factor Analysis). The Scree Plot is a graphic aid proposed
by Cattell. It is simply a plot of the monotonically descending eigenvalues. It is
intended to help in deciding where a the "trivial" dimensions begin. One might argue
that the Kaiser Rule opting for two dimensions is fairly well supported by the Scree
Plot.
In the Principal Components Output, the Rotated Component Matrix gives thecorrelation of each variable with each factor. From the contribution of the variables
(also called a "loading") we can name these factors something like "Lankiness" and
"Heaviness." One might come up with a variety of other names that are equally
descriptive. You will note that the results of the Common Factor analysis are much
the same with loadings that are a bit smaller. One might argue that the two methods,
therefore, give the same result. However, that would be dangerous as it depends on
the number of variables, their communalities, and also we are restricting the results to
the same dimensionality in this case.