introduction to cfa sem sept 19 2k5
TRANSCRIPT
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
1/35
CFA SEM:Modeling Causal Processes
BUSI6280
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
2/35
The term structural equation modeling
conveys two key aspects of the procedure:
That the causal processes under study are
represented by a series of structural (i.e.,regression ) equations, That these structural relations can be
modeled pictorially to enable a clearerconceptualization of the theory understudy.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
3/35
Why Use SEM ?
SEM lends itself well to the analysis of data forinferential purposes.
Whereas, traditional multivariate procedures areincapable of either assessing or correcting formeasurement error, SEM provides explicitesti mates of these parameters.
SEM procedures can incorporate both unobserved
(i .e. latent) and observed var iables.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
4/35
The factor analytic model (EFA or CFA) focuses solelyon how the observed variables are l inked to theirunder lying latent f actors .
Factor analysis is concerned with the extent to which
the observed variables are generated by the underlyinglatent constructs and thus str ength of the regression paths f rom the factors to the observed var iables (the factorloadings) are of primary interest.
Although inter-factor relations are also of interest, anyregression str ucture among them is not considered in thefactor analytic model.
Purpose of Factor Analysis
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
5/35
Measurement model: latent variables and their observedmeasures (i.e., the CFA model)
Structural model: Model with links among the latentvariables.
Full (Complete) Model : a measurement model and astructural model
Recursive model: Direction of cause is from one direction
only Non-recursive model: reciprocal or feedback effects (oftendifferent from one another).
Type of Models
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
6/35
Three different scenarios or models
(Jreskog 1993)
Strictly Confirmatory (SC)
Alternative Models (AM) Model Generating (MG)
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
7/35
The SC Scenario
The researcher postulates a single modelbased on theory , collects the appropriatedata, and then tests the fit of thehypothesized model to the sample data.
From the results of this test, the researcher
either rejects or fails to reject the model. Nofur ther modif ications to the model are made.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
8/35
The AM Scenario
The researcher proposes several alternative(i.e., competing) models , all of which are
grounded in theory .
Following analysis of a single set of empirical
data, the researcher selects one model asmost appropr iate in representing the sampledata.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
9/35
The MG Scenario
The researcher, having postulated and rejected atheoretically derived model on the basis of its poor fit tothe sample data, proceeds in an exploratory (rather thanconfirmatory) fashion to modify and re-estimate themodel.
The primary focus here is to locate the source of misfitin the model. Jreskog noted that, althoughrespecification may be either theory- or data-driven, theul timate objective is to f ind a model that is both substantivelymeaningful and statistically well f i tting.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
10/35
SEM procedures alternative computer programs
AMOS-Arbuckle, 1995 EQS-Bentler, 1995 LISCOMP-Muthn, 1998
CALIS-SAS Institute, 1992 RAMONA-Browne, Mels, & Coward, 1994 SEPATH-Steiger, 1994
LISREL program is the most widely used, 1970s
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
11/35
Exogenous latent variables are synonymous with independentvariables; they cause fluctuations in the values of otherlatent variables in the model.
Changes in the values of exogenous variables are not
explained by the model. Rather, they are considered to beinfluenced by other factors external to the model.
Endogenous latent variables are synonymous with dependentvariables and, as such, are influenced by the exogenous
variables in the model, either directly, or indirectly.
SEM - Language
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
12/35
By convention, observed measures are represented by Roman
letters and latent constructs by Greek letters:
Those that are exogenous are termed X-variables . Those that are endogenous are termed Y-variables .
The measurement model may be specified either in termsof LISREL exogenous notation (i.e., X-variables), or interms of its endogenous notation (i.e., Y-variables).
The exogenous latent constructs are termed as (xi).
The endogenous latent constructs are termed as (eta).
SEM - Language
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
13/35
SEM LanguageThe Measurement Model
x is a q x 1 vector of observed exogenous variables y is a p x 1 vector of observed endogenous variables. is an n x 1 vector of latent exogenous variables is an m x 1 vector of latent endogenous variables. is a q x n matrix of coefficients ( ij) linking x and
. is a q x 1 vector of random disturbance term
(errors of measurement) associated with x vector. is a p x 1 vector of random disturbance term
(errors of measurement) associated with y vector.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
14/35
SEM LanguageThe Structural Model
(gamma) is an m x n matrix of coefficients ( ij ) thatrelates the n exogenous factors to the m endogenous
factors .
B(beta) is an m x m matrix of coefficients ( ij ) that relatesthe m endogenous factors to one another.
(zeta) is an m x 1 vector of residuals ( i) representingerrors in the equation relating and .
(phi) is an n x n matrix of coefficients ( ij) thatcaptures the variance/covariance between s. (psi) is the m x m matrix of covariance between s.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
15/35
The Structural Model
Measurement Model for the X-variables (1):x= x+
Measurement Model for the Y-variables (2):y= y+
Structural Equation Model (3):=B + +
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
16/35
The following minimal assumptions are presumed to hold for the system of equations
is uncorrelated with (construct) is uncorrelated with (construct) is uncorrelated with and (construct) , , and are mutually uncorrelated.
E() = 0 E() = 0 E() = 0 E() = 0
E() = 0 (I-B) is nonsingular so that (I-B) exists. This makes theequation 3 to be written in the reduced form.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
17/35
Symbol Representation
Unobserved (latent) Factors Observed Variable Path coefficient for regression of
observed variable on unobservedfactors
Path coefficient for regression ofone factor on another.
Residual error (disturbance) in prediction of unobserved factors
Measurement error associatedwith observed variable.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
18/35
Summary of Matrices, Greek Notation, and Programs CodesMatrix Program Matrix
Greek Letter Matrix Element Code Type
Measurement Model
Lambda-X x x LX Regression Lambda-Y y y LY Regression Theta delta Q TD Var/cov Theta epsilon Q e e TE Var/cov
Structural Model Gamma GA Regression Beta BE Regression Phi PH Var/cov
Psi PS Var/cov Xi (or Ksi) --- --- Vector Eta --- --- Vector Zeta --- --- Vector Var/cov = variance-covariance
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
19/35
The Structural Model - 1 predicted by 1
x11
x21
x31
1
X1
X2
X3
2
3
1
1
Y1
Y2
1 1
2
y11
y21
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
20/35
An important corollary of SEM is that the
variances and covariance of dependent (orendogenous) variables, whether they beobserved or unobserved, are neverparameters of the model ; these areexplained by the exogenous variables.
In contrast, the var iance and covariance of
independent variable are impor tantparameters that need to be estimated.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
21/35
MEASUREMENT (CFA) MODELS
1 X1 11
2 X2 21 1
3 X3 31
CFA Part
1 1 11 Y1 1
21Y2 2
CFA Part
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
22/35
CFA Model
error ReadSCASC
error WriteSC
error TalkSC
SSCerror InteractSC
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
23/35
CFA with Greek Notation
1 x1 11 1
2 x2 1121
3
x3
32
2 4 x4 42
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
24/35
Regression Equations (Xs)
1. x 1 = 111 + 1 2. x 2 = 211 + 23. x 3 = 322 + 3 4. x 4 = 422 + 4Or in matrix form
X = x +
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
25/35
The parameters of this model are x,,and Where:
x represents the matrix of regressioncoefficients related to the s (described earlier).
(phi) is an x symmetrical variance-covariance matrix among the exogenousfactors.
(theta-delta) is a symmetrical q x q variance-covariance matrix among the error ofmeasurement for the q exogenous observedvariables
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
26/35
The general factor analytic model
can be expanded as:X = x +
x1 11 0 1 x2 21 0 1 2
= +
x3 0 32 2 3 x4 0 42 4
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
27/35
is the Loadings Matrix
The matrix is often termed the factor-loading matrix because it portrays the
pattern by which each observed variable islinked to its respective factor.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
28/35
The Ys
1. y 1 = 111 + 1 2. y 2 = 211 + 2
3. y 3 = 322 + 3 4. y 4 = 422 + 4
Or in matrix form
Y = y +
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
29/35
Matrix Notation for Loadings
with Regression ModelY = y + y1 11 0 1
y2 = 21 0 1 + 2y3 0 32 2 3y4 0 32 4
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
30/35
A just-identified model is one in whichthere is a one-to-one correspondence between thedata and the structural parameters.
Number of data variances and covariances equalnumber of parameters to be estimated.
However, despite the capability of the model toyield a unique solution for all parameters, the just-identified model is not scienti f ically interesting
because it has no degrees of f reedom and
therefore can never be rejected.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
31/35
Overidentified Model
An overidentified model is one in which thenumber of estimable parameters is less than the
number of data points (i.e., variance, covariance ofthe observed variable). This situation results in positi ve degrees of
freedom that allows for rejection of the model,
thereby rendering it scientific use. The aim inSEM , then, is to specify a model such that i tmeets the cr i ter ion of over identif ication.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
32/35
Underidentified Model
An underidentified model is one in which thenumber of parameters to be estimated exceeds the
number of variances and covariances.As such, the model contains insufficientinformation (from the input data) for the
purpose of attaining a determinate solution of
parameter estimation; that is, an inf ini te numberof solutions are possible for an under identi f iedmodel.
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
33/35
Suppose there are 12 observed variable, this means thatwe have 12(12+ 1)/2= 78 data points.
Suppose that there are 30 unknown parameters.Thus, with 78 data points and 30 parameters to beestimated, we have an overidentified model with 48degrees of freedom.
It is important to point out, however, that thespecification of an over identi f ied model is a
necessary, but not suff icient condition to resolve theidenti f ication problem. Indeed, the imposition ofconstraints on particular parameters can sometime be
beneficial in helping the researcher to attain an
overidentified model
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
34/35
No Scale Set for Constructs
Linked to the issue of identification is therequirement that every latent variable have
its scale determined. This requirementarises because these variable are unobservedand therefore have no definite metric scale;
-
8/12/2019 Introduction to CFA SEM Sept 19 2k5
35/35
Assume CFA Model with 12 variables (items) and4 factors (3 items per factor) .
We can assume that there are 12 regression coefficient ( s) There are 12 error variance (s). There are 4 factors variances (which may be standardized
and therefore set to 1).
There are 6 covariances between factors.
If the factor variances are not set to 1 then then one of theparameters for each factor can be fixed to a value of1.00 (they are therefore not to be estimated). The rationaleunderlying this constraint is tied to the issue of statisticalidentification. In total, then, there are 30 parameters to beestimated for this CFA model .