1
Introduction to Statistical Mediation
David P. MacKinnon
Arizona State University
Center for AIDS Prevention Studies, UCSF, June 12-13, 2007
Brown, Cheong, Fairchild, Fritz, Lockwood, Morgan-Lopez, Taylor, Tein, Williams, West, Wang, Yoon
Undergraduate Social Psychology Class
Graduate School UCLA Quantitative Psychology
Drug Prevention Research at USC
Support from the National Institute on Drug Abuse
http://www.public.asu.edu/~davidpm/
MacKinnon, D. P. (2007). Introduction to Statistical Mediation Analysis. Mahwah, NJ: Erlbaum.
2
Goals of CAPS Presentation
• Describe many mediating variable examples.
• Describe reasons for mediation analysis: it can help improve prevention programs and reduce their cost. It is also useful for testing theories.
• Describe the latest methods to assess mediation.
• Describe limitations of mediation analysis.
• Describe experimental as well as non-experimental designs to investigate mediating variables.
3
Overview of Presentation
• Mediation Examples and Definition
• Statistical Mediation Analysis
• New tests for Mediation
• Limitations of Statistical Mediation Analysis
• Designs to address limitations of Mediation Analysis
• Summary and Future Directions
4
Psychology Example
Stimulus: Multiply 24 and 16
Organism: You
Response: Your Answer
Organism as a Black Box
Stimulus > Organism > Response (SOR) theory, whereby the effect of a Stimulus on a Response depends on mechanisms in the organism (Woodworth, 1928). These mediating mechanisms translate the Stimulus to the Response. SOR theory is ubiquitous in psychology.
5
Mediation Statements
• If norms become less tolerant about smoking then smoking will decrease.
• If you increase positive parental communication then there will be reduced symptoms among children of divorce.
• If children are successful at school they will be less anti-social.
• If unemployed persons can maintain their self-esteem they will be more likely to be reemployed.
• If pregnant women know the risk of alcohol use for the fetus then they will not drink alcohol during pregnancy.
6
Mediator Definition and Examples
A variable that is intermediate in the causal process relating an independent to a dependent variable.
Attitudes cause intentions which then cause behavior (Ajzen & Fishbein, 1980)
Prevention programs change norms which promote healthy behavior (Judd & Kenny, 1981)
Exposure to an argument affects agreement with the argument which affects behavior (McGuire, 1968)
7
More Mediation Examples
• Psychotherapy induces catharsis, insight, and other mediators which lead to a better outcome (Freedheim & Russ, 1981)
• Psychotherapy changes attributional style which reduces depression (Hollon, Evans, & DeRubeis, 1991)
• Parenting programs reduce parents’ negative discipline which reduces symptoms among children with ADHD (Hinshaw, 2002).
8
CAPS Mediation Examples
Social problem solving affects psychological health which affects adherence to HIV medications (Johnson et al., 2006)
Girl/boy friend in 7th grade affects peer norms about sexual behavior which affects sexual behavior in 9th grade (VanOss et al., 2006)
Condom promotion program changes attitudes about sexual enjoyment from condoms which changes condom use (Choi et al., 2007).
Affective regulation affects stimulant use and nonadherence to medications which affects viral load (Carrico et al., 2007).
9
Mediation Analysis in Treatment and Prevention Research
• Mediation is important for prevention and treatment research. Practical implications include reduced cost and more effective treatments.
• Mediation analysis is based on theory for the processes underlying treatments. Action theory corresponds to how the treatment will affect mediators—the X to M relation. Conceptual Theory focuses on how the mediators are related to the outcome variables—the M to Y relation (Chen, 1990, Lipsey, 1993).
10
Questions about mediators for treatment and prevention.
• Are these the right mediators? Are they causally related to the outcome? Is self-esteem causally related to symptoms? Conceptual Theory
• Can these mediators be changed? Can personality be changed? Action Theory
• Will the change in these mediators that we can muster with our treatment be sufficient to lead to desired change in the outcome? Do we have the resources to change self-esteem in four sessions? Both Action and Conceptual Theory.
11
Quotes about mediation analysis
In the absence of a concern for such mediating or intervening mechanisms, one ends up with facts, but with incomplete understanding (Rosenberg, 1968, p. 63).
.. much of what social psychologists do is attempt to understand how internal processes mediate the effect of the situation on behavior (Kenny, Kashy, & Bolger, 1998, p. 259).
12
More Quotes
Nursing “.. Should consider hypotheses about mediators …. that could provide additional information about why an observed phenomenon occurs” (Bennett, 2000).
Children’s programs “.. Including even one mediator ….. in a program theory and testing it with the evaluation .. will yield more fruit….” (Petrosino, 2000)
Child mental health “rapid progress … depends on efforts to identify … mediators of treatment outcome. We recommend randomized clinical trials routinely include and report such analyses” (Kraemer et al., 2002).
13
“Everyone talks about the weather but nobody does anything about it.” (Mark Twain)
14
Mediation Examples
Residential instability reduced collective efficacy which increased violence (neighborhoods, Sampson et al., 1997)
Anabolic prevention program affects norms regarding healthy behavior which reduced intentions to use steroids (Krull & MacKinnon, 1999; 2001).
Alcohol prevention program affected norms which reduced alcohol use (Komro et al., 2001)
15
Mediation is important because …
Central questions in many fields are about mediating processes.
Important for basic research on mechanisms of effects.
Critical for applied research, especially prevention and treatment.
Many interesting statistical and mathematical issues.
16
2, 3, or 4 variable effects
Two variables: X → Y, Y → X, or X ↔ Y (X and Y are reciprocally related). Measures of effect include the correlation, covariance, regression coefficient, odds ratio, mean difference.
Three variables: X → M → Y, X → Y → M, Y → X → M, and all combinations of reciprocal relations. Special names for third-variable effects: confounder, mediator, moderator/interaction.
Four variables: many possible relations among variables, e.g., X → Z → M → Y
17
Mediator versus Confounder
Confounder is a variable related to two variables of interest that falsely obscures or accentuates the relation between them (Meinert & Tonascia, 1986).
The definition below is also true of a confounder because a confounder also accounts for the relation but it is not intermediate in a causal sequence.
In general, a mediator is a variable that accounts for all or part of the relation between a predictor and an outcome (Baron & Kenny, 1986, p. 1176).
18
Mediator versus Moderator
Moderator is a variable that affects the strength of the relation between two variables. The variable is not intermediate in the causal sequence so it is not a mediator.
Moderator is usually an interaction, the relation between X and Y depends on a third variable. There are other more detailed definitions of a moderator.
19
Other names for Variables in the Mediation Model
Antecedent to Mediating to Consequent (James & Brett, 1984).
Initial to Mediator to Outcome (Kenny, Kashy & Bolger, 1998).
Program to surrogate endpoint to ultimate endpoint (Prentice, 1989).
Independent to Mediating to Dependent used in this presentation.
20
Three ways to specify a model
Verbal description: A variable M is intermediate in the causal sequence relating X to Y.
Diagram
Equations
21
Mediation Regression Equations
-Start here with the simplest mediation model with one mediator.
-Tests of mediation use information from some or all of three equations
-The coefficients in the equations may be obtained using methods such as ordinary least squares regression, covariance structure analysis, or logistic regression.
22
Single Mediator Model
[Figure: path diagram. Independent variable X to mediator M (path a); mediator M to dependent variable Y (path b); X to Y directly (path c’).]
23
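Relation of X to Y
[Figure: path diagram showing the c path from the independent variable X to the dependent variable Y; the mediator M is drawn but not yet connected.]
1. The independent variable is related to the dependent variable:
$Y = i_1 + cX + e_1$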
24
Relation of X to M
[Figure: path diagram showing the a path from the independent variable X to the mediator M.]
2. The independent variable is related to the potential mediator:
$M = i_2 + aX + e_2$
25
Relation of X and M to Y
[Figure: path diagram showing the a path from X to M, the b path from M to Y, and the c’ path from X to Y.]
3. The mediator is related to the dependent variable controlling for exposure to the independent variable:
$Y = i_3 + c'X + bM + e_3$
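As a rough illustration of Equations 1 to 3, the sketch below fits the three regressions by ordinary least squares on simulated data and checks that ab equals c - c’ in the sample. The data, sample size, and effect sizes are hypothetical and chosen only for illustration; the helper `ols` is not from the presentation.

```python
import numpy as np

def ols(y, *predictors):
    """Return OLS coefficients [intercept, slope_1, ...] for y on the predictors."""
    design = np.column_stack([np.ones(len(y)), *predictors])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

# Hypothetical simulated data: X is a randomized 0/1 program indicator.
rng = np.random.default_rng(0)
n = 500
x = rng.integers(0, 2, size=n).astype(float)
m = 0.5 * x + rng.normal(size=n)            # X affects M
y = 0.4 * m + 0.1 * x + rng.normal(size=n)  # M and X affect Y

c = ols(y, x)[1]              # Equation 1: Y = i1 + cX + e1
a = ols(m, x)[1]              # Equation 2: M = i2 + aX + e2
_, c_prime, b = ols(y, x, m)  # Equation 3: Y = i3 + c'X + bM + e3

print(f"a = {a:.4f}, b = {b:.4f}, c = {c:.4f}, c' = {c_prime:.4f}")
print(f"ab = {a * b:.4f}, c - c' = {c - c_prime:.4f}")
```

With OLS and the same cases in all three equations, the two estimates of the mediated effect agree up to floating-point error; the equality does not hold in general for models such as logistic regression.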
26
Mediated Effect Measures
Mediated effect = ab; standard error = $s_{ab} = \sqrt{a^2 s_b^2 + b^2 s_a^2}$
Mediated effect = ab = c - c’ (MacKinnon et al., 1995)
Direct effect = c’
Total effect = ab + c’ = c
Test for significant mediation:
$z' = \dfrac{ab}{\sqrt{a^2 s_b^2 + b^2 s_a^2}}$
Compare z’ to the empirical distribution of the mediated effect.
27
Assumptions I
For each method of estimating the mediated effect based on Equations 1 and 3 (c - c’) or Equations 2 and 3 (ab):
• Predictor variables are uncorrelated with the error in each equation.
• Errors are uncorrelated across equations.
• Predictor variables in one equation are uncorrelated with the error in other equations.
• Reliable and valid measures.
• No omitted influences.
• Normally distributed variables.
28
Assumptions II
• Data are a random sample from the population of interest.
• Coefficients a, b, c’ reflect true causal relations and the correct functional form.
• Mediation chain is correct: temporal ordering is correct, X before M before Y. Any mediation model is part of a longer mediation chain. The researcher decides what part of the micromediational chain to examine.
• Homogeneous effects across subgroups: the relations from X to M and from M to Y are homogeneous across subgroups or other characteristics of participants in the study. This means there are no moderator effects. It is routine to test the XM interaction in Equation 3.
29
Three Major Types of Single Sample Tests for the Mediation Effect
(1) Causal Steps: Series of tests described in Baron and Kenny (1986) for example.
(2) Difference in Coefficients: c-c’, e.g., from Clogg et al. (1992)
(3) Product of Coefficients: ab, e.g., from Sobel (1982)
See MacKinnon et al., Psychological Methods (2002) for a review and comparison of single
sample tests
30
Causal Steps Tests of Mediation
• Judd & Kenny (1981): 3 steps plus Step 4, c’ is nonsignificant.
• Baron & Kenny (1986): 3 steps plus Step 4, drop from c to c’.
• Test of whether the a and b paths are statistically significant (MacKinnon et al., 2002).
31
Difference in Coefficients
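Significance test: $t_{N-2} = (c - c')/s_{c-c'}$
• General formula for $s^2_{c-c'}$, where $s_{cc'}$ is the covariance between c and c’:
$s^2_{c-c'} = s^2_c + s^2_{c'} - 2s_{cc'}$
• Clogg, Petkova, and Shihadeh (1992):
$s^2_{c-c'} = (s_{c'}\,|r_{xm}|)^2$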
32
Product of Coefficients
Formulas for the variance of ab
• Multivariate delta variance: Sobel (1982), Folmer (1981)
$s^2_{ab} = a^2 s^2_b + b^2 s^2_a$
• Exact variance: Aroian (1944)
$s^2_{ab} = a^2 s^2_b + b^2 s^2_a + s^2_a s^2_b$
• Unbiased variance: Goodman (1960)
$s^2_{ab} = a^2 s^2_b + b^2 s^2_a - s^2_a s^2_b$
• Test based on the distribution of the product of two random variables using critical values from Meeker et al. (1988) using a program called PRODCLIN.
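The three variance formulas differ only in the sign and presence of the $s^2_a s^2_b$ term. A minimal sketch of the arithmetic, borrowing the coefficient values used in the PRODCLIN example later in this presentation (a = .3386, sa = .1224, b = .4510, sb = .1460); the variable names are illustrative only.

```python
import math

# Coefficients and standard errors from the PRODCLIN example in this deck.
a, s_a = 0.3386, 0.1224
b, s_b = 0.4510, 0.1460

ab = a * b
delta_term = a**2 * s_b**2 + b**2 * s_a**2
cross_term = s_a**2 * s_b**2

var_sobel = delta_term                 # multivariate delta (first-order)
var_aroian = delta_term + cross_term   # exact variance
var_goodman = delta_term - cross_term  # unbiased variance

se_sobel = math.sqrt(var_sobel)
se_aroian = math.sqrt(var_aroian)
se_goodman = math.sqrt(var_goodman)

# Normal-theory z statistic using the delta (Sobel) standard error.
z_sobel = ab / se_sobel

print(f"ab = {ab:.4f}")
print(f"SE: Sobel {se_sobel:.4f}, Aroian {se_aroian:.4f}, Goodman {se_goodman:.4f}")
print(f"z (Sobel) = {z_sobel:.3f}")
```

The three standard errors are close here because the cross term is small relative to the other two terms; the Goodman variance can become negative when a and b are both near zero.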
33
Empirical sample size estimates for .8 power to detect the mediated effect

Test                     S-S    S-M    S-L    M-S    M-M    M-L    L-S    L-M    L-L
Baron/Kenny (τ’ = 0)   20886   3039   1561   2682    397    204   1184    175     92
a & b Joint              530    403    403    405     74     58    405     59     36
Delta                    667    422    412    421     90     66    410     67     42
PRODCLIN                 539    401    402    404     74     57    404     58     35

Note: Column labels give the sizes of the a and b paths (S = small, M = medium, L = large). Table entries are based on empirical simulation so they are not exact. Fritz & MacKinnon (2007).
34
Reasons for Differences Among Methods
Requirement for significant total effect, c, and requirement that c’ is nonsignificant reduces accuracy of causal steps methods.
Assumption that the mediated effect divided by its standard error has a normal distribution is incorrect for some values.
Mediation is a test of two paths corresponding to a and b paths.
35
Distribution of the Product
The mediated effect is the product of two coefficients a and b. The distribution of the product has a normal distribution only in special cases.
At low values of a and b, the distribution has excess kurtosis and skewness, e.g. when a and b are both zero, kurtosis is 6. It is not surprising that the confidence limits are inaccurate if the distribution is assumed to be normal.
One solution is to use the distribution of the product in statistical tests and confidence limits.
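The kurtosis value can be checked by simulation. The sketch below assumes the simplest case: when the true a and b are both zero, the standardized estimates behave like two independent standard normal variables, so their product has fourth moment 3 × 3 = 9 and variance 1, giving an excess kurtosis of 6. The sample size and seed are arbitrary.

```python
import numpy as np

def excess_kurtosis(values):
    """Sample excess kurtosis: fourth central moment / variance^2 - 3."""
    d = values - values.mean()
    return (d**4).mean() / (d**2).mean() ** 2 - 3.0

rng = np.random.default_rng(1)
n = 1_000_000
z_a = rng.normal(size=n)   # standardized a estimate when true a = 0
z_b = rng.normal(size=n)   # standardized b estimate when true b = 0
product = z_a * z_b

k_normal = excess_kurtosis(z_a)
k_product = excess_kurtosis(product)

print(f"excess kurtosis, normal:  {k_normal:.3f}")
print(f"excess kurtosis, product: {k_product:.3f}")
```

The normal variable comes out near 0 and the product near 6, which is why normal-theory critical values fit the product poorly when both paths are small.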
36
37
PRODCLIN (distribution of the PRODuct Confidence Limits for the INdirect effect)
MacKinnon, Fritz, Williams, and Lockwood (in press, Behavior Research Methods) describe a program to compute critical values for the distribution of the product.
Web location includes programs in SAS, SPSS, and R that access a FORTRAN program: http://www.public.asu.edu/~davidpm/ripl/Prodclin/
Input: a, sa, b, sb, the correlation between a and b, and the Type I error rate.
Output: the input values, plus normal and distribution-of-the-product confidence limits.
38
Critical Values for Distribution of the Product
Because the distribution of the product is not normal, there are different critical values for the distribution for each value of a/sa and b/sb.
The critical values are -1.96 and +1.96 for the 95% confidence interval from the normal distribution. There are different upper and lower critical values for the distribution of the product.
Confidence limits and significance tests are more accurate using the critical values from the distribution of the product (MacKinnon et al., 2004).
39
Example Calculations using the Distribution of the Product
For example, a = .3386, sa = .1224, b = .4510, sb = .1460. Enter these values in the PRODCLIN program.
PRODCLIN returns Mlower = -1.6175, the critical value for the 2.5th percentile, and Mupper = 2.2540, the critical value for the 97.5th percentile.
Use the critical values to calculate lower and upper confidence limits.
LCL = ab + Mlower sab = .1527 + (-1.6175)(.0741)
UCL = ab + Mupper sab = .1527 + (2.2540)(.0741)
Asymmetric confidence limits are (.0329, .3197).
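The arithmetic in this example can be reproduced in a few lines. The sketch takes the two critical values reported by PRODCLIN as given inputs (it does not compute them) and uses the delta standard error for sab; the normal-theory limits are added only for comparison.

```python
import math

a, s_a = 0.3386, 0.1224
b, s_b = 0.4510, 0.1460

# Critical values as reported by PRODCLIN for these inputs (taken as given).
m_lower, m_upper = -1.6175, 2.2540

ab = a * b
s_ab = math.sqrt(a**2 * s_b**2 + b**2 * s_a**2)

# Asymmetric limits from the distribution of the product.
lcl = ab + m_lower * s_ab
ucl = ab + m_upper * s_ab

# Symmetric normal-theory limits for comparison.
normal_lcl = ab - 1.96 * s_ab
normal_ucl = ab + 1.96 * s_ab

print(f"ab = {ab:.4f}, s_ab = {s_ab:.4f}")
print(f"distribution of the product: ({lcl:.4f}, {ucl:.4f})")
print(f"normal theory:               ({normal_lcl:.4f}, {normal_ucl:.4f})")
```

The asymmetric limits agree with the values on the slide to within rounding of the inputs. Relative to the normal-theory interval, both limits shift away from zero, reflecting the skew of the product distribution.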
40
Resampling Methods
-Another good option for data that do not have a normal distribution is resampling methods (MacKinnon et al., 2004).
-Bootstrap method for mediated effects was described by Bollen & Stine (1991), Lockwood & MacKinnon (1998), MacKinnon et al. (2004), and Shrout & Bolger (2002).
-Purpose is to use the data itself to form a distribution of a statistic (Manly, 1997). Does not make as many assumptions and can handle nonnormal distributions.
-The value of a statistic in the observed sample is compared to the distribution of the statistic formed by resampling from the data a large number of times.
41
Bootstrap Test for Mediation
-Estimate the mediated effect in the sample.
-Make a new data set by sampling N subjects’ data with replacement, and estimate the mediated effect in each of a large number (1000) of bootstrap samples.
-Determine the significance level by locating the mediated effect for the observed sample in the distribution of the bootstrap estimates. Find the 2.5% and 97.5% values for the confidence interval.
-Bias-corrected bootstrap makes a correction for the difference between the observed and average bootstrapped mediated effect.
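The steps above can be sketched as a percentile bootstrap. This is a minimal illustration on simulated data, not the bias-corrected version; the function names, sample size, effect sizes, and seeds are all hypothetical.

```python
import numpy as np

def mediated_effect(x, m, y):
    """Product-of-coefficients estimate ab from Equations 2 and 3 via OLS."""
    ones = np.ones(len(x))
    a = np.linalg.lstsq(np.column_stack([ones, x]), m, rcond=None)[0][1]
    b = np.linalg.lstsq(np.column_stack([ones, x, m]), y, rcond=None)[0][2]
    return a * b

def percentile_bootstrap(x, m, y, n_boot=1000, alpha=0.05, seed=0):
    """Percentile confidence limits for ab from n_boot resamples of cases."""
    rng = np.random.default_rng(seed)
    n = len(x)
    estimates = np.empty(n_boot)
    for i in range(n_boot):
        idx = rng.integers(0, n, size=n)  # sample N cases with replacement
        estimates[i] = mediated_effect(x[idx], m[idx], y[idx])
    lower, upper = np.quantile(estimates, [alpha / 2, 1 - alpha / 2])
    return lower, upper

# Hypothetical simulated data with a true mediated effect of 0.6 * 0.5 = 0.3.
rng = np.random.default_rng(42)
n = 300
x = rng.integers(0, 2, size=n).astype(float)
m = 0.6 * x + rng.normal(size=n)
y = 0.5 * m + rng.normal(size=n)

ab_hat = mediated_effect(x, m, y)
lower, upper = percentile_bootstrap(x, m, y)
print(f"ab = {ab_hat:.4f}, 95% percentile bootstrap CI = ({lower:.4f}, {upper:.4f})")
```

Whole cases (x, m, y together) are resampled so that the relations among the three variables are preserved in each bootstrap sample. The mediated effect is judged significant at the .05 level when the interval excludes zero.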
42
Statistical Mediation Tests Summary
Three general types of tests: causal steps, difference in coefficients, and product of coefficients.
Tests differ substantially in Type I error and statistical power.
Requirement of significant X to Y relation and assumed normal distribution of the mediated effect reduces power.
Best tests are based on the distribution of the product and resampling methods.
44
Reasons for Mediation analysis in prevention research.
1. Manipulation check. Did the program change the mediators it was designed to change?
2. Program Improvement. What do the program effects on mediators suggest about program improvements?
3. Measurement Improvement. Is a lack of program effects due to poor measurement?
4. Delayed effects. Will program effects on the dependent variable emerge later?
5. Test the process of mediation. Was the theory-based prediction of mediation correct?
6. Practical implications. Can the program be redesigned to cost less and be more efficient?
45
Interpretation of Mediation Results in prevention research.
• Program effect on mediator but not outcome. The mediator may not be causally related to the outcome. Lack of power or insufficient measurement—explanations for all null effects below.
• Program effect on the outcome but not the mediator. The program did not affect the intended mediator. Other constructs were mediators.
• No program effects on the outcome or the mediator. Program was ineffective, lack of statistical power.
• Program effects on the mediator and the outcome but nonsignificant mediation. The mediator may not be causally related to the outcome.
• Program effects on the mediator and the outcome and significant mediation. Program was effective and there is evidence for the hypothesized mediating mechanism.
46
Causal Inference for Mediation
The Rubin Causal Model (RCM; Rubin, 1974) describes a general way to interpret evidence for causal relations, developed to interpret non-experimental as well as experimental research. It is a solution, not a problem.
Helpful because the RCM clearly displays limits and strengths of models, including mediation.
47
Counterfactual
Counterfactual is central to modern causal inference. The counterfactual refers to conditions
in which a participant could serve, not just the condition that they did serve in.
For example, for a participant in the treatment group, the counterfactual is the same participant
in the control group. For a participant in the control group, the counterfactual is the same
participant in the treatment group.
48
Why do b and c’ not reflect a causal relation?
Because M is not under experimental control, and M is both a dependent and an independent variable, b and c’ do not necessarily represent causal effects.
Need: the relation between M and Y for participants in the treatment group if they were in the control group; the relation between M and Y for control participants if they were instead in the treatment group.
Coefficients b and c’ are not clearly causal effects because M is not randomly assigned, which complicates the counterfactuals for these relations.
49
Causal inference for mediation
-Counterfactual idea helps organize causal inference and highlights ambiguity regarding interpretation of c’ and b coefficients as causal effects.
-In treatment and prevention, the M to Y, b, relation is based on prior research and theory. It is all we consider known.
-Do we need to know the true causal structure to make good decisions based on research? Is a descriptive model sufficient?
-Can we ever know the true causal relation among variables? “Science in no case can demonstrate any inherent necessity in a sequence, nor prove with absolute certainty that it must be repeated” (Pearson p. 113, Grammar of Science, 1957).
50
Improving Mediation Inference using the Rubin Causal Model
Statistical approaches to improving causal inference from a mediation study:
(I). Instrumental Variable Methods, Holland 1988; Sobel 2006.
(II). Principal Stratification and latent classes; Frangakis & Rubin, 2002; Jo, 2006.
Both approaches use aspects of the data such as no direct effect or stratifications of types of
participants, such as compliers, never compliers etc. to improve inference regarding b and c’.
51
Design Approaches to Causal Inference
Statistical mediation analysis answers the following question, “How does a
researcher use measures of the hypothetical intervening process to increase the amount of information
from a research study?” Another question is, “What is the best next study or studies to conduct after
a statistical mediation analysis to further test mediation theory.”
Five general approaches: (1) double randomization, (2) blockage, (3)
enhancement, (4) purification, (5) pattern matching for multiple
variables, subgroups, settings, time, and alternative manipulations (Mark,
1986).
52
(1) Double Randomization
If the problem with the b path is that M is not randomly assigned, then how about randomizing both X in the X to M relation and M in the M to Y relation?
Say X is randomized and there was a significant effect of X on M in Study 1. In Study 2, an experiment is set up so that M is randomized to levels defined by how X changed M in Study 1. If there is a significant relation of M to Y in Study 2, then there is more evidence for mediation.
53
Word et al. (1974) Overview
Study of self-fulfilling prophecy cited in Spencer et al. (2005).
Race (X) predicts quality of interview (M) and quality of interview predicts performance (Y).
Confederate—Person assisting with the experiment. The confederates are used to manipulate factors.
Confederate applicants were used in Study 1 for the X to M relation and confederate interviewers were
used in Study 2 for the M to Y relation.
54
Word et al. (1974)
Study 1. White participants interviewed either Black or White confederate applicants (X). The dependent variable, M, was interview quality, and participants with Black confederate applicants gave poorer quality interviews (M).
Study 2. Confederate interviewers gave applicants either an interview like the one White applicants received in Study 1 or one like Black applicants received in Study 1 (M). This manipulation had a significant effect on applicant performance.
So randomization was used for the X to M relation and the M to Y relation.
55
Prevention Example (MacKinnon et al., 2002)
Norms increase exercise which decreases depression.
Study 1, X to M: Similar to existing prevention studies, participants either receive a social norm manipulation to increase exercise or not (X) and exercise is measured (M).
Study 2, M to Y: Participants are randomly assigned to conduct an amount of exercise (M) obtained in the program group or the control group from Study 1 and depression is measured (Y).
Help. If you know or think of other studies like this please let me know!
56
Double Randomization Problems
Most problems center around the randomization of the mediator so that it corresponds to the change in the mediator in the X to M study.
Study 2 is a mediation model with a manipulation (X) that should change M in the same way as X changed M in Study 1. So Study 2 data is analyzed with statistical mediation analysis.
57
(2) Blockage Designs
The goal of blockage designs is to test a mediation relation with a manipulation that blocks the mediator from operating.
For example, let's say that an exercise program appears to reduce depression by increasing endorphins, the hypothesized mediator. A blockage manipulation would administer a drug that blocks endorphins, so that persons receiving the exercise program would no longer experience reduced depression if endorphins are the mediator.
58
(3) Enhancement Designs
The goal of enhancement designs is to deliver interventions that enhance the effects of a hypothesized mediator.
For example, let's say that an addiction treatment program reduces remission by improving social support. An enhancement design would increase social support even more to demonstrate a larger effect on remission. Social support may be increased by more exposure to a therapist, additional contact with friends and family, etc.
59
(4) Purification Designs
The goal of purification designs is to reduce a manipulation to its critical ingredients.
For example, in drug prevention research, it appears that changes in norms, beliefs about positive consequences of drugs, and intentions to avoid drugs are the most important mediators of drug prevention programs. A purification design would retain only those program components that address these mediators to test whether the purer program changes drug use.
60
(5) Pattern Matching
The goal of pattern matching is to specify patterns of results based on mediation theory. Different types of
studies and information are used to assess whether the pattern of results is consistent with mediation theory.
Multiple variables: a mediation relation is observed for one variable but not another. For example, change in
beliefs about positive consequences of alcohol use is a mediator for alcohol use but not for tobacco use. Changes in beliefs about positive consequences is a
statistical mediator but changes in beliefs about negative consequences is not.
61
More Pattern Matching Examples
Moderators: For example, prevention programs are most effective for persons low on the mediator at baseline.
Setting: An intervention to change norms that then changes behavior should be more successful in a setting where more norm change may occur.
Different Manipulations: A different manipulation that should change the same theoretical mediator should lead to the same results.
62
Summary
Mediation theory is central to many fields and critical for treatment and prevention research.
Statistical mediation analysis of a single study yields important but potentially limited information.
Experimental designs to follow mediation analysis provide more evidence for a mediation relation.
Note that statistical mediation analysis of data from experimental designs may also yield additional information.
63
Future Directions
Causal inference for mediation will continue to be an active area of research.
Programs of research are needed to investigate mediators. Must consider other evidence including clinical judgment, theory, case studies, and replication studies.
Statistical mediation analysis for some methods is still needed, e.g. survival analysis, longitudinal data, generalized linear model.
Need more applications of mediation analysis.
64
Hypothesized Effects of a Presentation on Mediation Analysis
[Figure: path diagram. Boxes: CAPS Talk on Mediation Analysis; Interest in Mediation Methods; Norms Regarding Reporting Results of Studies; Comprehension of Reasons for Mediation Analysis; Beliefs About the Importance of Theory Testing; # Studies with Mediation Analysis.]
65
THE END