bayesian bias correction: critically evaluating sets … bias correction: critically evaluating sets...

43
Bayesian Bias Correction: Critically evaluating sets of studies in the presence of publication bias Alexander Etz UC Irvine Alexander Etz (UC Irvine) Bayesian Bias Correction 1 / 43

Upload: ledang

Post on 22-Mar-2018

233 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Bayesian Bias Correction: Critically evaluating sets ofstudies in the presence of publication bias

Alexander Etz

UC Irvine

Alexander Etz (UC Irvine) Bayesian Bias Correction 1 / 43

Page 2: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Introduction

When we read a published paper, we have two broadly related goals

Goal 1: Evaluate the evidence in favor of a claim

Goal 2: Estimate how big the reported effect could reasonably be

Alexander Etz (UC Irvine) Bayesian Bias Correction 2 / 43

Page 3: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Introduction

Solution (in principle): Compute a Bayes factor (goal 1), or computethe posterior distribution (goal 2)

Alexander Etz (UC Irvine) Bayesian Bias Correction 3 / 43

Page 4: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Challenge

Challenge: Literature is biased

You only see a subset of all studies performed, because only the studiesthat “worked” make it past the gatekeepers (editors, reviewers)

Alexander Etz (UC Irvine) Bayesian Bias Correction 4 / 43

Page 5: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Publication bias

Statistical significance filter: Only studies with p < .05 get published

Censored data, based on the stochastic outcome of the study

Alexander Etz (UC Irvine) Bayesian Bias Correction 5 / 43

Page 6: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Publication bias

Publication bias ruins our ability to evaluate the claims in a paper

Alexander Etz (UC Irvine) Bayesian Bias Correction 6 / 43

Page 7: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Example

Imagine we estimate a group mean and perform a one-sample test

We find observed mean of 17, with a standard error of 8

Alexander Etz (UC Irvine) Bayesian Bias Correction 7 / 43

Page 8: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Example

If the null hypothesis is true, this would be what we predict:

Alexander Etz (UC Irvine) Bayesian Bias Correction 8 / 43

Page 9: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Example

If the effect is small and positive, this is what we predict:

Alexander Etz (UC Irvine) Bayesian Bias Correction 9 / 43

Page 10: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Implications

What gets published if we only acknowledge significant results? Onlystudies with observed mean greater than 16

If there is no effect, any published result is a type-1 error (i.e., falsepositive)

If there is non-null effect, our effect size estimate will certainly beinflated

Publication bias is clearly a problem for evaluating single studies, andits effects can only compound if we meta-analyze many studies at once

Alexander Etz (UC Irvine) Bayesian Bias Correction 10 / 43

Page 11: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Modeling the bias

We are modelers, so naturally we try to model this publication filteringprocess

Try to patch up our inferences using Bayesian bias correction

Alexander Etz (UC Irvine) Bayesian Bias Correction 11 / 43

Page 12: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Modeling the bias

Key idea: Modify the sampling distribution to more accurately describethe results that will be making it to the literature

Alexander Etz (UC Irvine) Bayesian Bias Correction 12 / 43

Page 13: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Modeling the bias

Guan and Vandekerckhove (2016) suggested the following plausiblecensoring processes...

(Note added: The x-axis is the p-value and y-axis is relative probability ofpublication. The x-axes are truncated at .5 merely for aesthetics)

Alexander Etz (UC Irvine) Bayesian Bias Correction 13 / 43

Page 14: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Modeling the bias

... which make the following predictions about the t-values that make it tothe literature

−5 0 50

0.2

0.4

No bias

−5 0 50

0.2

0.4

−5 0 50

0.5

1

Extreme bias

−5 0 50

0.5

1

1.5

−5 0 50

0.2

0.4

Constant bias

−5 0 50

0.5

1

−5 0 50

0.2

0.4

0.6

0.8

Exponential bias

−5 0 50

0.5

1

Alexander Etz (UC Irvine) Bayesian Bias Correction 14 / 43

Page 15: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Modeling the bias

(Note added: The first model corresponds to the typical central andnon-central t distributions, where all results are published. The second saysn.s. results are never published, so all mass from the middle of thedistribution is pushed to the tails (hence the name “extreme”). The thirdsays all n.s. results are published at some rate π that does not depend onthe p-value (hence the name, “constant”). The fourth says n.s. results withp-values close to .05 have higher publication rate than higher p-values.)

Alexander Etz (UC Irvine) Bayesian Bias Correction 15 / 43

Page 16: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

Let’s consider a single study that finds t = 2.2 with n = 25.

Start with a prior on δ of p(δ|H1) = N (0, 1)

Goal 1: Calculate Bayes factor in favor of nonzero effect size

Goal 2: Compute posterior distribution for δ, p(δ|data,H1)

Alexander Etz (UC Irvine) Bayesian Bias Correction 16 / 43

Page 17: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

We can compute the posterior distribution1 for each biasing model:

1Note: Displayed probs do not sum to 1 since we do not show the probabilities of therespective null models. A reference line is added at δ = .5 to aid comparisons.

Alexander Etz (UC Irvine) Bayesian Bias Correction 17 / 43

Page 18: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

We can synthesize the posteriors according to their posterior modelprobabilities using Bayesian model averaging (the ratio of the heights of theblack dots gives the model-averaged Bayes factor)

Alexander Etz (UC Irvine) Bayesian Bias Correction 18 / 43

Page 19: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

Compare to results from the naive posterior, assuming no bias (blue)

Alexander Etz (UC Irvine) Bayesian Bias Correction 19 / 43

Page 20: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

To summarize, if we observe t = 2.2 with n = 25:

The posterior distribution is shifted to smaller values

The mitigated Bayes factor is BF01 = 1.4, where the original Bayesfactor BF10 = 2.5 slightly favored the alternative. But neither are tooconclusive

Alexander Etz (UC Irvine) Bayesian Bias Correction 20 / 43

Page 21: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

Consider: The authors publish a followup study, with two new (direct)replications of the effect: In total we have t = (2.2, 2.3, 2.1) andn = (25, 20, 35)

Now we want to do a fixed-effects meta-analysis to synthesize thethree sets of observations

Alexander Etz (UC Irvine) Bayesian Bias Correction 21 / 43

Page 22: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

Alexander Etz (UC Irvine) Bayesian Bias Correction 22 / 43

Page 23: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

Alexander Etz (UC Irvine) Bayesian Bias Correction 23 / 43

Page 24: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Estimating and testing

To summarize, for t = (2.2, 2.3, 2.1) and n = (25, 20, 35):

The cumulative posterior distribution is largely shifted to smaller values

The mitigated Bayes factor is BF01 = 1.2, again indecisive, whereasthe original Bayes factor BF10 = 17 largely favored the alternative

Alexander Etz (UC Irvine) Bayesian Bias Correction 24 / 43

Page 25: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Extending to random effects

Previously, the method could only apply to single studies or fixedeffects meta-analysis

We have extended it to apply to random effects meta-analysis, byintroducing study-specific effect sizes drawn from a higher levelpopulation

Allows for there to be heterogeneity in true effects across studies (e.g.,conceptual replications with new measures, replications on differentpopulations, etc.)

We now demonstrate this extension on some recently published data

Alexander Etz (UC Irvine) Bayesian Bias Correction 25 / 43

Page 26: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The studies

Alexander Etz (UC Irvine) Bayesian Bias Correction 26 / 43

Page 27: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The studies

“Recently, it has been claimed that risk-taking and spending behaviormay be triggered in part by evolutionarily driven motives”

“Sexual cues in advertising product categories such as casinos, fashion,jewelry, cosmetic surgery, cars, cigarettes, and alcohol, and theevidence for their effectiveness, suggests that controlling such primes inreal-world settings might constitute a valuable intervention.”

The authors conduct a meta-analysis of the published literature, andsubsequently perform various replications of selected studies

Alexander Etz (UC Irvine) Bayesian Bias Correction 27 / 43

Page 28: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The studies

Alexander Etz (UC Irvine) Bayesian Bias Correction 28 / 43

Page 29: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The studies

Various designs, manipulations, outcomes, all call for a random effectsmeta-analysis that allows each study its own effect size:

Can evaluate each study’s individual effect size

Can evaluate overall population mean

Can evaluate variability across studies

Alexander Etz (UC Irvine) Bayesian Bias Correction 29 / 43

Page 30: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The published literature

Alexander Etz (UC Irvine) Bayesian Bias Correction 30 / 43

Page 31: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The replications

Alexander Etz (UC Irvine) Bayesian Bias Correction 31 / 43

Page 32: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Synthesis?

How can we synthesize these two sets of results?

Option: Throw them all together in a joint meta-analysis?

What about publication bias?

Option: Throw out the published literature, and only look at thereplications?

Isn’t this wasteful? Presumably there is at least some signal in therewe can extract

We attempt to jointly analyze all studies using our Bayesian bias correctionmethod

Alexander Etz (UC Irvine) Bayesian Bias Correction 32 / 43

Page 33: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The model

Random effects meta-analysis can be instantiated as a Bayesian hierarchicalmodel, but we would like to modify it to account for bias in the publishedliterature

Alexander Etz (UC Irvine) Bayesian Bias Correction 33 / 43

Page 34: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The model

We can estimate the effect sizes using model 3, of which 1 and 2 are specialcases...

Alexander Etz (UC Irvine) Bayesian Bias Correction 34 / 43

Page 35: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The model

...with an added hierarchical structure:

µ ∼ N (0, 1)

σ ∼ G(4, 4)

π ∼ B(5, 95)

δi | µ, σ ∼ N (µ, σ2)

ti | δi ,Ui = 0 ∼ biased-tν,π(ncp = δi√νi/2)

ti | δi ,Ui = 1 ∼ tν(ncp = δi√νi/2)

where δi is the study-specific effect size for study i , Ui indicates whetherstudy i has not been through a publication filter. These priors, like always,are up for debate, but we think they are reasonable

Alexander Etz (UC Irvine) Bayesian Bias Correction 35 / 43

Page 36: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

The data, sorted by ascending ES

Alexander Etz (UC Irvine) Bayesian Bias Correction 36 / 43

Page 37: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Naive hierarchical model

Alexander Etz (UC Irvine) Bayesian Bias Correction 37 / 43

Page 38: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Debiasing hierarchical model

Alexander Etz (UC Irvine) Bayesian Bias Correction 38 / 43

Page 39: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Comparing the model estimates

Alexander Etz (UC Irvine) Bayesian Bias Correction 39 / 43

Page 40: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Summary

In both models we see the shrinkage typical of Bayesian methods

Large effects are pulled down, small effects are pulled up, both“shrinking” toward the overall mean

The main difference between the naive hierarchical model and ourdebiasing model: Extra shrinkage for published effects!

Alexander Etz (UC Irvine) Bayesian Bias Correction 40 / 43

Page 41: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

More plots

We summarize our results as follows, showing our best estimates of themean and sd for the distribution from which we draw each study’s δ

Alexander Etz (UC Irvine) Bayesian Bias Correction 41 / 43

Page 42: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Model predictions

We can generate predictions from the models (i.e., posterior predictives) forwhat we should expect for a new romance priming study with N=50

For the naive model, the predictive has a mean of .43 and sd of .42. Thebias correction model predictive has mean of .30 and sd of .38.

Alexander Etz (UC Irvine) Bayesian Bias Correction 42 / 43

Page 43: Bayesian Bias Correction: Critically evaluating sets … Bias Correction: Critically evaluating sets of ... Censored data, ... Bayesian Bias Correction: Critically evaluating sets

Acknowledgements

Funding for this project is provided by the NSF Graduate ResearchFellowship Program, as well as a grant from the NSF’s Methods,Measurements, and Statistics panel.

Alexander Etz (UC Irvine) Bayesian Bias Correction 43 / 43