analysis of longitudinal imaging data with ols & sandwich ... · bryan guillaume analysis of...

30
Introduction The Sandwich Estimator method An adjusted Sandwich Estimator method Remarks and summary Analysis of longitudinal imaging data with OLS & Sandwich Estimator standard errors Bryan Guillaume GSK / University of Liège / University of Warwick FMRIB Centre - 07 Mar 2012 Supervisors: Thomas Nichols (Warwick University) and Christophe Phillips (Liège University) Bryan Guillaume Analysis of longitudinal imaging data

Upload: others

Post on 11-Aug-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Analysis of longitudinal imaging data with OLS& Sandwich Estimator standard errors

Bryan Guillaume

GSK / University of Liège / University of Warwick

FMRIB Centre - 07 Mar 2012

Supervisors: Thomas Nichols (Warwick University) andChristophe Phillips (Liège University)

Bryan Guillaume Analysis of longitudinal imaging data

Page 2: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Outline

1 Introduction

2 The Sandwich Estimator method

3 An adjusted Sandwich Estimator method

4 Remarks and summary

Bryan Guillaume Analysis of longitudinal imaging data

Page 3: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Example of longitudinal studies in neuroimagingExample 1

Effect of drugs (morphine andalcohol) versus placebo overtime on Resting State Networksin the brain(Khalili-Mahani et al, 2011)

12 subjects21 scans/subject!!!Balanced design

Study design:

Bryan Guillaume Analysis of longitudinal imaging data

Page 4: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Example of longitudinal studies in neuroimagingExample 2

fMRI study of longitudinalchanges in a population ofadolescents at risk for alcoholabuse(Heitzeg et al, 2010)

86 subjects2 groups1, 2, 3 or 4 scans/subjects(missing data)Total of 224 scansVery unbalanced design(no common time pointsfor scans)

●●

●●

●●

●●

0 50 100 150 200

1618

2022

2426

Observation index

Age

Bryan Guillaume Analysis of longitudinal imaging data

Page 5: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Why is it challenging to model longitudinal data inneuroimaging ?

Longitudinal modeling is a standard biostatistical problem andstandard solutions exist:

Gold standard: Linear Mixed Effects (LME) modelIterative method→ generally slow and may fail to converge

E.g., 12 subjects, 8 visits, Toeplitz, LME with unstructuredintra-visit correlation fails to converge 95 % of the time.E.g., 12 subjects, 8 visits, CS, LME with random int. andrandom slope fails to converge 2 % of the time.

LME model with a random intercept per subjectMay be slow (iterative method) and only valid withCompound Symmetric (CS) intra-visit correlation structure

Naive-OLS (N-OLS) model which include subject indicatorvariables as covariates

Fast, but only valid with CS intra-visit correlation structure

Bryan Guillaume Analysis of longitudinal imaging data

Page 6: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Why is it challenging to model longitudinal data inneuroimaging ?

Longitudinal modeling is a standard biostatistical problem andstandard solutions exist:

Gold standard: Linear Mixed Effects (LME) modelIterative method→ generally slow and may fail to converge

E.g., 12 subjects, 8 visits, Toeplitz, LME with unstructuredintra-visit correlation fails to converge 95 % of the time.E.g., 12 subjects, 8 visits, CS, LME with random int. andrandom slope fails to converge 2 % of the time.

LME model with a random intercept per subjectMay be slow (iterative method) and only valid withCompound Symmetric (CS) intra-visit correlation structure

Naive-OLS (N-OLS) model which include subject indicatorvariables as covariates

Fast, but only valid with CS intra-visit correlation structure

Bryan Guillaume Analysis of longitudinal imaging data

Page 7: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Why is it challenging to model longitudinal data inneuroimaging ?

Longitudinal modeling is a standard biostatistical problem andstandard solutions exist:

Gold standard: Linear Mixed Effects (LME) modelIterative method→ generally slow and may fail to converge

E.g., 12 subjects, 8 visits, Toeplitz, LME with unstructuredintra-visit correlation fails to converge 95 % of the time.E.g., 12 subjects, 8 visits, CS, LME with random int. andrandom slope fails to converge 2 % of the time.

LME model with a random intercept per subjectMay be slow (iterative method) and only valid withCompound Symmetric (CS) intra-visit correlation structure

Naive-OLS (N-OLS) model which include subject indicatorvariables as covariates

Fast, but only valid with CS intra-visit correlation structure

Bryan Guillaume Analysis of longitudinal imaging data

Page 8: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Why is it challenging to model longitudinal data inneuroimaging ?

Longitudinal modeling is a standard biostatistical problem andstandard solutions exist:

Gold standard: Linear Mixed Effects (LME) modelIterative method→ generally slow and may fail to converge

E.g., 12 subjects, 8 visits, Toeplitz, LME with unstructuredintra-visit correlation fails to converge 95 % of the time.E.g., 12 subjects, 8 visits, CS, LME with random int. andrandom slope fails to converge 2 % of the time.

LME model with a random intercept per subjectMay be slow (iterative method) and only valid withCompound Symmetric (CS) intra-visit correlation structure

Naive-OLS (N-OLS) model which include subject indicatorvariables as covariates

Fast, but only valid with CS intra-visit correlation structure

Bryan Guillaume Analysis of longitudinal imaging data

Page 9: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Outline

1 Introduction

2 The Sandwich Estimator method

3 An adjusted Sandwich Estimator method

4 Remarks and summary

Bryan Guillaume Analysis of longitudinal imaging data

Page 10: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

The Sandwich Estimator (SwE) method

Use of a simple OLS model (without subject indicatorvariables)The fixed effects parameters β are estimated by

β̂OLS =

(M∑

i=1

X ′i Xi

)−1 M∑i=1

X ′i yi

The fixed effects parameters covariance var(β̂OLS) areestimated by

SwE =

(M∑

i=1

X ′i Xi

)−1

︸ ︷︷ ︸Bread

(M∑

i=1

X ′i V̂iXi

)︸ ︷︷ ︸

Meat

(M∑

i=1

X ′i Xi

)−1

︸ ︷︷ ︸Bread

Bryan Guillaume Analysis of longitudinal imaging data

Page 11: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Property of the Sandwich Estimator (SwE)

SwE =

(M∑

i=1

X ′i Xi

)−1( M∑i=1

X ′i V̂iXi

)(M∑

i=1

X ′i Xi

)−1

If Vi are consistently estimated, the SwE tends asymptotically(Large samples assumption) towards the true variancevar(β̂OLS). (Eicker, 1963; Eicker, 1967; Huber, 1967; White,1980)

Bryan Guillaume Analysis of longitudinal imaging data

Page 12: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

The Heterogeneous HC0 SwE

In practice, Vi is generally estimated from the residualsri = yi − Xi β̂ by

V̂i = ri r′

i

and the SwE becomes

Het. HC0 SwE =

(M∑

i=1

X ′i Xi

)−1( M∑i=1

X ′i ri r′

i Xi

)(M∑

i=1

X ′i Xi

)−1

Bryan Guillaume Analysis of longitudinal imaging data

Page 13: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulations: setup

Monte Carlo Gaussian null simulation (10,000 realizations)For each realization,

1 Generation of longitudinal Gaussian null data (no effect)with a CS or a Toeplitz intra-visit correlation structure:

Compound Symmetric1 0.8 0.8 0.8 0.8

0.8 1 0.8 0.8 0.80.8 0.8 1 0.8 0.80.8 0.8 0.8 1 0.80.8 0.8 0.8 0.8 1

Toeplitz1 0.8 0.6 0.4 0.2

0.8 1 0.8 0.6 0.40.6 0.8 1 0.8 0.60.4 0.6 0.8 1 0.80.2 0.4 0.6 0.8 1

2 Statistical test (F-test at α) on the parameters of interest

using each different methods (N-OLS, LME and SWE) andrecording if the method detects a (False Positive) effect

For each method, rel. FPR= Number of False Positive10,000α

Bryan Guillaume Analysis of longitudinal imaging data

Page 14: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulations: LME vs N-OLS vs Het. HC0 SwE

●● ●

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

010

020

030

040

050

060

070

0

● Het. HC0 SwEN−OLSLME

Linear effect of visitsGroup 2 versus group 1Compound symmetry8 vis., F(1,N−p) at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

●● ●

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

010

020

030

040

050

060

070

0

● Het. HC0 SwEN−OLSLME

Linear effect of visitsGroup 2 versus group 1

Toeplitz8 vis., F(1,N−p) at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

Bryan Guillaume Analysis of longitudinal imaging data

Page 15: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Outline

1 Introduction

2 The Sandwich Estimator method

3 An adjusted Sandwich Estimator method

4 Remarks and summary

Bryan Guillaume Analysis of longitudinal imaging data

Page 16: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Bias adjustments: the Het. HC2 SwE

In an OLS model, we have

(I − H)var(y)(I − H) = var(r)

where H = X (X′X )−1X

Under independent homoscedastic errors,

(I − H)σ2 = var(r)

(1− hik )σ2 = var(rik )

σ2 = var(

rik√1− hik

)This suggests to estimate Vi by

V̂i = r∗i r∗′

i where r∗ik =rik√

1− hik

Bryan Guillaume Analysis of longitudinal imaging data

Page 17: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Bias adjustments: the Het. HC2 SwE

Using in the SwE

V̂i = r∗i r∗′

i where r∗ik =rik√

1− hik

We obtain

Het. HC2 SwE =

(M∑

i=1

X′

i Xi

)−1( M∑i=1

X′

i r∗i r∗′

i Xi

)(M∑

i=1

X′

i Xi

)−1

Bryan Guillaume Analysis of longitudinal imaging data

Page 18: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Homogeneous SwE

In the standard SwE, each Vi is normally estimated from onlythe residuals of subject i . It is reasonable to assume a commoncovariance matrix V0 for all the subjects and then, we have

V̂0kk ′ =1

Nkk ′

Nkk′∑i=1

rik rik ′

V̂0kk ′ : element of V̂0 corresponding to the visits k and k ′

Nkk ′ : number of subjects with both visits k and k ′

rik : residual corresponding to subject i and visit krik ′ : residual corresponding to subject i and visit k ′

V̂i = f (V̂0)

Bryan Guillaume Analysis of longitudinal imaging data

Page 19: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Null distribution of the test statistics with the SwE

H0 : Lβ̂ = 0,H1 : Lβ̂ 6= 0L: contrast matrix of rank qUsing multivariate statistics theory and assuming abalanced design, we can derive the test statistic

M − pB − q + 1(M − pB)q

(Lβ̂)′(LSwEL′)−1(Lβ̂) ∼ F (q,M − pB − q + 1)

q=1, the test becomes

(Lβ̂)′(LSwEL′)−1(Lβ̂) ∼ F (1,M − pB) 6= F (1,N − p)

Bryan Guillaume Analysis of longitudinal imaging data

Page 20: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulations: LME vs N-OLS vs unadjusted SwE

●● ●

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

010

020

030

040

050

060

070

0

● Het. HC0 SwEN−OLSLME

Linear effect of visitsGroup 2 versus group 1Compound symmetry8 vis., F(1,N−p) at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

●● ●

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

010

020

030

040

050

060

070

0

● Het. HC0 SwEN−OLSLME

Linear effect of visitsGroup 2 versus group 1

Toeplitz8 vis., F(1,N−p) at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

Bryan Guillaume Analysis of longitudinal imaging data

Page 21: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulations: unadjusted SwE vs adjusted SwE

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

8010

012

014

016

018

020

0

● Het. HC0 SwE with F(1,N−p)Hom. HC2 SwE with F(1,M−2)

Linear effect of visitsGroup 2 versus group 1Compound symmetry8 vis., F−test at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

●●

Number of subjects

Rel

ativ

e F

PR

(%

)

12 50 100 200

8010

012

014

016

018

020

0

● Het. HC0 SwE with F(1,N−p)Hom. HC2 SwE with F(1,M−2)

Linear effect of visitsGroup 2 versus group 1

Toeplitz8 vis., F−test at 0.05

Number of subjects

Rel

ativ

e F

PR

(%

)

Bryan Guillaume Analysis of longitudinal imaging data

Page 22: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulation with real designExample 2

fMRI study of longitudinalchanges in a population ofadolescents at risk for alcoholabuse

86 subjects2 groups1, 2, 3 or 4 scans/subjects(missing data)Total of 224 scansVery unbalanced design(no common time pointsfor scans)

●●

●●

●●

●●

0 50 100 150 200

1618

2022

2426

Observation index

Age

Bryan Guillaume Analysis of longitudinal imaging data

Page 23: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulation with real designExample 2

●●

0 50 100 150 200

01

23

45

67

Scans index

Rel

ativ

e ag

e

Bryan Guillaume Analysis of longitudinal imaging data

Page 24: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Simulation with real designExample 2

●●

0 50 100 150 200

01

23

45

67

Scans index

rela

tive

Age

Scans index

rela

tive

Age

Bryan Guillaume Analysis of longitudinal imaging data

Page 25: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Real design

Gr1Age.LGr1

Age.QGr1 Gr2 vs Gr1

Age.LGr2 vs Gr1

Age.QGr2 vs Gr1

Het. HC0 SwEHom. HC2 SwEN−OLSLME

Rel

ativ

e F

PR

(%

)

1092 % 1079 %

5010

015

020

025

030

035

0

Real designCompound Symmetry

F(1,M−2) at 0.05 for Hom. HC2OTW F(1,N−p) at 0.05

Bryan Guillaume Analysis of longitudinal imaging data

Page 26: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Real design

Gr1Age.LGr1

Age.QGr1 Gr2 vs Gr1

Age.LGr2 vs Gr1

Age.QGr2 vs Gr1

Het. HC0 SwEHom. HC2 SwEN−OLSLME

Rel

ativ

e F

PR

(%

)

1074 % 954 %

5010

015

020

025

030

035

0

Real designToeplitz

F(1,M−2) at 0.05 for Hom. HC2OTW F(1,N−p) at 0.05

Bryan Guillaume Analysis of longitudinal imaging data

Page 27: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Outline

1 Introduction

2 The Sandwich Estimator method

3 An adjusted Sandwich Estimator method

4 Remarks and summary

Bryan Guillaume Analysis of longitudinal imaging data

Page 28: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Remarks about the SwE method

Power of the SwE method generally lower than the powerof the LME method

Power loss not significant with a high number of subject(e.g., 86 subjects)Power loss may be significant with a low number of subjectand a low significance level α

Solution: spatial regularization of the SwE

Test statistic with an unbalanced design and a low numberof subject

Estimation of the effective degrees of freedom of the testneeded

Bryan Guillaume Analysis of longitudinal imaging data

Page 29: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Summary

Longitudinal standard methods not really appropriate toneuroimaging data:

Convergence issues with LMEN-OLS & LME with random intercepts: issues when CSdoes not hold

The SwE methodAccurate in a large range of settingsEasy to specifyNo iteration needed

Quite fastNo convergence issues

Can accommodate pure between covariatesBut, careful in small samples:

Adjustment essentialIf low significance level, spatial regul. needed for powerIf unbalanced design, effective dof estimation needed

Bryan Guillaume Analysis of longitudinal imaging data

Page 30: Analysis of longitudinal imaging data with OLS & Sandwich ... · Bryan Guillaume Analysis of longitudinal imaging data. Introduction The Sandwich Estimator method An adjusted Sandwich

IntroductionThe Sandwich Estimator method

An adjusted Sandwich Estimator methodRemarks and summary

Thanks for your attention!

Bryan Guillaume Analysis of longitudinal imaging data