raymond j. carroll department of statistics department of biostatistics & epidemiology...

26
1 N ONPARAMETRIC R EGRESSION AND M EASUREMENT E RROR Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology Faculty of Nutrition Center for Environmental and Rural Health Texas A&M University David Ruppert School of Operations Research and Inductrial Relations Cornell University AND CHECK OUT: http://stat.tamu.edu/ carroll (most papers available, as well as some matlab software) http://stat.tamu.edu/B3NC (Training program in Biol- ogy & Bioinformatics)

Upload: others

Post on 10-Jul-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

1

NONPARAMETRIC REGRESSION AND

MEASUREMENT ERROR

Raymond J. Carroll

Department of Statistics

Department of Biostatistics & Epidemiology

Faculty of Nutrition

Center for Environmental and Rural Health

Texas A&M University

David Ruppert

School of Operations Research and Inductrial Relations

Cornell University

AND CHECK OUT:

http://stat.tamu.edu/ � carroll (most papers available, as

well as some matlab software)

http://stat.tamu.edu/B3NC (Training program in Biol-

ogy & Bioinformatics)

Page 2: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

2

OUTLINE

� Model

� Consistent estimation and rates of convergence

� Functional Method: SIMEX

� Structural methods

� Example

� Comments

Page 3: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

3

THE BASIC MODEL� Nonparametric regression:

� � � ��� ��� ���

�� �������� �����

unspecified

� Measurement error: observe

� � � � ���� �Normal

�� ��� �� � (1)

� Assume���� known for today

� Model (1) is more general than it looks

� Since� �����

is unspecified, you need only need that (1)

holds after a transformation

� �! �� � � "exp # log

�� ��%$'&� � ( # log

�� ��)$+*

� See work by Nusser, et al. and Eckert & Carroll for

such transformations

Page 4: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

4

THE BASIC MODEL� � � ��� � � � �

� )� � �� � � � ���� �

Normal� )���� �

� The issue broadly breaks down into what you must

assume about the�

’s

� Functional: assume nothing about the�

’s

� Structural: assume a (possibly flexible) parametric

distribution for the�

’s

Page 5: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

5

DECONVOLUTION� � � ��� � � � �

� )��� �� � � � ���� �

Normal� )� �� �

� Estimation of the regression function� �����

without

knowing the density of�

� Closely related to density deconvolution: estimate

density of�

from the�

’s alone

� Density deconvolution (with Stefanski & Hall, 1988)

and nonparametric regression (Fan & Truong, 1993)

are nearly impossible

� No globally consistent estimator of� �� ��

can have a

rate of convergence faster than log� � �

� log��� � � ��� ���

� A sample of size 16 in parametric regression has

approximately the same information as a sample

of size 10,000,000 for consistent nonparametric

regression

Page 6: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

6

DECONVOLUTION

� The deconvolution results mean that if you insist on

being consistent nonparametrically

� and if you insist on no assumptions about the den-

sity of the latent�

� The you cannot get a decent estimator

� Fan & Truong constructed an ”estimator” using our

deconvoluting kernel

� It’s awful

� Bandwidth selection essentially impossible

� Even cheating and selecting the bandwidth that

minimizes MSE, the method has MSE’s an order

of magnitude larger than our other methods� Theory really means something here

Page 7: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

7

WHAT CAN WE DO??� � � ��� � � � � � � �

� One method is to allow a little bit of inconsistency

� Still make no assumptions about the density of�

� But construct an estimator which has less bias than

the naive estimator that ignores error

� The naive estimator has a bias of order��� �!���� �

� The SIMEX estimator of Cook and Stefanski has a

bias of order� � �!���� �

� SIMEX is general purpose: kernels, splines, etc.

Page 8: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

Measurement Error Variance

Coe

ffici

ent

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.0

0.2

0.4

0.6

0.8

1.0

1.2

Naive Estimate

Illustration of SIMEX

Page 9: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

Measurement Error Variance

Coe

ffici

ent

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.0

0.2

0.4

0.6

0.8

1.0

1.2

Naive Estimate

Illustration of SIMEX

Page 10: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

Measurement Error Variance

Coe

ffici

ent

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.0

0.2

0.4

0.6

0.8

1.0

1.2

Naive Estimate

Illustration of SIMEX

Page 11: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

Measurement Error Variance

Coe

ffici

ent

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.0

0.2

0.4

0.6

0.8

1.0

1.2

Naive Estimate

SIMEX Estimate

Illustration of SIMEX

Page 12: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

12

SIMEX� We have derived the asymptotics for SIMEX in kernel

regression

� Bias is� � � � �� �

� Rate of convergence is standard � � � ���� Approximately consistent estimation with stan-

dard rates

� Deconvolution is globally consistent with slow

rates

� SIMEX has vastly beaten deconvolution in all our

simulations

� Asymptotic variance is the same as if measure-

ment error was ignored, but multipled by a factor

depending only on the extrapolant function� linear–quadratic–cubic ratios are 1–9–52

Page 13: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

13

SPLINES� � � �!� � � ,� � � � �

� Write� �����

in terms of basis functions:

� �� �� � � ������ � ��� �! ��

� Smoothing splines� P–splines (Eiler & Marx, with Ruppert, like Splus),

e.g.,

��� � � � � � � � � � ����� � � �� � � � � ��� P–splines and smoothing splines are ”typically” equiv-

alent, even for � � ���, at least in our context

� We place knots at quantiles of the�

–distribution

Page 14: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

14

MEAN–BASED STRUCTURALMETHODS� �� � � � ������ � �� � �� ��

� It follows that� �!� � � � � � ������ � � � # � � ��� ��� � $ � � ���� � � ��� � � � �

� If�

is parametric, and�

is normal, then" � � � &

is

known and � � ����� is readily calculated

� Penalization: for some matrix � , minimize

��� ��� # � � � ������ � �� � � � �%$ ��� � � � � �� Ridge–type estimator:

�must be estimated

� Shrinkage of � � to� �� � � nearly singular

� � � yields enormously variable estimates

� We developed a “minimize estimated MSE” shrink-

age method

Page 15: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

15

SPLINES: FULLY STRUCTURAL

� If we specify a distribution for�

and, then we have

a fully specified parametric model using the stan-

dard mixed model formulation:

� � � � � � � � � � � � � ���� � � �� � � � � �� � � � �

Normal� )� �� � for

� � �

�Normal

� )� �� �� �

Normal����� )� �� �

� �Normal

��� )� �� �

� Good starting values are easy: previous method or

SIMEX

� A penalty is incorporated naturally through the vari-

ance component� ��

� Computed via MCMC: program on my web site

Page 16: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

16

COMMENTS ON FULLYSTRUCTURAL

� We have investigated allowing�

to have a flexible

distribution, i.e., mixture of� �

normals

� More complex computationally

� Useful for severely skew�

, not much gain otherwise

(or loss!)

� Still, structural spline is remarkably robust to de-

viations from normality for X

Page 17: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

17

SPECIAL CASE: SIMULATION� � � �

with

� �! �� � sin��� �� � �

� � � � # sign�� �� � � $

��

standard normal, 30% of observed variability in�

due to error

� Frequentist MSE

� Naive smoothing spline:��* �

� SIMEX smoothing spline: � *��� Mean–based P–spline, � � � �

:� *�

� Bayes P–spline, � � � :� * �

� Note: Bayes method is a better frequentist estima-

tor

� Here are some examples for sin� � ��

with different

amounts of measurement error

Page 18: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

18

SIMEX

� As I will show via simulated data, the naive estimate

in the presence of measurement error loses all fea-

tures

� That is, with sufficiently large measurement error, the�and

�data look like a polynomial

� The SIMEX method can be expected to perform fairly

poorly here.

� SIMEX adds error to a problem that has already lost

its features

� Each SIMEX step is going to fit a polynomial

� Thus, SIMEX with sufficiently large measurement er-

ror is typically going to be a corrected polynomial fit.

Page 19: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

−2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5−2.5

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

2.5sin(2x), n=250, attenuation=0.75

TrueBayesNaiveSIMEX

Figure 1: Note how the naive fit captures some of the features, so SIMEX does much better

Page 20: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

−2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5−2.5

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

2.5sin(2x), n=250, attenuation=0.50

TrueBayesNaiveSIMEX

Figure 2: Note how the naive fit captures NONE of the features, so SIMEX does NO better

Page 21: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

−2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5−2.5

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

2.5sin(2x), n=250, attenuation=0.33

TrueBayesNaiveSIMEX

Page 22: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

22

WHY DOES THE STRUCTURALSPLINE WORK SO WELL?

� Take� �! �� �

sin� � ��

,�

� Normal� � � �

,� � � � �

,� �� � �* ���.

� Estimate�

� Estimate #1:� �!� � � �

� Estimate #2: Mean of�

over MCMC samples

� Ratio of MSE for sin� ���� ��� �

� Structural spline with MCMC gives a much better es-

timate of�

� Structural Spline with MCMC uses the assumed dis-

tribution of�

given�

in a strong way

� Suggests potential for model non–robustness

Page 23: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

−2 −1 0 1 2 3 40

10

20

30

40

50

60

70Skew−normal X distribution

Figure 3: This is the distribution for�

and � , which are both assumed to be normal

Page 24: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

−1.5 −1 −0.5 0 0.5 1 1.5 2 2.5−2.5

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

2.5sin(2x), n=250, attenuation=0.33, skew−normal X,Y

TrueBayesNaiveSIMEX

Figure 4: Here you assume that�

and � are normal, when they are skew–normal. Note how the

Bayes fit is still much better than naive or SIMEX, but clearly worse than the normal�

and �

simulation

Page 25: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

25

SUMMARY OF METHODS� Naive: great estimate of the wrong thing

� Deconvolution: lousy estimate of the right thing

� SIMEX: approximately consistent, no assumptions,

cannot cope with large measurement error

� Mean function P–splines

� Parametric model for [X]

� Flexible mean function� Ad hoc smoothing

� Structural splines

� Parametric model for�

given�

, [X]

� MCMC incorporates penalty naturally

� Results great even when�

� � � � � �� Few results yet when Y given X is mismodeled

Page 26: Raymond J. Carroll Department of Statistics Department of Biostatistics & Epidemiology …carroll/ftp/recent.talks.directory/npreg_eiv_2001.pdf · 1 NONPARAMETRIC REGRESSION AND MEASUREMENT

26

CONCLUSIONS

� The more (correct!) assumptions you make, the more

efficient the estimator

� Heirarchy

� Global consistency, [X],�

given�

unspecified

� Approximate consistentcy, [X],�

given�

unspec-

ified

� Flexible means, [X] specified� Bayes, [X],

�given

�specified

� It is remarkable how well you can extract the fea-

tures of the problem in the presence of measurement

error