robust variable selection in semiparametric mean

Upload: joanne-wong

Post on 05-Oct-2015

11 views

Category:

Documents


0 download

DESCRIPTION

Robust statistics

TRANSCRIPT

Robust variable selection in semiparametric mean-covarianceregression for longitudinal data analysis robust semiparametric smooth-threshold generalized estimating equations for the analysis of longitudinal data based on the modified Cholesky decomposition and B-spline approximations. proposed method can automatically eliminate inactive predictors by setting the corresponding parameters to be zero, and simultaneously estimate the mean regression coefficients, generalized autoregressive coefficients and innovation variances. In order to overcome the outliers in either the response or/and the covariate domain, we use a bounded score function and leverage-based weights to achieve better robustness. the proposed estimators have desired large sample properties including consistency and oracle property. Finally, Monte Carlo simulation studies are conducted to investigate the robustness and efficiency of the proposed method under different contaminations.

Hence, to seek a more robust method against outliers is a very important issue in longitudinal studies. In recent years, many authors studied the influence of outliers to the estimate and had developed many robust methods. An incomplete list of recent works on the robust GEE (generalized estimating equations)method include [4,6,9,18,19] and so on. a better estimate for the covariance matrix will result in a better estimate for the mean parameter. But all methods above mainly payed attention to the estimate of mean parameters while regarded the covariance parameters as nuisance parameters

Recently, motivated by the modified Cholesky decomposition,Ye and Pan [28] proposed joint mean and covariance regression models by using generalized estimating equations. The advantages of this decomposition are that it makes covariance matrices to be positive definite and the parameters in it have well-founded statistical concepts.

In this paper, we consider the semiparametric mean-covariance Model (SMCM) and decompose the inverse of covariance matrix by the modified Cholesky decomposition. The entries in this decomposition are autoregressive parameters and log innovation variances. See [10,28,31,32] for references.

This paper has made the following contributions: (i) we establish consistency and asymptotic normality ofthe mean regression coefficients, generalized autoregressive coefficients and innovation variances, and obtain the optimalconvergent rate for estimating the nonparametric functions. (ii) The proposed method can alleviate the effect of outliersin either the response or/and the covariate domain by using the bounded Hubers score function and Mallows weights.(iii) The proposed method can automatically eliminate inactive predictors by setting the corresponding parameters to bezero and estimate nonzero coefficients through semiparametric smooth-threshold generalized estimating equations.