
A simple explanation of the Lasso and Least Angle Regression

Given a set of input measurements x1, x2, ... xp and an outcome measurement y, the lasso fits a linear model

yhat = b0 + b1*x1 + b2*x2 + ... + bp*xp

The criterion it uses is:

Minimize sum( (y - yhat)^2 ), subject to sum[absolute value(bj)] <= s

The first sum is taken over the observations (cases) in the dataset. The bound "s" is a tuning parameter. When "s" is large enough, the constraint has no effect and the solution is just the usual multiple linear least squares regression of y on x1, x2, ... xp. However, for smaller values of s (s >= 0) the solutions are shrunken versions of the least squares estimates. Often, some of the coefficients bj are zero. Choosing "s" is like choosing the number of predictors to use in a regression model, and cross-validation is a good tool for estimating the best value for "s".
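As a rough illustration (not part of the original note), here is a small Python sketch using scikit-learn. Note that scikit-learn's Lasso is parameterized by a penalty weight alpha in the equivalent Lagrangian form rather than by the bound "s"; a larger alpha corresponds to a smaller "s".

# A minimal sketch, assuming scikit-learn and numpy are available.
import numpy as np
from sklearn.linear_model import Lasso, LassoCV

rng = np.random.default_rng(0)
n, p = 100, 10
X = rng.normal(size=(n, p))
beta = np.array([3.0, -2.0, 1.5] + [0.0] * (p - 3))   # only 3 true predictors
y = X @ beta + rng.normal(scale=0.5, size=n)

# Fit at a fixed penalty: several coefficients come out exactly zero.
fit = Lasso(alpha=0.1).fit(X, y)
print("coefficients:", np.round(fit.coef_, 2))

# Cross-validation to pick the penalty (i.e. to pick "s").
cv_fit = LassoCV(cv=5).fit(X, y)
print("penalty chosen by cross-validation:", cv_fit.alpha_)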

Computation of the Lasso solutions

The computation of the lasso solutions is a quadratic programming problem, and can be tackled by standard numerical analysis algorithms. But the least angle regression procedure is a better approach. This algorithm exploits the special structure of the lasso problem, and provides an efficient way to compute the solutions simultaneously for all values of "s".
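For concreteness, here is a sketch (my own addition, assuming the cvxpy package is available) of the quadratic-programming route: it solves the constrained problem directly, but only for one fixed value of "s". The point of least angle regression is that it avoids re-solving this problem from scratch for every value of "s".

# A rough sketch, assuming cvxpy and numpy are installed: the lasso at a
# single fixed bound "s", posed as a quadratic program.
import cvxpy as cp
import numpy as np

rng = np.random.default_rng(0)
n, p = 100, 10
X = rng.normal(size=(n, p))
y = X @ np.array([3.0, -2.0, 1.5] + [0.0] * (p - 3)) + rng.normal(scale=0.5, size=n)

s = 4.0                                     # the bound on the sum of |bj|
b = cp.Variable(p)
problem = cp.Problem(cp.Minimize(cp.sum_squares(y - X @ b)),
                     [cp.norm1(b) <= s])
problem.solve()
print("coefficients at this s:", np.round(b.value, 2))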

Least angle regression is like a more "democratic" version of forward stepwise regression. Recall how forward stepwise regression works:

Forward stepwise regression algorithm:

Start with all coefficients bj equal to zero.
Find the predictor xj most correlated with y, and add it into the model.
Take residuals r = y - yhat.
Continue, at each stage adding to the model the predictor most correlated with r.
Until: all predictors are in the model.
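As a concrete sketch of the scheme above (my own addition, in plain numpy; the helper name forward_stepwise is made up for illustration):

# Forward stepwise regression: at each step, add the predictor most
# correlated with the current residual, then refit by least squares.
import numpy as np

def forward_stepwise(X, y):
    n, p = X.shape
    Xc = X - X.mean(axis=0)          # center so the intercept drops out
    yc = y - y.mean()
    active = []                      # predictors currently in the model
    r = yc.copy()                    # current residual
    path = []
    while len(active) < p:
        cors = np.abs(Xc.T @ r)      # correlation (up to scaling) with residual
        cors[active] = -np.inf       # ignore predictors already in the model
        active.append(int(np.argmax(cors)))
        beta, *_ = np.linalg.lstsq(Xc[:, active], yc, rcond=None)
        r = yc - Xc[:, active] @ beta
        path.append((list(active), beta))
    return path

Each entry of path records the active set and its least squares coefficients after one more predictor has been added.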

The least angle regression procedure follows the same general scheme, but doesn't add a predictor fully into the model. The coefficient of that predictor is increased only until that predictor is no longer the one most correlated with the residual r. Then some other competing predictor is invited to "join the club".

Least angle regression algorithm:

Start with all coefficients bj equal to zero.
Find the predictor xj most correlated with y.
Increase the coefficient bj in the direction of the sign of its correlation with y. Take residuals r = y - yhat along the way. Stop when some other predictor xk has as much correlation with r as xj has.
Increase (bj, bk) in their joint least squares direction, until some other predictor xm has as much correlation with the residual r.
Continue until: all predictors are in the model.

Surprisingly, it can be shown that, with one modification, this procedure gives the entire path of lasso solutions, as "s" is varied from 0 to infinity. The modification needed is: if a non-zero coefficient hits zero, remove it from the active set of predictors and recompute the joint direction.
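As a closing sketch (not from the original note), scikit-learn's lars_path implements this procedure; method="lar" gives plain least angle regression, while method="lasso" applies the modification above and so traces the entire lasso path in one pass.

# A small sketch, assuming scikit-learn and numpy are available.
import numpy as np
from sklearn.linear_model import lars_path

rng = np.random.default_rng(0)
n, p = 100, 10
X = rng.normal(size=(n, p))
y = X @ np.array([3.0, -2.0, 1.5] + [0.0] * (p - 3)) + rng.normal(scale=0.5, size=n)

alphas, active, coefs = lars_path(X, y, method="lasso")
# coefs has one column per step of the path; the L1 norm of each column
# plays the role of the bound "s".
for step in range(coefs.shape[1]):
    s_value = np.abs(coefs[:, step]).sum()
    print(f"s = {s_value:6.2f}  coefficients:", np.round(coefs[:, step], 2))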