How to configure lasso regression to not penalize certain variables?

Question

I'm trying to use lasso regression in python. I'm currently using lasso function in scikit-learn library.

I want my model not to penalize certain variables while training. (penalize only the rest of variables)

Below is my current code for training

rg_mdt = linear_model.LassoCV(alphas=np.array(10**np.linspace(0, -4, 100)), fit_intercept=True, normalize=True, cv=10)
rg_mdt.fit(df_mdt_rgmt.loc[df_mdt_rgmt.CLUSTER_ID == k].drop(['RESPONSE', 'CLUSTER_ID'], axis=1), df_mdt_rgmt.loc[df_mdt_rgmt.CLUSTER_ID == k, 'RESPONSE'])

df_mdt_rgmt is the data mart and I'm trying to keep the coefficient for certain columns non-zero.

glmnet in R provides 'penalty factor' parameter that let me do this, but how can I do that in python scikit-learn?

Below is the code I have in R

get.Lassomodel <- function(TB.EXP, TB.RSP){
  VT.PEN <- rep(1, ncol(TB.EXP))
  VT.PEN[which(colnames(TB.EXP) == "DC_RATE")] <- 0
  VT.PEN[which(colnames(TB.EXP) == "FR_PRICE_PW_REP")] <- 0

  VT.GRID <- 10^seq(0, -4, length=100)

  REG.MOD <- cv.glmnet(as.matrix(TB.EXP), as.matrix(TB.RSP), alpha=1, 
  lambda=VT.GRID, penalty.factor=VT.PEN, nfolds=10, intercept=TRUE)

  return(REG.MOD)
}

How to configure lasso regression to not penalize certain variables?

Answers (1)

Related Questions