Scikit Learn : Logistic Regression Error

Question

I am trying to run logistic regression on my data (6 categorical, 1 integer) using scikit learn. I am following the scikit learn documentation but when trying to fit my data I am getting the following value error. Can someone please help.

#Below are the variables of my data.
train_data.dtypes
    OUTPUT
    TripType                 category
    VisitNumber              category
    Weekday                  category
    Upc                      category
    ScanCount                   int64
    DepartmentDescription    category
    FinelineNumber           category
    dtype: object


X = train_data.loc[:, 'VisitNumber':'FinelineNumber']
Y = train_data.loc[:, 'TripType':'TripType']
logreg = linear_model.LogisticRegression()
logreg.fit(X, Y)

**ValueError: could not convert string to float: GROCERY DRY GOODS**

jakevdp · Accepted Answer

Scikit-learn can only handle numerical features. For some ideas on how to proceed in your case, see Encoding Categorical Features in the scikit-learn docs.

Scikit Learn : Logistic Regression Error

Answers (2)

Related Questions