Error while passing a data frame through k-means

Question

My data frame appears to contain only float values, but when I pass it to k-means I get an error saying it could not convert a string to float.

How can I convert any NaN values to float values across the entire data frame?

Yoshitha Penaganti · Accepted Answer

The error indicates that at least one column still contains strings. Convert every string column to categorical codes, as in the code below, or one-hot encode those columns instead.

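import pandas as pd
from sklearn.cluster import KMeans

df = pd.read_csv('zipIncome.csv')
print(df.dtypes)  # non-numeric dtypes mark the columns that trigger the error

# Replace every non-numeric column with integer category codes (missing values become -1)
for col_name in df.select_dtypes(exclude='number').columns:
    df[col_name] = df[col_name].astype('category').cat.codes

# The algorithm argument is left at its default: newer scikit-learn releases no longer accept 'auto'
kmeans = KMeans(n_clusters=4, init='k-means++', max_iter=600).fit(df)
print(kmeans.labels_)
print(kmeans.cluster_centers_)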
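
The one-hot encoding alternative mentioned above could look roughly like the sketch below. It is only an illustration under stated assumptions: the helper name `prepare_for_kmeans`, the column names `zip_region` and `income`, and the sample values are all hypothetical stand-ins, not taken from `zipIncome.csv`. It also fills NaN values with the column mean, which is one possible choice, since k-means rejects NaN as well as strings.

```python
import pandas as pd
from sklearn.cluster import KMeans


def prepare_for_kmeans(df):
    """Return an all-float copy of df (hypothetical helper, not a library function)."""
    # One 0/1 indicator column per category; numeric columns pass through unchanged
    encoded = pd.get_dummies(df, dtype=float)
    # KMeans also rejects NaN, so fill gaps with each column's mean (an assumption;
    # dropping the rows or using the median are other options)
    return encoded.fillna(encoded.mean()).astype(float)


# Hypothetical stand-in for the real data
sample = pd.DataFrame({
    "zip_region": ["north", "south", "north", "east"],
    "income": [52000.0, None, 61000.0, 48000.0],
})

features = prepare_for_kmeans(sample)
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(features)
print(features.dtypes)
print(kmeans.labels_)
```

Unlike integer category codes, one-hot columns do not imply any ordering among the categories, which matters because k-means works on distances between rows.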
