Sierra Walker

Reputation: 106

K-means plotting problem: not sure where I am going wrong, any suggestions?

I have been working with some data for quite some time now and I am trying to get 4 clusters using the k-means method. I will put my code below so you can see where I currently am. I'm not sure if I missed a step or what, but this is my first time doing k-means clustering with Python.

data=Bel_sort_cleaned.drop(['cprptp','JURISDICTION','STREET','ADDRESS','ddes1',                      
'DESCRIPTION','COM_BLDG_VALUE','OCCUPANCY','segments','OBJECTID'], axis=1)
data=data.dropna()

X=data.values.reshape(-1,1)
y=data['HOUSENUM'].values.reshape(-1,1)

kmeans = KMeans(n_clusters=4, random_state=0).fit(X)

label = kmeans.fit_predict(X)

plt.scatter(X[label==0, 0], X[label==0, 1], s=100, c='red', label ='Cluster 1')
plt.scatter(X[label==1, 0], X[label==1, 1], s=100, c='blue', label ='Cluster 2')
plt.scatter(X[label==2, 0], X[label==2, 1], s=100, c='green', label ='Cluster 3')
plt.scatter(X[label==3, 0], X[label==3, 1], s=100, c='cyan', label ='Cluster 4')

The original data was loaded in earlier in my file, which is where 'Bel_sort_cleaned' comes from.

Any ideas would be greatly appreciated, as I am pretty stuck.

I am currently getting an IndexError.

Upvotes: 2

Views: 1949

Answers (1)

dipetkov

Reputation: 3690

The issue is with how you preprocess the input features X before clustering, but Python only stumbles on the problem when you attempt to plot the clusters.

So split the analysis into two parts: 1) clustering, 2) visualization. And make sure that part 1) works as intended before moving on to part 2).

Let's make this answer reproducible by providing the code to generate fake data for classification.

import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification

X, y = make_classification(
    n_samples=100,
    n_features=20, n_informative=4,
    n_classes=4, n_clusters_per_class=1,
    random_state=1234,
)
data = pd.DataFrame(np.c_[X, y])
X.shape, y.shape, data.shape
#> ((100, 21), (100,), (100, 21))

Why do you flatten X? This doesn't make sense, and it's where the bug occurs.

X = data.values.reshape(-1, 1)
X.shape
#> (2100, 1)
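To see why the flattening leads to an IndexError: after the reshape, X has a single column, so any attempt to index column 1 (as the plotting code does with `X[label == 0, 1]`) is out of bounds. A minimal sketch with made-up data:

```python
import numpy as np

# A (10, 1) array, shaped like the flattened X
X_flat = np.arange(10, dtype=float).reshape(-1, 1)
label = np.zeros(10, dtype=int)  # pretend every point landed in cluster 0

X_flat[label == 0, 0]  # column 0 exists, so this works
try:
    X_flat[label == 0, 1]  # column 1 does not exist
except IndexError as e:
    print(e)
#> index 1 is out of bounds for axis 1 with size 1
```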

Instead, let's use all the columns in the data frame. Or even better, for your actual analysis, explicitly specify the columns to use as input features for clustering. For example, do you really want to use HOUSENUM and OBJECTID to generate the K-Means clusters?

X = data.values
X.shape
#> (100, 21)
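For your real dataset, an explicit column list makes it clear which variables drive the clustering and keeps identifiers out. A sketch, with hypothetical column names standing in for the ones in Bel_sort_cleaned:

```python
import pandas as pd

# Hypothetical numeric columns; substitute the ones from your data
# that you actually want to cluster on.
data = pd.DataFrame({
    "LAND_VALUE": [100.0, 200.0, 150.0, 400.0],
    "BLDG_VALUE": [300.0, 250.0, 500.0, 100.0],
    "HOUSENUM": [1, 2, 3, 4],  # an identifier, not a meaningful feature
})

feature_cols = ["LAND_VALUE", "BLDG_VALUE"]
X = data[feature_cols].values
X.shape
#> (4, 2)
```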

kmeans = KMeans(n_clusters=4, random_state=0)
kmeans.fit(X)

Check that there are 4 centroids, one for each cluster. Each centroid has as many dimensions as there are input features in X.

kmeans.cluster_centers_.shape
#> (4, 21)

We've checked that there are 4 clusters. Finally, we are ready to plot them, along dimensions 1 and 2, which correspond to the first two features in the feature matrix X.

label = kmeans.predict(X)

plt.scatter(X[label == 0, 0], X[label == 0, 1], s=100, c='red', label='Cluster 1')
plt.scatter(X[label == 1, 0], X[label == 1, 1], s=100, c='blue', label='Cluster 2')
plt.scatter(X[label == 2, 0], X[label == 2, 1], s=100, c='green', label='Cluster 3')
plt.scatter(X[label == 3, 0], X[label == 3, 1], s=100, c='cyan', label='Cluster 4')
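As a finishing touch (not required to fix the error), the four scatter calls can be collapsed into a loop, and `plt.legend()` makes the cluster labels visible. A self-contained version of the plotting step might look like:

```python
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification

# Same fake data as above
X, _ = make_classification(
    n_samples=100, n_features=20, n_informative=4,
    n_classes=4, n_clusters_per_class=1, random_state=1234,
)
label = KMeans(n_clusters=4, random_state=0).fit_predict(X)

# One scatter call per cluster, along the first two features
for k, color in enumerate(["red", "blue", "green", "cyan"]):
    plt.scatter(X[label == k, 0], X[label == k, 1],
                s=100, c=color, label=f"Cluster {k + 1}")

plt.xlabel("Feature 1")
plt.ylabel("Feature 2")
plt.legend()
plt.show()
```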

[Scatter plot of the four clusters along the first two features]

Upvotes: 1
