sns.pairplot returns bad results for Kmeans cluster visualizations

Question

#import libraries
import pandas as pd
import numpy as np
import random as rd
import matplotlib.pyplot as plt
import seaborn as sns

data = pd.read_csv('C:/Users/yehya/Desktop/cmps276/forestfires.csv')
data = pd.get_dummies(data)

#Visualise data points

sns.pairplot(data)
sns.plt.show()
#plt.show()

I'm trying to run a simple scatterplot using sns.pairplot, my end goal is applying Kmeans cluster on my data. But I want to visualize my data. before applying anything I wanted to use a scatterplot. using the above code the results I got were these . the data consists of 13 columns and about 450 rows. I'm new to these data manipulation algorithms and visualizations, I'm not sure I'm approaching this problem in the correct way. what might be a better way to visualize my data? the target column is Area. ill leave a link to the dataset which can be found on Kaggle https://www.kaggle.com/elikplim/forest-fires-data-set, forestfire. Help would be appreciated thanks

sns.pairplot returns bad results for Kmeans cluster visualizations

Answers (1)

Related Questions