Vaibhav Singh
Vaibhav Singh

Reputation: 1209

Color based on categorical variable in Python SNS Barplot

I have a dataframe as below (obtained after lot of preprocessing)

Please find code

{'token': {0: '180816_031', 1: '180816_031', 2: '180816_031', 3: '180816_031', 4: '180816_031', 5: '180816_031', 6: '180816_031', 7: '180816_031', 8: '180816_031', 9: '180816_031'}, 'variable': {0: 'Unnamed: 0', 1: 'adj_active_polymerase', 2: 'adj_functional_sequencing_pores', 3: 'adj_high_quality_reads', 4: 'adj_single_pores', 5: 'cell_mask_bilayers_sum', 6: 'num_align_high_quality_reads', 7: 'num_total_cells', 8: 'potential_pore', 9: 'short_pass'}, 'value': {0: 21.0, 1: 615850.51515151514, 2: 615850.51515151514, 3: 486008.39393939392, 4: 803784.06060606055, 5: 1665347.5757575757, 6: 468638.03030303027, 7: 2097152.0, 8: 1158527.0, 9: 2067189.2424242424}}

I am using below code to create my SNS bar plot but its throwing error, after graph I want to show standard deviation

df1 = df1.groupby(['token','variable']).agg({'value': 'mean'})
df1.reset_index(inplace=True)
g=sns.barplot(x='token',y='value',data=df1, color='variable')
plt.show()

The output that I want is as below

Dataframe

Dataframe Expected Output

However, I get an error Code on this

ValueError: Invalid RGBA argument: 'variable'

Upvotes: 0

Views: 2409

Answers (1)

Space Impact
Space Impact

Reputation: 13255

First the mistake lies in the color parameter. It is hue parameter that do what you wanted. Check the code below:

#For sorting the values in descending order
df.sort_values('value',inplace=True,ascending=False)

fig,ax = plt.subplots()
fig.set_size_inches(16,8)

#to get different colors for each of the variable assign the variable to hue
g=sns.barplot(x='token',y='value',data=df, hue='variable',ax=ax)

#Code for to put legend outside the plot
box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.8, box.height])
# Put a legend to the right of the current axis
ax.legend(loc='center left', bbox_to_anchor=(1, 0.5))

# Adding respective values to the top of each bar
for p in ax.patches: 
    ax.annotate("%d" % p.get_height(), (p.get_x() + p.get_width() / 2, p.get_height()),
                ha='center', va='center', fontsize=11, color='black', xytext=(0, 10), 
                textcoords='offset points',fontweight='bold')

#To save the plot as 'SO.png'    
plt.savefig('SO.png',dpi=100,bbox_inches='tight')
plt.show()

Code for legend is from link.

The plot looks like:

Sample picture of the data provided as per OP's visualization

Upvotes: 1

Related Questions