KeyError: "None of [Int64Index([1960, 1961, 1962, 1963, 1964], dtype='int64')] are in the [columns]"

Question

I am trying to scatter plot a dataframe and for this I have provided it with x and y components. It is showing error in the x component. it gives the error on 'Year' column. I have checked manually that Year Column exists in the dataframe still it shows error. Note that year column contains years from 1960 to 1964.

urb_pop_reader = pd.read_csv('ind_pop_data.csv', chunksize=1000)
df_urb_pop = next(urb_pop_reader)
df_pop_ceb = df_urb_pop[df_urb_pop['CountryCode'] == 'CEB']
pops = zip(df_pop_ceb['Total Population'], 
           df_pop_ceb['Urban population (% of total)'])
pops_list = list(pops)

# Use list comprehension to create new DataFrame column 'Total Urban Population'
df_pop_ceb['Total Urban Population'] = [int(a[0]*(a[1]*0.01)) for a in pops_list]

# Plot urban population data
df_pop_ceb.plot(kind='scatter', x=df_pop_ceb['Year'], y=df_pop_ceb['Total Urban Population'])
plt.show()

Dan · Accepted Answer

If you want to use pandas' plotting, you should pass the labels as x and y, not the data:

df_pop_ceb.plot(kind='scatter', x='Year', y='Total Urban Population')

Also looking at the docs I think you should rather do

df_pop_ceb.plot.scatter(x='Year', y='Total Urban Population')

KeyError: "None of [Int64Index([1960, 1961, 1962, 1963, 1964], dtype='int64')] are in the [columns]"

Answers (2)

Related Questions

KeyError: &quot;None of [Int64Index([1960, 1961, 1962, 1963, 1964], dtype=&#39;int64&#39;)] are in the [columns]&quot;

Answers (2)

Related Questions

KeyError: "None of [Int64Index([1960, 1961, 1962, 1963, 1964], dtype='int64')] are in the [columns]"