Pandas - Usecols when columns exist in csv

Question

Since the columns and list of usecols are different, it spits the error

"ValueError" Usecols do not match names.

How can I 'usecol' if columns exist in csv?

csv sample:

df.csv

AB,CD,EF,GH
foo,20160101,a,1
foo,20160102,a,3
foo,20160103,a,5

reading csv:

import pandas as pd


df = pd.read_csv('df.csv', 
    header=0,usecols=["AB", "CD", "IJ"])

This is what I'd like to get:

df

date       AB   CD
2016-01-01  a    1
2016-01-02  a    3
2016-01-03  a    5

Ignored "IJ".

piRSquared · Accepted Answer

import csv normally

import pandas as pd
from io import StringIO

txt = """AB,CD,EF,GH
foo,20160101,a,1
foo,20160102,a,3
foo,20160103,a,5"""

df = pd.read_csv(StringIO(txt))

print(df)

    AB        CD EF  GH
0  foo  20160101  a   1
1  foo  20160102  a   3
2  foo  20160103  a   5

reindex with intersection

usecols = ['AB', 'CD', 'IJ']
df.reindex_axis(df.columns.intersection(usecols), 1)

    AB        CD
0  foo  20160101
1  foo  20160102
2  foo  20160103

Pandas - Usecols when columns exist in csv

Answers (2)

Related Questions