Reputation: 16731
Is there a possibility to read the header of a CSV file white space and case insensitive? As for now I use csv.dictreader
like this:
import csv
csvDict = csv.DictReader(open('csv-file.csv', 'rU'))
# determine column_A name
if 'column_A' in csvDict.fieldnames:
column_A = 'column_A'
elif ' column_A' in csvDict.fieldnames:
# extra space
column_A = ' column_A'
elif 'Column_A' in csvDict.fieldnames:
# capital A
column_A = 'Column_A'
# get column_A data
for lineDict in csvDict:
print(lineDict[column_A])
As you can see from the code, my csv files sometimes differ in extra white space or capital letters, for example
I want to use something like this:
column_A = ' Column_A'.strip().lower()
print(lineDict[column_A])
Any ideas?
Upvotes: 7
Views: 4256
Reputation: 27611
How about override DictReader.fieldnames
property?
class MyDictReader(DictReader):
@property
def fieldnames(self):
return [field.strip().lower() for field in super(MyDictReader, self).fieldnames]
Upvotes: 7
Reputation: 879919
You can redefine reader.fieldnames
:
import csv
import io
content = '''column_A " column_B"
1 2'''
reader = csv.DictReader(io.BytesIO(content), delimiter = ' ')
reader.fieldnames = [field.strip().lower() for field in reader.fieldnames]
for line in reader:
print(line)
yields
{'column_b': '2', 'column_a': '1'}
Upvotes: 16