Python: IndexError: list index out of range (reading from CSV with 3 columns)

Question

I am working on creating a stacked bar graph drawn from data in a CSV file. The data looks like this:

ANC-088,333,148
ANC-089,153,86
ANC-090,138,75

There are more rows just like this.

The beginning script I have, just to start playing with bar graphs, looks like this:

from pylab import *

name = []
totalwords = []
uniquewords = []

readFile = open('wordstats-legends.csv', 'r').read()
eachLine = readFile.split('\n')

for line in eachLine:
    split = line.split(',')
    name.append(split[0])
    totalwords.append(split[1])
    uniquewords.append(int(split[2]))

pos = arange(len(name)) + 0.5
bar(pos, totalwords, align = 'center', color='red')
xticks(pos, name)

When I decided to see how things were going, I got the following error:

---> 13     totalwords.append(split[1])
IndexError: list index out of range

What am I not seeing and what are my first steps in fixing this? (Additional explanations most welcome as I continue to try to teach myself this stuff.)

Adalee · Accepted Answer

If you are sure the whole file looks as you described, the problem is most likely the last newline (at the end of the file). You split the text at the newline character, and since nothing follows the last newline, an empty string ends up as the last element of eachLine. Splitting that empty string on ',' gives a list with a single element, so split[1] raises the IndexError. So you only need to drop the last element of eachLine, e.g. with eachLine.pop() after splitting.
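A minimal sketch of the effect, using an in-memory string in place of the file. The rows are the ones from the question; the names text, last, fields and rows are only for illustration, and the check before pop() is an extra precaution in case the file does not end with a newline:

```python
# Stand-in for the contents of wordstats-legends.csv, ending with a newline.
text = "ANC-088,333,148\nANC-089,153,86\nANC-090,138,75\n"

eachLine = text.split('\n')

# The trailing newline produces an empty final element.
last = eachLine[-1]
# Splitting the empty string gives a one-element list,
# so indexing it with [1] is what raises the IndexError.
fields = last.split(',')

# Drop the last element, but only if it really is empty.
if eachLine and eachLine[-1] == '':
    eachLine.pop()

rows = [line.split(',') for line in eachLine]
```

After this, rows holds only the three data lines, and the loop from the question can index split[1] and split[2] safely.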

If you would like a robust, general solution that handles every line that can't be split into three parts, you should use the solution from user1823. However, if the problem really is only the trailing newline described above, checking the length of every line after splitting can slow you down on larger files.
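For comparison, here is one way a per-line guard might look. This is a sketch, not necessarily the code from the other answer: it uses the standard csv module, the helper name read_rows is made up for this example, and it converts both counts to int (the question's code converts only the third column):

```python
import csv
import io

def read_rows(handle):
    """Collect the three columns, skipping any line without exactly 3 fields."""
    name, totalwords, uniquewords = [], [], []
    for row in csv.reader(handle):
        if len(row) != 3:
            # Blank or malformed line: skip it instead of raising IndexError.
            continue
        name.append(row[0])
        totalwords.append(int(row[1]))
        uniquewords.append(int(row[2]))
    return name, totalwords, uniquewords

# In-memory stand-in for the file, with a blank line in the middle
# and a trailing newline at the end.
sample = io.StringIO("ANC-088,333,148\nANC-089,153,86\n\nANC-090,138,75\n")
name, totalwords, uniquewords = read_rows(sample)
```

With the real file you would pass an open file object instead, for example inside with open('wordstats-legends.csv', newline='') as f: and then read_rows(f). The extra length check runs once per line, which is the cost mentioned above.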
