Python error:“IndexError: string index out of range”

Question

I import my txt file as str by using with open

with open('./doc', 'r') as f:
dat = f.readlines()

then I want to clean the data by using a for loop

docs = []
for i in dat:
if i.strip()[0] != '<':
    docs.append(i)

error returns

---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)
 in ()
      1 docs = []
      2 for i in dat:
----> 3     if i.strip()[0] != '<':
      4         docs.append(i)

IndexError: string index out of range

but if I change the code like this,just select the first 3000 lines, the code works.

docs = []
for i in dat[:3000]:
if i.strip()[0] != '<':
    docs.append(i)

My txt file contains 93408 lines ,why I can't select them all? thx!

developer_hatch · Accepted Answer

one or more lines could be empty, you need to check it before take first elem

if i.strip() != "" and i.strip()[0] != '<':
    docs.append(i)

Python error:“IndexError: string index out of range”

Answers (1)

Related Questions