Remove commas and newlines from text file in python

Question

I have text file which looks like this:

ab initio
ab intestato
ab intra
a.C.
acanka, acance, acanek, acankach, acankami, acanką
Achab, Achaba, Achabem, Achabie, Achabowi

I would like to pars every word separated by comma into a list. So it would look like ['ab initio', 'ab intestato', 'ab intra','a.C.', 'acanka', ...] Also mind the fact that there are words on new lines that are not ending with commas. When I used list1.append(line.strip()) it gave me string of every line instead of separate words. Can someone provide me some insight into this?

Full code below:

list1=[]
filepath="words.txt"
with open(filepath, encoding="utf8") as fp:  
   line = fp.readline()
   while line:
       list1.append(line.strip(','))
       line = fp.readline()

sberry · Accepted Answer

Very close, but I think you want split instead of strip, and extend instead of append

You can also iterate directly over the lines with a for loop.

list1=[]
filepath="words.txt"
with open(filepath, encoding="utf8") as fp:  
   for line in fp:
       list1.extend(line.strip().split(', '))

Remove commas and newlines from text file in python

Answers (2)

Related Questions