Retrieve text between multiple
in xml with python

Question

Hello,

I have xml files composed as follows, I would like to retrieve text1, text2, text3 and text4.




text1 
 text2  
 text3  
 text4

I've been stuck for days without finding a solution in the ElementTree doc. I have the following code but I only get the first text because of the . In addition the number of is variable from one file to another..

import xml.etree.ElementTree as ET

tree = ET.parse(file.xml))
root = tree.getroot()

for txt in root.iter('CONTENU'):
   print(txt)

>>> text1

How can I do that? Thanks in advance :)

dabingsou · Accepted Answer

Another method.

from simplified_scrapy import SimplifiedDoc,utils,req
html = '''



text1 
 text2  
 text3  
 text4




'''
doc = SimplifiedDoc(html)
texts = doc.select('CONTENU').getText(separator="|").split('|')
print (texts)

Retrieve text between multiple <br> in xml with python

Answers (2)

Related Questions

Retrieve text between multiple &lt;br&gt; in xml with python

Answers (2)

Related Questions

Retrieve text between multiple <br> in xml with python