Tag value not printing etree lxml

Question

I want to print the "Printable String" part of the code. I also tried to print the whole tag itself, but I could only find a way to print the tag name, not the whole tag. Retrieving the XPath and the whole tag is my biggest challenge right now. Thank you!

Code:

from bs4 import BeautifulSoup
from lxml import etree

doc = "Printable String"
soup = BeautifulSoup(doc, "lxml")
root = etree.fromstring(str(soup))

tree = etree.ElementTree(root)
for i, e in enumerate(root.iter()):
    print(e.text)

Output:

None
None
None
None
None
[Finished in 0.2s]

Expected Output:

None 
None
Printable String
None 
None

Jack Fleeting · Accepted Answer

A couple of things to notice:

First, for some reason you parse doc with soup and then parse the string of soup again with lxml. The first problem is that BS doesn't leave the string alone. If you run

print(soup)

the output is

Printable String

You will notice two new elements (html and body) are now added, which explains why you get five Nones instead of only three.
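To see the wrapping without BeautifulSoup, here is a minimal sketch that calls lxml's HTML parser directly (the same parser the "lxml" option in BeautifulSoup relies on). The markup in doc is a hypothetical stand-in, since the tags of the original string are not shown here, and the tag name data is made up for illustration.

```python
from lxml import etree

# Hypothetical markup: a single made-up element wrapping the string.
doc = "<data>Printable String</data>"

# etree.HTML uses the forgiving HTML parser, which builds a full HTML
# document around the fragment instead of keeping it as the root.
html_root = etree.HTML(doc)
tags = [e.tag for e in html_root.iter()]

print(tags)
```

The list of tags should start with html and body, followed by the element from doc, which is why iterating over the re-parsed soup yields more elements than the original string contains.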

If you parse doc directly with lxml like so and use xpath:

doc = "Printable String"
root = etree.fromstring(doc)
for z in root.xpath('//*'):
    print(z.xpath('text()'))

Output is

['Printable String']
[]
[]
