Parsing stray text with Scrapy

Question

Any idea how to extract 'TEXT TO GRAB' from this piece of markup:


    
        
            LINK
        
    
    >
    TEXT TO GRAB

SIM · Accepted Answer

It's not an ideal solution, but it should do the trick:

from scrapy import Selector

content="""

    
        
            LINK
        
    
    >
    TEXT TO GRAB

"""
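sel = Selector(text=content)
# ::text yields only the text nodes that are direct children of .navigation_page
item = sel.css(".navigation_page::text")
# the stray text is the last of those nodes; strip the surrounding whitespace
print(item.extract()[-1].strip())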

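Or like this:

sel = Selector(text=content)
# collapse the whitespace inside each direct text node, then join the pieces;
# whitespace-only nodes collapse to empty strings and drop out
item = ''.join(' '.join(text.split()) for text in sel.css("span.navigation_page::text").extract())
print(item)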

Output:

TEXT TO GRAB
