How to skip tag while scraping data using selenium

Question

HTML:


       
            Tim Cook 
            Apple CEO
                all CEOs  // Nor required this node
           
       
       
            Sundar Pichai 
            Google CEO 
       
       
            NoCompany 
            NOT, DEFINED

Code:

applicationData = [td.text for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow"]')]
record = {'Designation': applicationData[0],
 'Designation': applicationData[1],'Designation': applicationData[2]}

OUTPUT:

 Designation: Apple CEO all CEOs  // Not required 'all CEOs'
 Designation: Google CEO
 Designation: Not, DEFINED

I am scraping data from the table and the

How can I do this?

I tried [td.get_attribute("textContent").split(" ")[0] for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow" and text()!=" "]')]

OUTPUT:

 Designation: Apple CEO  
 Designation: Google CEO
 Designation:           // should have value 'NOT, DEFINED'

How to get value?

PDHide · Accepted Answer

applicationData = [td.get_attribute("textContent").split("
")[0] for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow"]')]
record = {'Designation1': applicationData[0], 'Designation2': applicationData[1]}

Try above code , here we use TextCOntent and it returns different text nodes in different lines so you can split it using " "

How to skip <a> tag while scraping data using selenium

Answers (1)

Related Questions

How to skip &lt;a&gt; tag while scraping data using selenium

Answers (1)

Related Questions

How to skip <a> tag while scraping data using selenium