How to scrape affiliation related to each professor of a particular journal/article paper

Question

The website that I want to scrape is ScienceDirect . The affiliation will be available after clicking on the show more button. I am able to click on it but I am not able to scrape the affiliations which are loaded after clicking on the show more button Here is the code . The for loop is not printing the dl-tag which contains the affiliation

import time
from selenium import webdriver
from selenium.common.exceptions import NoSuchElementException
from selenium import webdriver
from bs4 import BeautifulSoup
driver = webdriver.Firefox()

driver.get('https://www.sciencedirect.com/science/article/pii/S1571065308000656')
soup = BeautifulSoup(driver.page_source,'html.parser')
time.sleep(7)

try:
    element = driver.find_element_by_css_selector('.show-hide-details.u-font-sans')
    element.click()
    time.sleep(15)
   
    for data in soup.find(id='author-group'):
        print(data)
        print('---')
        
except NoSuchElementException:  
    pass

wpercy · Accepted Answer

I think you need to move your soup instantiation down to after you've clicked on the "Show more" button.

If I run the following code:

driver = webdriver.Firefox()

driver.get('https://www.sciencedirect.com/science/article/pii/S1571065308000656')
time.sleep(3)

try:
    element = driver.find_element_by_css_selector('.show-hide-details.u-font-sans')
    element.click()
    time.sleep(9)
    soup = BeautifulSoup(driver.page_source,'html.parser')

    for data in soup.find(id='author-group'):
        print(data)
        print('---')

except NoSuchElementException:
    pass

my output is:

Author links open overlay panel
---
IgnazRutter¹
---
Fakultät für Informatik, Universität Karlsruhe, Germany
---

How to scrape affiliation related to each professor of a particular journal/article paper

Answers (2)

Related Questions