New Programmer

Reputation: 21

Error message 10054 when webscraping with requests module

New programmer here. I tried to webscrape the POF site while learning Python, using the requests and Beautiful Soup modules. Thanks in advance.

The error seems to come from the line res=requests.get('https://www.pof.com/everyoneonline.aspx?page_id=%s' %pageId)

I tried to remove the pagination and scrape only one page, but it didn't work. I also tried time.sleep with 3 seconds between each request, but that didn't work either.

# Username and password
username = 'MyUsername'
password = 'MyPassword'


# Log in to the pof site
from selenium import webdriver
import bs4, requests
browser = webdriver.Chrome(executable_path='/Users/Desktop/geckodriver-v0.24.0-win32/chromedriver.exe')
browser.get('https://www.pof.com')
linkElem = browser.find_element_by_link_text('Sign In')
linkElem.click()
usernameElem = browser.find_element_by_id('logincontrol_username')
usernameElem.send_keys(username)
passwordElem = browser.find_element_by_id('logincontrol_password')
passwordElem.send_keys(password)
passwordElem.submit()

# Webscraping online profile links from the first 7 pagination pages
for pageId in range(7):
    res = requests.get('https://www.pof.com/everyoneonline.aspx?page_id=%s' % pageId)
    res.raise_for_status()
    soup = bs4.BeautifulSoup(res.text, 'html.parser')
    profile = soup.findAll('div', attrs={'class': 'rc'})
    for div in profile:
        # findAll returns a list, so iterate over it rather than indexing with 'href'
        for link in div.findAll('a'):
            print(link['href'])

Expected result: Printing a list of all href links of profile, so I can later save them to a csv

Actual result: requests.exceptions.ConnectionError: ('Connection aborted.', ConnectionResetError(10054, 'An existing connection was forcibly closed by the remote host', None, 10054, None))

Upvotes: 2

Views: 4142

Answers (1)

Xosrov

Reputation: 729

I'm gonna give you some general info for scraping webpages:

  1. First of all, don't use requests and selenium together! In my experience, requests is the fastest and easiest solution 90% of the time.
  2. Always try to provide headers with your request. Providing no headers makes the webpage suspicious, and it might even block all your requests (the error you're getting could be because of this!).
  3. For subsequent requests to the webpage, use a session! This way your cookies get stored and you can actually access the logged-in page for a long period of time (a minimal sketch follows this list).
  4. This one is more subjective, but I suggest using the re module if you already know regex. BeautifulSoup is great, but for general-purpose use, re is just easier in my experience.
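
A minimal sketch of points 2 and 3 (the URL and header values here are placeholders, not taken from your code):

import requests

headers = {
    # browser-like headers; copy the real values from your own browser
    "user-agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36",
    "accept-language": "en-US,en;q=0.9",
}

with requests.Session() as session:
    session.headers.update(headers)  # sent with every request made through the session
    res = session.get("https://example.com/login")  # placeholder URL
    res.raise_for_status()
    # any cookies the server sets are now stored on the session,
    # so later requests through it stay logged in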

So now, to answer your question: there are a lot of different webpages out there, but this is how I suggest scraping any of them.

Extracting Data


Header data

  • Open up your usual browser with inspect-element support. Go to the webpage you're trying to scrape and open the inspect element dock.
  • Go to the Network section. Here you can see all the requests your browser makes, along with their headers and sources.
  • Make the request you want to emulate, keep track of the Network tab, and go to the request that contains the desired GET (or, in your case, POST) method.
  • Copy the request headers for that particular request. You don't need all of them (for example, the cookie parameter will be added by the session, so it's not needed here; headers starting with a colon, like :method: POST, aren't needed either).
  • Put the copied headers from your browser into a Python dict; here's an example from this very webpage:
headers = {
    "accept": "application/json, text/javascript, */*; q=0.01",
    "accept-encoding": "gzip, deflate, br",
    "accept-language": "en-US,en;q=0.9,fa-IR;q=0.8,fa;q=0.7,de;q=0.6",
    "content-type": "application/x-www-form-urlencoded; charset=UTF-8",
    "dnt": "1",
    "origin": "https://stackoverflow.com",
    "referer": "https://stackoverflow.com/questions/56399462/error-message-10054-when-wescraping-with-requests-module",
    "user-agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) snap Chromium/74.0.3729.169 Chrome/74.0.3729.169 Safari/537.36",
}

Post data

  • If you want to make a POST request, there should be another section in the Headers tab of the request, named something like "Payload" or "Form Data". Put its contents in another Python dict, and change its contents as desired (a short example follows).
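
For instance, if the Form Data panel showed fields named user_name and password (hypothetical names; the real field names depend entirely on the site), the dict would look like this:

data = {
    "user_name": "something",
    "password": "somethingelse",
}

The full login example below shows a dict like this being passed to session.post.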

Using Data


Now you're ready to put the extracted data to use with Python requests, then use re or BeautifulSoup on the response contents to extract your desired data.
In this example, I'm logging in to https://aavtrain.com/index.asp.
Try to follow the steps I've written and make sense of what's happening here:

import requests
username = "something"
password = "somethingelse"
headers = {
    "accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3",
    "accept-encoding": "gzip, deflate, br",
    "cache-control": "max-age=0",
    "content-type": "application/x-www-form-urlencoded",
    "dnt": "1",
    "origin": "https://aavtrain.com",
    "referer": "https://aavtrain.com/index.asp",
    "upgrade-insecure-requests": "1",
    "user-agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) snap Chromium/74.0.3729.169 Chrome/74.0.3729.169 Safari/537.36"
}
data = {
    "user_name": username,
    "password": password,
    "Submit": "Submit",
    "login": "true"
}
with requests.Session() as session:
    # the initial GET stores the site's cookies on the session
    session.get("https://aavtrain.com/index.asp")
    # the POST submits the login form; the session sends the stored cookies along
    loggedIn = session.post("https://aavtrain.com/index.asp", headers=headers, data=data)
    # ... do stuff after logged in ...
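
As a follow-up sketch, once logged in you can run re (or BeautifulSoup) over the response body. The pattern below is only illustrative; the real markup of the page may differ:

import re

# pull every href attribute out of the logged-in page
# (loggedIn is the response from the session block above)
links = re.findall(r'href="([^"]+)"', loggedIn.text)
print(links)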

I hope this helps; ask any lingering questions and I'll get back to you.

Upvotes: 1
