BeautifulSoup and scraping href's isn't working

Question

Again I am having trouble scraping href's in BeautifulSoup. I have a list of pages that I am scraping and I have the data but I can't seem to get the hrefs even when I use various codes that work in other scripts.

So here is the code and my data will be below that:

import requests
from bs4 import BeautifulSoup


with open('states_names.csv', 'r') as reader:
    states = [states.strip().replace(' ', '-') for states in reader]


url = 'https://www.hauntedplaces.org/state/alabama'

for state in states:
    page = requests.get(url+state)
    soup = BeautifulSoup(page.text, 'html.parser')
    links = soup.findAll('div', class_='description')
    # When I try to add .get('href') I get a traceback error. Am I trying to scrape the href too early? 
    h_page = soup.findAll('h3')

Gaines Ridge Dinner Club
Purifoy-Lipscomb House
Kate Shepard House Bed and Breakfast
Cedarhurst Mansion
Crybaby Bridge
Gaineswood Plantation
Mountain View Hospital

teller.py3 · Accepted Answer

This works perfectly:

from bs4 import BeautifulSoup
import requests

url = 'https://www.hauntedplaces.org/state/Alabama'

r = requests.get(url)
soup = BeautifulSoup(r.text, 'lxml')

for link in soup.select('div.description a'):
    print(link['href'])

BeautifulSoup and scraping href's isn't working

Answers (2)

Related Questions

BeautifulSoup and scraping href&#39;s isn&#39;t working

Answers (2)

Related Questions

BeautifulSoup and scraping href's isn't working