Strip email and text from a full tag

Question

How I can correctly get out email and text between < a href.. > < / a > tag ?

My code:

import re
import urllib.request, urllib.parse, urllib.error
from bs4 import BeautifulSoup


url = input("Enter url -")
html = urllib.request.urlopen(url).read()
soup = BeautifulSoup(html, "html.parser")

# Retrieve all of the anchor tags
count = 0
tags = soup.find_all(href=re.compile("mailto"))
for tag in tags:
    count += 1
    print(tag)
print("Total amount of mails:", count)

My programm is receiving a full tag John Test and I want to get only email adress and name. How can I correctly strip it out ?

Strip email and text from a full tag

Answers (1)

Related Questions