How to extract text from between the <br> tags in BeautifulSoup

Question

What I am trying to do is scrape only the company names from the element, which contains multiple <br> tags. FYI, some elements have one company name while others have two. See the element below:



License #: 
332673


BAY AREA REMODELING CO


5230 EAST 12TH


OAKLAND, CA 94601


Effective Dates:
09/16/1982 - 06/30/1984

License #: 
377133


SAVAGE ROOFING COMPANY


3055 ALVARADO STREET


SAN LEANDRO, CA 94577


Effective Dates:
 07/01/1982 - 03/31/1985

So from the above element, I want the output:

BAY AREA REMODELING CO
SAVAGE ROOFING COMPANY

Rakesh · Accepted Answer

Find each p tag, then read its next_sibling: the text node that directly follows the tag is the company name.

Example:

from bs4 import BeautifulSoup

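# The markup is assumed here: each "License #" line sits in a <p>,
# and the lines that follow it are separated by <br> tags.
html = """
<p>License #: 332673</p>
BAY AREA REMODELING CO
<br>
5230 EAST 12TH
<br>
OAKLAND, CA 94601
<br>
Effective Dates:
09/16/1982 - 06/30/1984
<p>License #: 377133</p>
SAVAGE ROOFING COMPANY
<br>
3055 ALVARADO STREET
<br>
SAN LEANDRO, CA 94577
<br>
Effective Dates:
 07/01/1982 - 03/31/1985
"""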

soup = BeautifulSoup(html, 'html.parser')
# The company name is the text node that immediately follows each <p> tag.
for p in soup.find_all('p'):
    print(p.next_sibling.strip())

Output:

BAY AREA REMODELING CO
SAVAGE ROOFING COMPANY
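
The loop above assumes every p tag is followed directly by a text node. If a p tag happens to be the last node, or is followed by another tag, calling strip() on its next_sibling would fail. A slightly more defensive sketch is shown below. The markup in SAMPLE_HTML and the helper name company_names are assumptions made for illustration, not taken from the real page:

```python
from bs4 import BeautifulSoup, NavigableString

# Hypothetical markup: the placement of <p> and <br> is an assumption.
SAMPLE_HTML = """
<p>License #: 332673</p>
BAY AREA REMODELING CO
<br>
5230 EAST 12TH
<br>
OAKLAND, CA 94601
<br>
Effective Dates:
09/16/1982 - 06/30/1984
<p>License #: 377133</p>
SAVAGE ROOFING COMPANY
<br>
3055 ALVARADO STREET
<br>
SAN LEANDRO, CA 94577
<br>
Effective Dates:
 07/01/1982 - 03/31/1985
"""


def company_names(html):
    """Return the non-empty text that directly follows each <p> tag."""
    soup = BeautifulSoup(html, "html.parser")
    names = []
    for p in soup.find_all("p"):
        sibling = p.next_sibling
        # next_sibling can be None (nothing follows) or another tag,
        # so only plain text nodes are accepted here.
        if isinstance(sibling, NavigableString):
            text = sibling.strip()
            if text:
                names.append(text)
    return names


print(company_names(SAMPLE_HTML))
```

With the sample markup this should print ['BAY AREA REMODELING CO', 'SAVAGE ROOFING COMPANY']. Because the helper returns a list, it gives one entry per p tag whether the element holds one company or two.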
