using regex or beautiful soup to grab someones website from instagram

Question

I want to grab someones website from their instagram bio. Instagram hides this website in text/javascript tag so I can't grab the url like I would normally with an anchor from beautifulsoup. Here is a fragment of the page source that contains what I'm trying to capture:

...,"country_block":false,"external_url":"https://www.brittanyannecohen.com/pattern-control","blocked_by_viewer":false,...

I noticed that the link I want to grab is always attached to an external_url attribute in a dictionary (see picture below).

I attampted to grab this url through using regex but it's not working , see code below

url=re.findall("[\"external_url\":]['https?://(?:[-\w.]|(?:%[\da-fA-F]{2}))+']",soup)

but I get error :

bad character range [-\w at position 31

using regex or beautiful soup to grab someones website from instagram

Answers (1)

Related Questions