Issue displaying Images from website using Regex

Question

Im currently trying to scrape a website for all images found. My code successfully displays all images including .jpg, .bmp & .gif. However it also displays the height of these images as well. I was wondering how I could change my code to remove the height of the image from the output as well as tidying up the output providing just the clean links as shown in the attachment. Below I have attached both a link showing my codes output as well as my current code below. I have also attached what my ideal output would be. Thanks for any help, appreciated!

My Code Output: https://i.sstatic.net/eferl.jpg

Output I am looking for: https://i.sstatic.net/RytX4.jpg

files = re.findall(r'\

akash karothiya · Accepted Answer

You can extract image src directly

>>> images = ['', '']
>>> for image in images:
        print(re.search(r']*src="([^"]*)"', image).group(1))

demo.jpg
demo2.jpg

If your input is all string, you may use findall and then iterate over it

>>> images = ''' '''
>>> res = re.findall(r']*src="([^"]*)"', images)
>>> for img in res:
        print(img)
demo.jpg
demo2.jpg

Issue displaying Images from website using Regex

Answers (2)

Related Questions