Unwanted characters in regular expressions python

Question

So, I have a site that has an XML string, and I'd like my program to return a list of strings that appear between two strings. Here's my code:

 response = requests.get(url)


 artists=re.findall(re.escape('')+'(.*?)'+re.escape(''),str(response.content))
 print(artists)

This returns a list of strings. The problem is, some strings have unwanted characters in them. For example, one of the strings in the list is "Somethin\' \'Bout A Truck" and I'd like it to be 'Somethin' 'Bout A Truck'.

Thanks in advance.

P_O_I_S_O_N · Accepted Answer

I think the beautiful soup(bs4) will solve this problem and it will also support for higher version of python 3.4

Unwanted characters in regular expressions python

Answers (2)

Related Questions