Cleaning html text, issues with replace

Question

I have an editor that ads:


 or empty `p`, and I want to rplace or remove them.

I use:

  value = value.replace('
', '
').replace('
','').strip('
')

The problem is that sometimes remove everything, an in all cases for the first paragraph I always get: p>(removes the first chracter in tag).

Odysseas · Accepted Answer

Your error is in how you use the strip method, which removes any leading or trailing sequence of the ' ' characters. So hello would be stripped to hello, for example.



If you want to remove any 
 in the beginning and in the end of the value string, you can do it like so:

if value.startswith('
'):
    value = value[4:]
if value.endswith('
'):
    value = value[:-4]

Cleaning html text, issues with replace

Answers (2)

Related Questions