Remove html linebreaks between <ul> tags

Question

I have a CMS that also allows people to use HTML code, but nl2br is applied at the end of the function, which makes this:

into this:

Now I want to remove these <br />'s that exist between <ul> and </ul>.

I already found another question which asks almost the same, but for newlines. I've integrated this inside my CMS, but for one client all the content is already filled in, so I have to fix this after the newlines have been replaced with <br />'s.

The other question provides this as a regex to match within <ul>:

/(?<=<ul>|<\/li>)\s*?(?=<\/ul>|<li>)/is

I'd think something like this:

/(?<=<ul>|<\/li>)(<br>|<br\/>|<br \/>)(?=<\/ul>|<li>)/is

Would do the trick, but it doesn't. What am I missing?

EDIT

I am very open to DOMDocument solutions; if there's a way to query linebreaks with XPath, that would probably fix my problem.

Karolis · Accepted Answer

In the example you provided, the <br /> tags are surrounded by some white-space (at least by newline characters), so this needs to be reflected in the corresponding regular expression.

/(?<=<ul>|<\/li>)(\s*<br>\s*|\s*<br\/>\s*|\s*<br \/>\s*)(?=<\/ul>|<li>)/is
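
To see the pattern in action, here is a minimal sketch with preg_replace. The sample list and the variable names ($html, $pattern, $clean) are made up for illustration; the regex assumes the <ul> and <li> tags carry no attributes.

```php
<?php
// Hypothetical input: what nl2br() produces for a small list.
// nl2br() inserts "<br />" in front of every newline and keeps the newline.
$html = nl2br("<ul>\n<li>one</li>\n<li>two</li>\n</ul>");

// Match <br>, <br/> or <br /> (plus any surrounding whitespace) that sits
// directly after <ul> or </li> and directly before </ul> or <li>.
$pattern = '/(?<=<ul>|<\/li>)(\s*<br>\s*|\s*<br\/>\s*|\s*<br \/>\s*)(?=<\/ul>|<li>)/is';

// Replace every match with an empty string.
$clean = preg_replace($pattern, '', $html);

echo $clean, PHP_EOL; // <ul><li>one</li><li>two</li></ul>
```

Note that the surrounding whitespace is removed together with the <br />, so the newlines between the list items disappear as well.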

In many cases regular expressions are NOT the best way for parsing HTML (I definitely agree with the comments above/below), but they are always good enough for some particular purposes.
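
For the DOMDocument route mentioned in the question's edit, something along these lines might work. This is only a sketch under assumptions: the sample markup is hypothetical, the wrapper <div> is added just to give the fragment a single root, and it relies on the parser keeping each stray <br> as a direct child of the <ul>.

```php
<?php
// Hypothetical input: a list that has already been through nl2br().
$html = "<ul><br />\n<li>one</li><br />\n<li>two</li><br />\n</ul>";

$doc = new DOMDocument();
// Wrap the fragment in a <div> so it has one root element; the flags ask
// libxml not to add <html>/<body> wrappers or a doctype.
$doc->loadHTML('<div>' . $html . '</div>', LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);

$xpath = new DOMXPath($doc);
// Select only <br> elements that are direct children of a <ul>.
// iterator_to_array() takes a snapshot so nodes can be removed while looping.
foreach (iterator_to_array($xpath->query('//ul/br')) as $br) {
    $br->parentNode->removeChild($br);
}

// Serialize the children of the wrapper <div> back into a string.
$clean = '';
foreach ($doc->documentElement->childNodes as $child) {
    $clean .= $doc->saveHTML($child);
}

echo $clean, PHP_EOL;
```

Unlike the regex, the XPath query //ul/br does not care about attributes on the tags or about how the <br> is written, but the whitespace between the list items is left in place.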
