Return text from tags where outermost HTML tags applies to all text nodes within - jquery

Question

I have paragraphs of text that might be like this:


   
      Some text

or


   
      Some more text

or

Yet more text

However many nested tags there are, I'm able to get just the text, simply using $('p').text(). The problem is when pops up in the middle. In that case, whatever tag the text is in gets broken up. So for example, this:

Some more text

will turn into this:

Some more text

So you see, there are now 2 text nodes in the tag, not just one. What I want to do is to get just the text with it's original parent tags, with treated as just another text node, but without -induced-tag-split-up intrusion. For example, given the 2-node HTML above, I just want a function that returns this:

Some more text

That would be fine for a few given formats, but there could different types of HTML nesting that I need to retain (such as
or
or
and so on.

Edit

Rather than getting lost in loops, I suppose the easiest way is to get the $('p').html() and simply chop away all tags around ? On the left there of would be closing tags, on the right there would be opening tags. Would there be regex solution for this then?

Tomalak · Accepted Answer

find each within a
for each of those elements, compare the names of the immediately preceding and following elements
if their names are equal, move the and the contents of the following element into the preceding element
remove the now empty following element

This:

$("p").clone().find("br").each(function() {
  var $this = $(this), $prev = $this.prev(), $next = $this.next();
  if ( $prev.length && $prev.prop("nodeName") === $next.prop("nodeName") ) {
    $prev.append( $this, $next.contents() );
    $next.remove();
  }
}).end().each(function () {
    console.log( $(this).html() );
});

^{(Note that I use clone() to avoid modifying the original.)}

When applied to


  
     Some 
  
  

  
     more text

writes this to the console


   Some 


   more text

http://jsfiddle.net/Tomalak/y3hSp/

Here is an iterative approach that collapses adjacent nodes that are separated by , in form of a jQuery plugin:

$.fn.extend({
    collapseBreaks: function () {
        return this.each(function () {
            var done = false;

            while (!done) {
                done = true;

                $(this).find("br").each(function() {
                    var $this = $(this), 
                        $prev = $this.prev(), 
                        $next = $this.next();

                    if ( 
                        $prev.length 
                        && $prev.prop("nodeName") === $next.prop("nodeName") 
                    ) {
                        $prev.append( $this, $next.contents() );
                        $next.remove();
                        done = false;
                    }
                });       
            }
        });
    }
});

Use as

$("p").collapseBreaks();

http://jsfiddle.net/Tomalak/FJFgk/3/

Return text from tags where outermost HTML tags applies to all text nodes within - jquery

Edit

Answers (2)

Related Questions