javascript regex paragraph not ending with full stop

Question

I have a document containing lots of paragraphs. Some of these are subheadings, which are identifiable because they do not end with a full stop, like this:

This is a title
This is a sentence.
This is a sentence.
This is a sentence.
This is a sentence.
This is a title
This is a sentence.
This is a sentence.
This is a sentence.
This is a sentence.
This is a title
This is a sentence.
This is a sentence.
This is a sentence.
This is a sentence.

I want to put the titles in an h3 tag, but not the sentences, so I need to find and replace all paragraphs not ending in a full stop. I need to do this with JavaScript. I have tried the following, but each attempt fails. In each case the text is first read into a variable called body.

body = body.replace(/<p>(.*?)(?!\.)<\/p>/gi, "<h3>$1</h3>");

That just makes everything bold

This would work, I think:

body = body.replace(/<p>(.*?)(?<!\.)<\/p>/gi, "<h3>$1</h3>");

but JavaScript does not recognise negative lookbehind.

Any ideas how I do this?

Denys Séguret · Accepted Answer

You could do the replacement paragraph by paragraph, which would be cleaner than running a regex over the whole HTML:

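// querySelectorAll returns a static list, so replacing elements while iterating is safe
[].forEach.call(document.querySelectorAll('p'), function(p){
     // No ., ? or ! at the end: treat the paragraph as a title
     if (!/[.?!]\s*$/.test(p.innerHTML)) p.outerHTML="<h3>"+p.innerHTML+"</h3>";
});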

This is a title
This is a sentence.
This is a sentence.
You want to handle questions, right?
I'm sure you do!
This is a title containing 1.2 million
This is a sentence.
This is a sentence.
This is a sentence.
This is a sentence.
This is a title
This is a sentence.
This is a sentence.
This is a sentence.
This is a sentence.

This way there's no problem if your HTML evolves (will you really always have only P elements?).
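If the markup is only available as a string (as with the body variable in the question) and there is no DOM to walk, the same end-of-paragraph test can be sketched with a replacement callback instead of a lookbehind. This is only a minimal sketch: promoteTitles is a hypothetical helper name, and it assumes plain <p> tags with no attributes and no nested paragraphs.

```javascript
// Hypothetical helper (not part of the original answer): works on an HTML
// string and promotes <p> elements that do not end in ., ? or ! to <h3>.
// Assumes bare <p>...</p> pairs without attributes.
function promoteTitles(body) {
  // [\s\S]*? lazily matches any characters, including newlines, up to the nearest </p>
  return body.replace(/<p>([\s\S]*?)<\/p>/gi, function (match, inner) {
    // Paragraphs ending in sentence punctuation are returned unchanged
    if (/[.?!]\s*$/.test(inner)) return match;
    return "<h3>" + inner + "</h3>";
  });
}

var body = "<p>This is a title</p><p>This is a sentence.</p><p>Title with 1.2 million</p>";
console.log(promoteTitles(body));
// prints: <h3>This is a title</h3><p>This is a sentence.</p><h3>Title with 1.2 million</h3>
```

Doing the punctuation check inside the callback keeps the outer regex simple and sidesteps lookbehind entirely, which was only added to the language in ES2018 and may not be available in older engines.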
