Python: how should this regex work?

Question

I have a regex that should find all the "heading lines", that is, lines that contain some text and do not end with a period, ? or !:

import re

tit_pat = re.compile(r"([\w ]+?)(?![!?.])\n", re.UNICODE)
res = tit_pat.findall(data)

Example:

Chapter 1x test
This a test a test test test test. This a test with some text and more text.This a test with some text and more text some text and more text. This is some more text some more text some more tex some more text chapter aaa
This a test. This a test with some text and more text some text and more text some text and more text some text and more text.
bbbb
The end.

The regex finds all the "heading lines" that contain some text with no period, followed by a newline. That is expected, because there is a negative lookahead that checks that there is no period (or ! or ?) before accepting. However, I may have a sentence that starts on one line and ends with a period on the next line. The regex is not finding that line, which contains text without a period. Is there an explanation for this behavior?
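To make the behavior easier to inspect, here is a minimal sketch that runs the pattern from the question on an abbreviated copy of the sample text. It rests on two assumptions that are not in the original post: the pattern ends in `\n`, and `line_pat` is a hypothetical alternative that is only appropriate if the goal is to match whole lines.

```python
import re

# Abbreviated version of the sample text from the question.
data = (
    "Chapter 1x test\n"
    "This a test. This is some more text chapter aaa\n"
    "This a test. This a test with some text and more text.\n"
    "bbbb\n"
    "The end.\n"
)

# Pattern from the question. The lookahead never rejects anything here:
# the next character has to be "\n", which is never one of "!", "?" or ".".
tit_pat = re.compile(r"([\w ]+?)(?![!?.])\n", re.UNICODE)

# The pattern is not anchored to the start of a line, so a match may begin
# in the middle of a line, right after the last period on that line.
unanchored = tit_pat.findall(data)

# Hypothetical alternative (an assumption, not the asker's code): anchor
# both ends of each line so that only whole lines consisting of word
# characters and spaces can match.
line_pat = re.compile(r"^[\w ]+$", re.MULTILINE)
whole_lines = line_pat.findall(data)

print(unanchored)
print(whole_lines)
```

With this input, `unanchored` contains the two heading lines plus the fragment that follows the last period on the second line (including its leading space), while `whole_lines` contains only `Chapter 1x test` and `bbbb`. Which of the two is wanted depends on whether a line that merely ends without a period should count as a heading.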

