Regex: Output in between two specific words

Question

Text:

ITEM 1A.    RISK FACTORS 

    The following is a description of the principal risks inherent in our business.

ITEM 1B.    UNRESOLVED STAFF COMMENTS 

    Not Applicable.

Regex:

(?<=RISK).*

Got this:

ITEM 1A.    RISK **FACTORS** 

    The following is a description of the principal risks inherent in our business.

ITEM 1B.    UNRESOLVED STAFF COMMENTS 

    Not Applicable.

Expected:

ITEM 1A.    RISK **FACTORS

    The following is a description of the principal risks inherent in our business.

ITEM 1B.    UNRESOLVED STAFF COMMENTS 

    Not Applicable.**

How can I get all the text after the word RISK and before ITEM 1B?

Tim Biegeleisen · Accepted Answer

The following pattern should work:

(?<=RISK)(.*?)(?=ITEM 1B)

Note carefully that in the demo below I am using DOT ALL mode. This means that .* can match across newlines, which is the behavior you want here.

Demo
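As a rough illustration of the DOT ALL behavior, here is a minimal sketch in Python. The choice of Python and its `re.DOTALL` flag is an assumption (the question does not say which regex tool is in use), and the variable names are only illustrative.

```python
import re

# Sample text from the question.
text = (
    "ITEM 1A.    RISK FACTORS \n"
    "\n"
    "    The following is a description of the principal risks inherent in our business.\n"
    "\n"
    "ITEM 1B.    UNRESOLVED STAFF COMMENTS \n"
    "\n"
    "    Not Applicable.\n"
)

# re.DOTALL lets '.' match newlines, so the lazy group can span several lines.
pattern = re.compile(r"(?<=RISK)(.*?)(?=ITEM 1B)", re.DOTALL)

match = pattern.search(text)
between = match.group(1) if match else None
print(repr(between))

# Without the flag, '.' stops at the first newline and nothing matches.
without_flag = re.search(r"(?<=RISK)(.*?)(?=ITEM 1B)", text)
```

The match is case-sensitive, so the lowercase "risks" inside the paragraph does not act as a starting point.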

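If you can't use lookarounds for some reason, you can still proceed as long as your regex tool supports capture groups: match RISK(.*?)ITEM 1B and read the first capture group, which holds only the text between the two markers.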
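A possible sketch of the capture-group approach, again assuming Python's `re` module and a shortened, made-up sample string: the two marker words are matched literally and only the group between them is read back.

```python
import re

# Shortened sample in the same shape as the question's text (illustrative only).
sample = (
    "ITEM 1A.    RISK FACTORS \n"
    "\n"
    "    Some body text.\n"
    "\n"
    "ITEM 1B.    UNRESOLVED STAFF COMMENTS \n"
)

# No lookarounds: the full match includes both markers,
# while group 1 holds only what lies between them.
m = re.search(r"RISK(.*?)ITEM 1B", sample, re.DOTALL)
captured = m.group(1) if m else None
print(repr(captured))
```

The trade-off is that the overall match now contains RISK and ITEM 1B, so the caller has to read group 1 rather than the whole match.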

If your regex flavor does not support DOT ALL, then one possible workaround is to use [\s\S]*:

(?<=RISK)([\s\S]*?)(?=ITEM 1B)
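To show that the character-class workaround needs no flag, here is a small sketch. Python is used only as an assumed host language and does support DOT ALL itself; the point is that `[\s\S]` matches any character, newlines included, with default flags.

```python
import re

# Illustrative sample; not the full text from the question.
doc = "RISK FACTORS \n\n    Body text.\n\nITEM 1B."

# [\s\S] means "whitespace or non-whitespace", i.e. any character at all,
# so no DOT ALL flag is required for the match to cross line breaks.
found = re.search(r"(?<=RISK)([\s\S]*?)(?=ITEM 1B)", doc)
section = found.group(1) if found else None
print(repr(section))
```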
