Regex to match a specific pattern multiple times within a sentence

Question

I have a problem with a LaTeX text file that consists of multiple sentences, e.g.

Aaa \cref{fig:1}. Bbb \cref{fig:2} bbb \cref{fig:3}. Ccc \cref{fig:4}. Ddd \cref{fig:5} ddd \cref{fig:6} ddd \cref{fig:7}.

What I need to find out is how to isolate the \cref{fig:xxx} parts in each sentence. The catch is that the regex should only consider sentences in which \cref{fig:xxx} occurs more than once.

A good result would be if the regex could return fig:2 and fig:3 from sentence bbb, as well as fig:5, fig:6, and fig:7 from sentence ddd.

I have to use regular expressions for the search in TextMate (a text editor).

Jan · Accepted Answer

In addition to my comment, you could come up with a recursive approach. However, looking at the documentation, recursion does not seem to be supported in TextMate. In that case, you could simply repeat the pattern one more time (which satisfies your requirement of sentences with more than one occurrence):

(?:\\cref\{(fig:\d+)\})(?:[^.]+?(?:\\cref\{(fig:\d+)\}))+

Broken down, this looks for \cref{...} and captures the inner fig: plus digits, then lazily consumes one or more characters that are not a dot ([^.]+?) and matches the first subpattern again; that whole tail must occur at least once. As already mentioned in the comments, you will likely need to play around with the sentence conditions (i.e. what is considered a sentence; this is the [^.] part). See a demo of the approach on regex101.com.
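
For anyone who wants to try the idea outside the editor, here is a minimal sketch in Python. It assumes Python's re module rather than TextMate's own regex engine, and the helper name multi_cref_labels is made up for illustration. One thing it makes visible: a repeated capture group keeps only its last repetition, so the second group alone would give fig:7 but not fig:6 for sentence ddd. The sketch therefore uses the big pattern only to find the qualifying stretches and then runs the single \cref pattern over each of them.

```python
import re

# A single \cref{fig:N}; the label is captured in group 1.
CREF = r"\\cref\{(fig:\d+)\}"

# The pattern from the answer: one \cref, then (non-dot text + another \cref)
# repeated at least once, so only runs with two or more references match.
MULTI = re.compile(r"(?:\\cref\{(fig:\d+)\})(?:[^.]+?(?:\\cref\{(fig:\d+)\}))+")


def multi_cref_labels(text):
    """Return one list of labels per sentence containing more than one \\cref.

    Hypothetical helper: "sentence" here simply means text without a dot,
    mirroring the [^.] part of the pattern.
    """
    # A repeated group only remembers its last repetition, so re-scan the
    # whole matched stretch to collect every label in it.
    return [re.findall(CREF, m.group(0)) for m in MULTI.finditer(text)]


sample = (
    r"Aaa \cref{fig:1}. Bbb \cref{fig:2} bbb \cref{fig:3}. "
    r"Ccc \cref{fig:4}. Ddd \cref{fig:5} ddd \cref{fig:6} ddd \cref{fig:7}."
)

print(multi_cref_labels(sample))
# [['fig:2', 'fig:3'], ['fig:5', 'fig:6', 'fig:7']]
```

Inside TextMate itself there is no second pass, so the practical use of the pattern there is to locate or select the qualifying stretches rather than to list every label.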
