Regex capture repeating groups

Question

I have an input that looks like this:

0a1b1a2b2a3b3a4b

I need to capture key-value pairs (e.g. id - val), or at least an array of groups like the following: [0, a1b, 1, a2b, 2, a3b, 3, a4b]

Capturing just one pair (i.e. when the input contains only a single pair) works with this:

(?>(?:(\d+))(?:(.+)))?

the result being: [0, a1b].

But it doesn't work for multiple pairs: it captures 0 as the first group, and the second group then takes the rest of the input (excluding the first tag), as in: [0, a1b1a2b2a3b3a4b]

Can someone point me in the right direction?

UPDATE: what if and are some special chars, for example 0x8F and 0x9F?

Albina · Accepted Answer

This regex matches keys and then values.

(?<=)(\d+)(?=)|(?<=)[a-z\d]*(?=)

There are 2 alternatives, separated by |:

(?<=)(\d+)(?=) matches a key \d+ between and using positive lookbehind and lookahead
- (?<=) is a positive lookbehind
- (?=) is a positive lookahead
(?<=)[a-z\d]*(?=) matches a value between and using positive lookbehind and lookahead
- [a-z\d]* matches a value
- (?<=) is a positive lookbehind
- (?=) is a positive lookahead
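
For illustration, here is a minimal Python sketch of the same idea. The delimiters are not visible in the post as shown here, so `<id>` and `<val>` below are hypothetical markers chosen only for the example, and `parse_pairs` is a made-up helper name. It is also a variant rather than the exact pattern above: it consumes the markers directly and uses a lazy quantifier with a single lookahead, so each match yields one key and one value.

```python
import re

# Hypothetical markers: the real delimiters are not shown in the post.
KEY_MARK = "<id>"
VAL_MARK = "<val>"


def parse_pairs(text, key_mark=KEY_MARK, val_mark=VAL_MARK):
    """Return a list of (key, value) pairs found in text."""
    k = re.escape(key_mark)
    v = re.escape(val_mark)
    # Key: digits right after the key marker.
    # Value: as few characters as possible, stopping just before the
    # next key marker or at the end of the input.
    pattern = re.compile(rf"{k}(\d+){v}(.*?)(?={k}|\Z)", re.DOTALL)
    return [(int(key), value) for key, value in pattern.findall(text)]


sample = "<id>0<val>a1b<id>1<val>a2b<id>2<val>a3b<id>3<val>a4b"
print(parse_pairs(sample))

# The same function with single special characters as markers,
# assuming 0x8F opens a key and 0x9F opens a value.
special = "\x8f0\x9fa1b\x8f1\x9fa2b"
print(parse_pairs(special, key_mark="\x8f", val_mark="\x9f"))
```

The lazy `.*?` is what stops the value at the next key marker; a greedy `.+` would run to the end of the input, which is the behaviour described in the question.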

regex101.com
