How to validate the format of a string in Ruby, while extracting the matches?

Question

What I want

validate that a string matches this format: /^(#\d\s*)+$/ (#1 #2 for instance).
Grab all the numbers with the hash, something like #. It doesnt have to be a MatchData object, any type of array, enumerable would work.

My issue

When using match, it just matches the last occurence:

/^(#\d\s*)+$/.match "#1 #2"
# => #

When I use scan, it "works":

"#1 #2".scan /#\d/
# => ["#1", "#2"]

But I dont believe I can validate the format of the string, as it would return the same for "aaa #1 #2".

The question

Can I, with only 1 method call, both validates that my string matches /^(#\d\s*)+$/ AND grab all the instances of #number?

I kinda feel bad about asking this since I've been using ruby for a while now. It seems simple but I can't get that to work.

Wiktor Stribiżew · Accepted Answer

Yes, you may use

s.scan(/(?:\G(?!\A)|\A(?=(?:#\d\s*)*\z))\s*\K#\d/)

See the regex demo

Details

(?:\G(?!\A)|\A(?=(?:#\d\s*)*\z)) - two alternatives:
- \G(?!\A) - the end of the previous successful match
- | - or
- \A(?=(?:#\d\s*)*\z) - start of string (\A) that is followed with 0 or more repetitions of # + digit + 0+ whitespaces and then followed with the end of string
\s* - 0+ whitespace chars
\K - match reset operator discarding the text matched so far
#\d - a # char and then a digit

In short: the start of string position is matched first, but only if the string to the right (i.e. the whole string) matches the pattern you want. Since that check is performed with a lookahead, the regex index stays where it was, and then matching occurs all the time ONLY after a valid match thanks to the \G operator (it matches the start of string or end of previous match, so (?!\A) is used to subtract the start string position).

Ruby demo:

rx = /(?:\G(?!\A)|\A(?=(?:#\d\s*)*\z))\s*\K#\d/
p "#1 #2".scan(rx)
# => ["#1", "#2"]
p "#1 NO #2".scan(rx)
# => []

How to validate the format of a string in Ruby, while extracting the matches?

What I want

My issue

The question

Answers (2)

Related Questions