How to extract the year via regex from a string in Ruby

Question

I'm trying to extract the year from a string with this format:

dataset_name = 'ALTVALLEDAOSTA000020191001.json'

I tried:

dataset_name[/<\b(19|20)\d{2}\b>/, 1]
/\b(19|20)\d{2}\b/.match(dataset_name)

I'm still reading the docs but so far I'm not able to achieve the result I want. I'm really bad at regex.

ggorlen · Accepted Answer

Since your dataset name always ends in yyyymmdd.json, you can take a slice of the last 13-9 characters counting from the rear:

irb(main):001:0> dataset_name = 'ALTVALLEDAOSTA000020191001.json'
irb(main):002:0> dataset_name[-13...-9]
=> "2019"

You can also use a regex if you want a bit more precision:

irb(main):003:0> dataset_name =~ /(\d{4})\d{4}\.json$/
=> 18
irb(main):004:0> $1
=> "2019"

Answers (2)