Reputation: 47
I have been struggling with a certain column in my data where the source data is dirty and i cant find joins because of this.
So What I am trying to do is:
In example
if [website_reference_number] = "CC-DE-109" >>> Leave it like that
if [website_reference_number] = "CC-DE-109-Duplicate" >>> change to CC-DE-109
if [website_reference_number] = "CC-DE-109 Duplicate" >>> change to CC-DE-109
if [website_reference_number] = "CC-DE-109-Duplicate-Duplic" >>> change to CC-DE-109
So the rules are in human terms {Any 2 Letters}-{Any 2 Letters}-{AnyAmountOfNumbers}
Upvotes: 1
Views: 155
Reputation: 10360
Use this pattern:
/([A-Z]{2})-([A-Z]{2})-([0-9]+).*/
Upvotes: 1