Regex to remove duplicate Characters completely (without leaving them behind)

Question

I need a regex to remove duplicate characters from a string like so: abcdeafghid to bcefghi, removing a and d

I have no idea how I would go about this honestly. I can find a lot about removing duplicates, but they always leave behind one instance of the duplicated character.

The order of the characters at the end doesn't matter, but since I'm working with CJK languages it should support those. How would I go about this?

AnalystCave.com · Accepted Answer

Irrelevant of you language you can use the pseudocode below:

Dictionary dict 
for i = 0 to Len(your_string)
  if Not(dict.Exits(your_string[i])) then 
     dict.Add(your_string[i],1)
  else
     dict[your_string[i]] += 1
  end if
Next i

int index = 0 
while 1
  if dict[your_string[index]] > 1 then
   your_string = replace(your_string, your_string[index],"")
   index = 0
  else 
   index +=1
   if index >= Len(your_string) then break
  end if  
end while

Regex to remove duplicate Characters completely (without leaving them behind)

Answers (2)

Related Questions