Is it possible to remove duplicates from a string in Big Query?

Question

So been working with some data and currently have output along the lines of

Customer | Reasons
Customer1 | Answer1, Answer3, Answer2, Answer4, Answer5, Answer1, Answer3, Answer1

Is there anyway in Big Query standard sql to rid myself of duplicates within this string and end with the output below?

Customer | Reasons
Customer1 | Answer1, Answer3, Answer2, Answer4, Answer5

Thanks in advance

Elliott Brossard · Accepted Answer

Assuming I understood the question correctly, you want something like:

SELECT
  (SELECT STRING_AGG(DISTINCT s, ', ')
   FROM UNNEST(SPLIT(Customer1, ', ')) AS s) AS Customer1
FROM dataset.table

This splits the string on the ', ' separator, then aggregates the substrings into a new string with duplicates removed using the DISTINCT keyword.

Is it possible to remove duplicates from a string in Big Query?

Answers (2)

Related Questions