sample column without duplicates

Question

I'm currently writing a custom function to achieve this, but I was wondering if there was a simple, built-in function in R that would achieve the same goals.

I have data like:

stringVariable1     stringVariable2

string1             a
string1             b
string1             d
string2             e
string2             a
string3             b

And I want to shuffle the data in stringVariable2, but I don't want duplicates in respect to the different stringVariables in 1.

So this wouldn't be acceptable (as 'b' is duplicated with respect to string1):

stringVariable1     stringVariable2

string1             b
string1             b
string1             d
string2             a
string2             e
string3             d

But this would:

stringVariable1     stringVariable2

string1             b
string1             e
string1             d
string2             a
string2             e
string3             d

So essentially I'm trying to randomise the stringVariable2, without replacement with respect to the different stringVariable1's. Is creating a custom function the only way to do this?

Thanks!

sample column without duplicates

Answers (1)

Related Questions