Create sequence Unique IDs of 4 to 7 digits in a dataframe in R

Question

> dput(Data)
structure(list(DISTRICT = c("KARIMGANJ", "KARIMGANJ", "KARIMGANJ", 
"KARIMGANJ", "KARIMGANJ", "HAILAKANDI", "HAILAKANDI", "HAILAKANDI", 
"CACHAR", "CACHAR")), row.names = c(NA, -10L), class = "data.frame")

I want to create Unique IDs I have >thousand rows but logic will be same I guess. How do I create this new column as shown in the expected output? Note that sequence can be of 7 digits as well.

#     
     DISTRICT   ID
1   KARIMGANJ 1111
2   KARIMGANJ 1111
3   KARIMGANJ 1111
4   KARIMGANJ 1111
5   KARIMGANJ 1111
6  HAILAKANDI 1112
7  HAILAKANDI 1112
8  HAILAKANDI 1112
9      CACHAR 1113
10     CACHAR 1113

GKi · Accepted Answer

You can use factor to create a unique ID.

Data$ID <- unclass(factor(Data$DISTRICT)) + 1000
#     DISTRICT   ID
#1   KARIMGANJ 1003
#2   KARIMGANJ 1003
#3   KARIMGANJ 1003
#4   KARIMGANJ 1003
#5   KARIMGANJ 1003
#6  HAILAKANDI 1002
#7  HAILAKANDI 1002
#8  HAILAKANDI 1002
#9      CACHAR 1001
#10     CACHAR 1001

Or to start with the first hit with 1 using match and unique.

Data$ID <- match(Data$DISTRICT, unique(Data$DISTRICT)) + 1000
#     DISTRICT   ID
#1   KARIMGANJ 1001
#2   KARIMGANJ 1001
#3   KARIMGANJ 1001
#4   KARIMGANJ 1001
#5   KARIMGANJ 1001
#6  HAILAKANDI 1002
#7  HAILAKANDI 1002
#8  HAILAKANDI 1002
#9      CACHAR 1003
#10     CACHAR 1003

Create sequence Unique IDs of 4 to 7 digits in a dataframe in R

Answers (2)

Related Questions