repeat random numbers each n rows

Question

I have a large dataframe with 2 columns, like this:

                    dtm                SoC
      0     2018-03-01 00:00:00 +0000   39
      1     2018-03-01 00:00:01 +0000   39
      2     2018-03-01 00:00:02 +0000   39
      3     2018-03-01 00:00:03 +0000   39
     ...               ...              ...
2678393     2018-04-01 00:59:53 +0100   39
2678394     2018-04-01 00:59:54 +0100   39
2678395     2018-04-01 00:59:55 +0100   39
2678396     2018-04-01 00:59:56 +0100   39
2678397     2018-04-01 00:59:57 +0100   39
2678398     2018-04-01 00:59:58 +0100   39
2678399     2018-04-01 00:59:59 +0100   39

the column SoC is a random generated number between 0 and 40. I would like it to be a different random number repeated each 86400 rows (and not be the same for the entire dataframe).

To be more clear:

-rows 0-86399 1st random number

-rows 86400-172800 2nd random number

-etc

I was trying df['SoC']=np.repeat(random.randint(0,40),len(df)/86400) but there is an error "Length of values does not match length of index"

ideas? thank you in advance

jezrael · Accepted Answer

First create array with parameter size with floor division for integer and then repeat:

#possible duplicated random values
df['SoC'] = np.repeat(np.random.randint(0,40, size=len(df) // 86400), 86400)

#unique random numbers
df['SoC'] = np.repeat(np.random.choice(np.range(0, 40), 
                      size=len(df) // 86400, replace=False), 86400)

repeat random numbers each n rows

Answers (2)

Related Questions