How to replace values in columns based on row values and column names?

Question

I previously posted a related question on GIS StackExchange about subsetting columns based on row values.

In short, I would like to set a value to NA whenever its column name (e.g. 100) is less than that row's s_mean (e.g. 101).

This worked for some datasets, but with the current one it fails with the following error:

Error: Can't subset columns that don't exist.
x Locations 304, 303, 302, 301, 300, etc. don't exist.
i There are only 197 columns.
Run `rlang::last_error()` to see where the error occurred.

Here is the data:

# A tibble: 2,937 x 197
      ID   doy FireID  Year    sE     NAME    L1NAME   ID_2   area s_count s_mean s_median s_stdev  s_min   doydiff ID_E5    32    33    34    35
                                              
 1  2246   173  30048  2015     0 A         T         30048 3.86e6       0    100        0       0     0       73      56  267.  265.  264.  265.
 2  2275   174  30076  2015     0 A         T         30076 2.15e6       0    100        0       0     0       74     533  266.  266.  263.  264.
 3   704   294  28542  2015  1381 A         T         28542 6.44e5       0    100        0       0     0       194    562  277.  277.  278.  279.
 4   711   110  28549  2015     0 NA        NA        28549 2.15e5       0    101        0       0     0       9      569  262.  264.  260.  262.
 5   690   161  28528  2015   232 A         T         28528 4.29e5       0    101        0       0     0       60     580  280.  279.  280.  279.
 6   692   331  28530  2015     0 M         M         28530 2.15e5       0    101        0       0     0       130    582  280.  279.  281.  280.
 7   667    47  28506  2015   232 M         M         28506 2.79e6       0     10        0       0     0       37     589  280.  282.  281.  280.
 8   672   188  28511  2015     0 NA        NA        28511 2.79e6       0    101        0       0     0       87     594  254.  261.  259.  254.
 9   657   171  28496  2015   578 NA        NA        28496 8.59e5       0    101        0       0     0       170    611  256.  263.  260.  254.
10   635   301  28474  2015  1084 M         M         28474 1.50e6       0    101        0       0     0       200    621  282.  282.  282.  281.

The data columns continue up to the column named 212 (not shown here).

Here is the script:

polydata = read_csv("path/E15.csv")
polydata$s_mean <- round(polydata$s_mean)
polydata <- polydata[order(polydata$s_mean),]

# slice each row, and put each slice in a list
df_sub = lapply(1:nrow(polydata),
                function(x){
                  polydata[x,c(1,10,polydata$s_mean[x]:187+10)] # + 10 because of the offset: doy_columns start at 11
                })

Why does the error say I am requesting columns that don't exist (locations up to 304) when I specify 187+10 as the upper bound of the subset?

What should be changed?
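
For reference, the index arithmetic in the script can be checked in isolation. The sketch below uses two hypothetical s_mean values (100 and 294; the second is an assumption chosen because it would reproduce the locations listed in the error, not a value confirmed from the data). It relies only on the fact that in R `:` binds tighter than `+`, so `a:187+10` means `(a:187) + 10`, and that `a:b` counts down when `a > b`.

```r
# Hypothetical values, chosen only to illustrate the indexing arithmetic.
s_mean_small <- 100
s_mean_large <- 294   # assumption: some row has an s_mean above 187
n_cols <- 197         # number of columns reported in the error message

# `:` binds tighter than `+`, so a:187+10 is (a:187) + 10, not a:(187+10).
idx_small <- s_mean_small:187 + 10   # 110, 111, ..., 197
idx_large <- s_mean_large:187 + 10   # 304, 303, ..., 197 (counts down)

range(idx_small)
range(idx_large)

# Positions beyond the number of columns are the ones that cannot be subset.
bad <- idx_large[idx_large > n_cols]
head(bad, 5)
```

If this reading is right, the script treats s_mean as a column position rather than a column name, which only works while every s_mean is at most 187 and the offset matches the actual layout of the file.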

I eventually want this to be the outcome (compare the column names to s_mean to better understand the desired output):

ID    s_mean    32    33    34    35    36    ...    212
1     30        267   278   270   269   267   ...    298
2     100       NA    NA    NA    NA    NA    ...    298
3     35        NA    NA    NA    242   246   ...    298
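
One way to get this kind of output without any positional offsets is to compare the column names themselves to s_mean. The following is a minimal sketch on a made-up toy data frame (the values and the names `toy`, `doy_cols`, `doy_vals` and `mask` are assumptions for illustration, not the original data), using base R only.

```r
# Toy data (made-up values); check.names = FALSE keeps the numeric column names.
toy <- data.frame(
  ID     = 1:3,
  s_mean = c(30, 100, 35),
  `32`   = c(267, 266, 277),
  `33`   = c(278, 266, 277),
  `34`   = c(270, 263, 278),
  `35`   = c(269, 264, 242),
  `36`   = c(267, 265, 246),
  check.names = FALSE
)

# Identify the day-of-year columns by name (all digits) rather than by position.
doy_cols <- grep("^[0-9]+$", names(toy), value = TRUE)
doy_vals <- as.numeric(doy_cols)

# mask[i, j] is TRUE where column name j is less than the s_mean of row i.
mask <- outer(toy$s_mean, doy_vals, FUN = ">")

# Assigning through a logical matrix sets exactly the masked cells to NA.
toy[doy_cols][mask] <- NA
toy
```

Because the columns are selected by name, the result does not depend on how many metadata columns precede the first day-of-year column. I have only sketched this on a plain data frame; whether it behaves identically on the tibble returned by read_csv is untested here.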

Answers (1)

Toy Dataset:
