Removing columns from a dataframe with a common string but keeping the first

Question

In my dataframe frame I have columns with names like this:

stockA_1,stockA_2,stockA_3,stockA_4

I would like to delete all rows from my df which have the common string "stockA_" but I would like to keep only the first column with this synthetic "stockA_1"

How is it possible to make it?

jogo · Accepted Answer

This can be done with base functions:

d <- as.data.frame(matrix(1:35, 5, 7))
names(d) <- c("AA", "stockA_1", "BBB", "stockA_2", "stockA_3", "CCCCC", "stockA_4")
d[,-which(grepl("^stockA_", names(d)))[-1]]

The result is:

> d[,-which(grepl("^stockA_", names(d)))[-1]]
  AA stockA_1 BBB CCCCC
1  1        6  11    26
2  2        7  12    27
3  3        8  13    28
4  4        9  14    29
5  5       10  15    30

If you want to conserve the column "stockA_1" (which is eventually not in the first place under the "stockA_"-columns) then you can do

d <- as.data.frame(matrix(1:35, 5, 7))
names(d) <- c("AA", "stockA_11", "BBB", "stockA_2", "stockA_1", "CCCCC", "stockA_4")
i <- (!grepl("^stockA_", names(d))) | grepl("^stockA_1$", names(d))
d[,i]

with the result:

> d[,i]
  AA BBB stockA_1 CCCCC
1  1  11       21    26
2  2  12       22    27
3  3  13       23    28
4  4  14       24    29
5  5  15       25    30

Removing columns from a dataframe with a common string but keeping the first

Answers (2)

Related Questions