Extracting columns from pandas dataframe without hard coding

Question

Is there a way to extract a subset of columns from a pandas dataframe without specifying all of the columns. e.g. I have dataframe with foll. columns: str_ID, num_ID, 1990, 1991, 1992, 1993, 1994, 1995 and I want to extract columns from 1990 onwards. How do I do that without hard coding it?

df.columns.values
array(['str_ID', 'num_ID', 1990, 1991, 1992, 1993, 1994, 1995], dtype=object)

Alexander · Accepted Answer

You can use a conditional comprehension on the columns of the dataframe (assumes the column titles for the years are integers):

df[sorted(col for col in df if isinstance(col, int) and col >= 1990)]

This filters for integer columns greater than or equal to 1990 and returns the result in a sorted order.

Extracting columns from pandas dataframe without hard coding

Answers (2)

Related Questions