Conditionally dropping columns in a pandas dataframe

Question

I have this dataframe and my goal is to remove any columns that have less than 1000 entries.

Prior to to pivoting the df I know I have 880 unique well_id's with entries ranging from 4 to 60k+. I know should end up with 102 well_id's.

I tried to accomplish this in a very naïve way by collecting the wells that I am trying to remove in an array and using a loop but I keep getting a 'TypeError: Level type mismatch' but when I just use del without a for loop it works.

#this works
del df[164301.0]
del df['TB-0071']

# this doesn't work
for id in unwanted_id:
    del df[id]

Any help is appreciated, Thanks.

johnjohn · Accepted Answer

You can use dropna method:

df.dropna(thresh=[]) #specify [here] how many non-na values you require to keep the row

The advantage of this method is that you don't need to create a list.

Also don't forget to add the usual inplace = True if you want the changes to be made in place.

Conditionally dropping columns in a pandas dataframe

Answers (2)

Related Questions