Pandas - How to groupby and remove specifc rows

Question

I have a DF like this:

id     company     duration
0    Other Company    5
0    Other Company    19
0    X Company        7
1    Other Company    24
1    Other Company    6
1    X Company        12
2    X Company        9
3    Other Company    30
3    X Company        16

I need to group the DF by ID and Company and then sum the duration in each. In the end I need only the values with 'X Company'. This is what I did:

import pandas as pd
jobs = pd.read_csv("data/jobs.csv")
time_in_company = jobs.groupby(['id','company'])['duration'].agg(sum)

And got this:

id     company     duration
0    Other Company    24
0    X Company        7
1    Other Company    30
1    X Company        12
2    X Company        9
3    Other Company    30
3    X Company        16

Now I need remove all entrys from 'Other Company'. Already tried using time_in_company.drop('Any Company') #Return KeyError 'Any Company'

Tried to .set_index('company'), in order to try something else, but it tells me 'Series' object has no attribute 'set_index'

Tried to use a .filter() in the groupby but I need the .agg(sum). (And it didn't work anyway..

Can someone shed some light in the issue for me? Thanks in advance.

pizza lover · Accepted Answer

Does this help?

time_in_company= time_in_company.reset_index(level='company')
time_in_company [time_in_company ['company']!="Other Company"]

Pandas - How to groupby and remove specifc rows

Answers (2)

Related Questions