Pandas: Groupby Fillna Not working

Question

I have the following dataframe, which has about 4,000 tickers and about 2 million rows in total:

Ticker      Date              Rank         
  1         01/01/2000         5            
  1         01/02/2000        NaN             
  2         01/01/2000         4            
  2         01/02/2000         2

I now run the following code to carry forward the Rank column, which works fine:

import pandas as pd
df = df.sort_values(by=["Ticker", "Date"], ascending=[True, True])
df['Rank'] = df.groupby('Ticker')['Rank'].fillna(value=None, method="ffill")
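
For reference, here is a minimal self-contained sketch of the same per-ticker carry-forward on a toy frame. The sample values and date format are illustrative assumptions, and it uses `GroupBy.ffill()` in place of `fillna(method="ffill")`, since recent pandas releases deprecate the `method` argument.

```python
import numpy as np
import pandas as pd

# Toy data mirroring the table above (values are illustrative only).
df = pd.DataFrame({
    "Ticker": [1, 1, 2, 2],
    "Date": pd.to_datetime(
        ["2000-01-01", "2000-01-02", "2000-01-01", "2000-01-02"]
    ),
    "Rank": [5.0, np.nan, 4.0, 2.0],
})

df = df.sort_values(by=["Ticker", "Date"])

# Forward-fill Rank within each ticker; values never carry across tickers.
df["Rank"] = df.groupby("Ticker")["Rank"].ffill()
```

On this toy frame the missing Rank for ticker 1 is filled from the row above it, while ticker 2 is left unchanged.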

However, I now want to carry forward a different column. To create this column, I do the following:

import numpy as np
df["Code"] = np.nan

In my function, some code then replaces about 200 of these values with 1, based on the date and ticker values in the df "add". This code works and looks like this:

df["Code"][(df.Date == add) & (df["Ticker"] == column)] = 1
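
As a side note, the same assignment can be sketched with `.loc`, which sets the values on `df` in a single indexing step instead of through chained indexing (the chained form can trigger pandas' SettingWithCopyWarning). The toy frame and the values given to `add` and `column` are hypothetical stand-ins for the ones used in the real loop.

```python
import numpy as np
import pandas as pd

# Toy frame with an all-NaN Code column (illustrative values only).
df = pd.DataFrame({
    "Ticker": [1, 1, 2, 2],
    "Date": pd.to_datetime(
        ["2000-01-01", "2000-01-02", "2000-01-01", "2000-01-02"]
    ),
    "Code": np.nan,
})

add = pd.Timestamp("2000-01-01")  # hypothetical date to flag
column = 2                        # hypothetical ticker to flag

# Build the row mask once, then assign through .loc in one step.
mask = (df["Date"] == add) & (df["Ticker"] == column)
df.loc[mask, "Code"] = 1
```

Only the row for ticker 2 on the chosen date is set to 1; every other row keeps NaN and the column stays float64.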

This makes my dataframe look like this:

Ticker      Date              Rank          Code      
  1         01/01/2000         5             NaN
  1         01/02/2000        NaN            NaN
  2         01/01/2000         4              1
  2         01/02/2000         2             NaN

Now I want to carry forward this column, but the code takes forever:

import pandas as pd
df = df.sort_values(by=["Ticker", "Date"], ascending=[True, True])
df['Code'] = df.groupby('Ticker')['Code'].fillna(value=None, method="ffill")

I ran it for two days and my PC crashed. There must be some mistake in the way I am doing things, because the carry-forward above runs quickly and this one does not even finish. I checked the dtype of "Code" and it is float64.

Can anyone help?

