Improving the speed/optimizing this code

Question

This codes look right to me, but it's taking me an awful time to run.

In both cases, I'm dealing with 78k rows of data. I've managed to reduce the columns to 2-4 to simplify the code. Here, I'm leaving the first and second columns alone and trying to substitute the third column with the derivative. I would do it monthly until the ProjID changes, telling the code to

for j in range (1,len(joined)):
    if joined['ProjID'][j] == joined['ProjID'][j-1]:
        joined.loc[j]=[joined.ProjID[j], joined.Month[j], (askingrent[j]-askingrent[j-1])/askingrent[j-1]]
    else:
        joined.loc[j]=[joined.ProjID[j], joined.Month[j], 0]

Here, I have 78k of rows again. But, I'm trying to simply convert the column into datetime and delete the time (hours and minutes). The code looks simple enough; but I've been waiting for 30minutes~ish. Is the speed relevant to code or something else?

ageofbuildings['Month']=pd.to_datetime(ageofbuildings['Month'])
for i in range (0, len(ageofbuildings)):
    ageofbuildings.Month[i]=ageofbuildings.Month[i].date()

Improving the speed/optimizing this code

Answers (1)

Related Questions