Python/Pandas: sort by date and compute two week (rolling?) average

Question

So far I've read in 2 CSV's and merged them based on a common element. I take the output of the merged CSV and iterate through the unique element they've been merged on. While I have them separated I want to generate a daily count line and a two week rolling average from the current date going backward. I cannot index based of the 'Date Opened' field but I still need my outputs organized by this with the most recent first. Once these are sorted by date my daily count plotting issue will be rectified. My remaining task would be to compute a two week rolling average for count within the week. I've looked into the Pandas documentation and I think the rolling_mean will work but the parameters of this function don't really make sense to me. I've tried biwk_avg = pd.rolling_mean(open_dt, 28) but that doesnt seem to work. I know there is an easier way to do this but I think I've hit a roadblock with the documentation available. The end result should look something like this graph. Right now my daily count graph isnt sorted(even though I think I've instructed it to) and is unusable in line form.

def data_sort():
    data_merge = data_extract()
    domains  = data_merge.groupby('PWx Domain')
    for domain in domains.groups.items():
        dsort = (data_merge.loc[domain[1]])
        print (dsort.head())
        open_dt = pd.to_datetime(dsort['Date Opened']).dt.date
        #open_dt.to_csv('output\''+str(domain)+'_out.csv', sep = ',')
        open_ct = open_dt.value_counts(sort= False) 
        biwk_avg = pd.rolling_mean(open_ct, 28)
        plt.plot(open_ct,'bo')
        plt.show()

data_sort()

Python/Pandas: sort by date and compute two week (rolling?) average

Answers (1)

Related Questions