j Rodr

Reputation: 105

Postgres - calculate change in cumulative data

I am gathering data from a few API sources via Python, and adding this to 2 tables in Postgres.

I then use this data to make reports, joining and grouping/filtering the data. Every day I add thousands of rows.

Cost, revenue and sales are always cumulative, meaning that each data point covers the period from t1 for that product up to t2, the time of data retrieval.

The latest data pull therefore includes all the previous data back to t1. Both t1 and t2 are timestamp without time zone columns in Postgres; I currently use Postgres 10.

sample:

id, vendor_id, product_id, t1, t2, cost, revenue, sales
1, a, a, 2018-01-01, 2018-04-18, 50, 200, 34
2, a, b, 2018-05-01, 2018-04-18, 10, 100, 10
3, a, c, 2018-01-02, 2018-04-18, 12, 100, 9
4, a, d, 2018-01-03, 2018-04-18, 12, 100, 8
5, b, e, 2018-02-25, 2018-04-18, 12, 100, 7

6, a, a, 2018-01-01, 2018-04-17, 40, 200, 30
7, a, b, 2018-05-01, 2018-04-17, 0, 95, 8
8, a, c, 2018-01-02, 2018-04-17, 10, 12, 5
9, a, d, 2018-01-03, 2018-04-17, 8, 90, 4
10, b, e, 2018-02-25, 2018-04-17, 9, 0, 3

Cost and revenue are from two tables, and I join them on vendor_id, product_id and t2.

Is there a way to go through all of the data, "shift" it and subtract, so that instead of cumulative data I have time-series data (one value per interval)?

Should this be done prior to storing it, or better done when making the reports?

For reference, when I currently want a report showing the change between two times, I do two subqueries, but that seems backwards compared to having the data as a time series and simply aggregating the needed intervals:

with report1 as (select ...),
report2 as (select ...)
select .. from report1 left outer join report2 on ...
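
For illustration, the two-subquery version looks roughly like the sketch below; report_data and the two snapshot dates are placeholders, not my real schema:

-- hypothetical: the cumulative cost/revenue data already joined into one source, report_data
with report1 as (
    select vendor_id, product_id, cost, revenue, sales
    from report_data
    where t2 = '2018-04-18'
),
report2 as (
    select vendor_id, product_id, cost, revenue, sales
    from report_data
    where t2 = '2018-04-17'
)
select r1.vendor_id
     , r1.product_id
     , r1.cost - coalesce(r2.cost, 0)       as cost_change
     , r1.revenue - coalesce(r2.revenue, 0) as revenue_change
     , r1.sales - coalesce(r2.sales, 0)     as sales_change
from report1 r1
left outer join report2 r2
  on r2.vendor_id = r1.vendor_id
 and r2.product_id = r1.product_id;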

Thanks a lot in advance!

JR

Upvotes: 4

Views: 274

Answers (1)

Georgi Raychev

Reputation: 1334

You can use LAG():

From the documentation on Window Functions (describing lag()):

...returns value evaluated at the row that is offset rows before the current row within the partition; if there is no such row, instead return default (which must be of the same type as value). Both offset and default are evaluated with respect to the current row. If omitted, offset defaults to 1 and default to null.

with sample_data as (
        select 1 as id, 'a'::text vendor_id, 'a'::text product_id, '2018-01-01'::date as t1, '2018-04-18'::date as t2, 50 as cost, 200 as revenue, 36 as sales
        union all
        select 2 as id, 'a'::text vendor_id, 'b'::text product_id, '2018-01-01'::date as t1, '2018-04-18'::date as t2, 55 as cost, 200 as revenue, 34 as sales
        union all
        select 3 as id, 'a'::text vendor_id, 'a'::text product_id, '2018-01-01'::date as t1, '2018-04-17'::date as t2, 35 as cost, 150 as revenue, 25 as sales
        union all
        select 4 as id, 'a'::text vendor_id, 'b'::text product_id, '2018-01-01'::date as t1, '2018-04-17'::date as t2, 25 as cost, 140 as revenue, 23 as sales
        union all
        select 5 as id, 'a'::text vendor_id, 'a'::text product_id, '2018-01-01'::date as t1, '2018-04-16'::date as t2, 16 as cost, 70 as revenue, 12 as sales
        union all
        select 6 as id, 'a'::text vendor_id, 'b'::text product_id, '2018-01-01'::date as t1, '2018-04-16'::date as t2, 13 as cost, 65 as revenue, 11 as sales
)
-- per vendor/product, ordered by snapshot time t2, subtract the previous
-- snapshot's cumulative value; the first snapshot has no previous row, so
-- coalesce falls back to the raw cumulative value
select sd.*
    , coalesce(cost - lag(cost) over (partition by vendor_id, product_id order by t2), cost) cost_new
    , coalesce(revenue - lag(revenue) over (partition by vendor_id, product_id order by t2), revenue) revenue_new
    , coalesce(sales - lag(sales) over (partition by vendor_id, product_id order by t2), sales) sales_new
from sample_data sd
order by t2 desc
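
Applied to your setup, the same thing can be done at report time over the joined tables. A minimal sketch, assuming the two tables are called cost_data and revenue_data (placeholder names; sales is assumed to live in revenue_data) and that there is one row per vendor_id, product_id and t2:

-- delta per snapshot; cost_data and revenue_data are hypothetical table names
select c.vendor_id
     , c.product_id
     , c.t2
     , c.cost    - coalesce(lag(c.cost)    over w, 0) as cost_delta
     , r.revenue - coalesce(lag(r.revenue) over w, 0) as revenue_delta
     , r.sales   - coalesce(lag(r.sales)   over w, 0) as sales_delta
from cost_data c
join revenue_data r
  on r.vendor_id = c.vendor_id
 and r.product_id = c.product_id
 and r.t2 = c.t2
window w as (partition by c.vendor_id, c.product_id order by c.t2)
order by c.vendor_id, c.product_id, c.t2;

The named WINDOW clause just avoids repeating the same partition/order for every column; lag() with coalesce(..., 0) makes the first snapshot count fully as its own delta, the same way the query above falls back to the raw cumulative value.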

Upvotes: 1
