SQL: a time-series variant of the "every nth row" problem

Question

I have a table of time-series data, with the columns:

sensor_number (integer primary key)
signal_strength (integer)
signal_time (timestamp)

Each sensor creates 20-30 rows per minute. I need a query that returns for a sensor 1 row per minute (or every 2 minutes, 3 minutes, etc). A pure SQL approach is to use a window function, with a partition on an expression that rounds the timestamp appropriately (date_trunc() works for the 1-minute case, otherwise I have to some messy casting) The problem is the expression blocks the ability to use the index. With 5B rows, that's a killer.

The best alternative I can come up with is a user-defined function that uses a cursor to step through the table in index key order (sensor_number, signal_time) and outputting a row every time the timestamp crosses a minute boundary. That's still slow though. Is there a pure SQL approach that'll accomplish this AND utilize the index?

SQL: a time-series variant of the "every nth row" problem

Answers (1)

Related Questions

SQL: a time-series variant of the &quot;every nth row&quot; problem

Answers (1)

Related Questions

SQL: a time-series variant of the "every nth row" problem