Timka

Reputation: 1769

Database Implementation Help : Time-Series data

This is a re-submission of my previous question.

I have a collection of ordered time-series data (minute-level stock price information). My current PostgreSQL database structure is below:

symbol_table - where I keep the list of symbols, with symbol_id as a serial primary key. time_table and date_table - time and date values are stored there, with time_id and date_id as serial primary keys.

My main table, minute_table, contains the minute pricing information, where (date_id, time_id, symbol_id) is the composite primary key (each column is also a foreign key to the corresponding table).

Using this main minute_table I perform various statistical analyses and keep the results in separate tables, such as one_minute_std, where one-minute standard deviation measures are kept.
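To make the structure concrete, here is a hypothetical reconstruction of the DDL described above; the column names beyond the stated keys (symbol, d, t, close, std) are assumptions:

```sql
-- Hypothetical DDL matching the schema described above.
CREATE TABLE symbol_table (
    symbol_id serial PRIMARY KEY,
    symbol    text NOT NULL UNIQUE
);

CREATE TABLE date_table (
    date_id serial PRIMARY KEY,
    d       date NOT NULL UNIQUE
);

CREATE TABLE time_table (
    time_id serial PRIMARY KEY,
    t       time NOT NULL UNIQUE
);

CREATE TABLE minute_table (
    date_id   integer NOT NULL REFERENCES date_table,
    time_id   integer NOT NULL REFERENCES time_table,
    symbol_id integer NOT NULL REFERENCES symbol_table,
    close     numeric,  -- assumed price column
    PRIMARY KEY (date_id, time_id, symbol_id)
);

CREATE TABLE one_minute_std (
    date_id   integer NOT NULL,
    time_id   integer NOT NULL,
    symbol_id integer NOT NULL,
    std       numeric,  -- assumed statistic column
    PRIMARY KEY (date_id, time_id, symbol_id)
);
```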

Every night I update the tables with the latest price information from that day's close.

With the current implementation my tables contain all the symbols, with around 50 million records each. The primary keys are indexed.

If I query for all the symbols where the closing price > x and one_minute_std > 2 and one_minute_std < 4 for a specific date, the search takes about 3-4 minutes.
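For concreteness, the query described would look something like this (the column names close and std, and the placeholders, are assumptions):

```sql
-- Hypothetical form of the slow query; :x and :target_date are placeholders.
SELECT s.symbol
FROM minute_table m
JOIN one_minute_std o USING (date_id, time_id, symbol_id)
JOIN symbol_table  s USING (symbol_id)
JOIN date_table    d USING (date_id)
WHERE d.d = :target_date
  AND m.close > :x
  AND o.std > 2
  AND o.std < 4;
```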

To speed up the process I was thinking of separating each symbol into its own table, but I'm not 100% sure that's the 'proper' way to do it.

Could you advise me on how I can speed up the query process?

Upvotes: 2

Views: 1541

Answers (1)

Gordon Linoff

Reputation: 1269793

It sounds like you want a combination of approaches.

First, you should look into table partitioning. This stores a single table across multiple storage units ("files") but still gives you the flexibility of a single table. (Here is the Postgres documentation: http://www.postgresql.org/docs/current/interactive/ddl-partitioning.html.)

You would want to partition either by day or by ticker symbol. My first reaction would be by time (day/week/month), since that is the unit of updates. However, if your analyses are typically for a single ticker and often span multiple days, then there is an argument for partitioning by symbol instead.
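As a sketch of the by-day option: in PostgreSQL 10 and later, declarative range partitioning makes this straightforward. The example below assumes the surrogate date_id is replaced by a real date column for partitioning (partitioning on date_id ranges would work similarly); all names here are illustrative:

```sql
-- Parent table partitioned by trading date; a query constrained to one
-- date only scans the matching partition.
CREATE TABLE minute_table (
    trade_date date    NOT NULL,
    time_id    integer NOT NULL,
    symbol_id  integer NOT NULL,
    close      numeric,
    PRIMARY KEY (trade_date, time_id, symbol_id)
) PARTITION BY RANGE (trade_date);

-- One partition per month, for example.
CREATE TABLE minute_table_2015_01
    PARTITION OF minute_table
    FOR VALUES FROM ('2015-01-01') TO ('2015-02-01');
```

On older versions, the linked documentation shows the equivalent inheritance-based setup with CHECK constraints and constraint exclusion.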

After partitioning, you may want to consider indexes. However, I suspect that partitioning will solve your performance problems.

Since your updates happen at night, you should fold your summarization process in with the updates. For instance, one_minute_std should be calculated during this process. You might find it best to load the nightly data into a temporary table, do the calculations for summaries such as one_minute_std, and then load the data into the final partitioned tables.
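A rough sketch of that nightly flow follows; the file path, column names, and in particular the window used for the one-minute statistic are all assumptions, since the real definition isn't given:

```sql
-- 1. Stage the day's raw minute bars in a temporary table.
CREATE TEMPORARY TABLE staging_minute (LIKE minute_table);

COPY staging_minute (date_id, time_id, symbol_id, close)
FROM '/path/to/daily_prices.csv' WITH (FORMAT csv);  -- hypothetical path

-- 2. Compute summaries while the data is small and in one place.
--    The trailing window here is a placeholder for the real definition.
INSERT INTO one_minute_std (date_id, time_id, symbol_id, std)
SELECT date_id, time_id, symbol_id,
       stddev_samp(close) OVER (
           PARTITION BY symbol_id
           ORDER BY time_id
           ROWS BETWEEN 59 PRECEDING AND CURRENT ROW)
FROM staging_minute;

-- 3. Load the staged rows into the final (partitioned) table.
INSERT INTO minute_table
SELECT * FROM staging_minute;
```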

With so many rows and so few columns, you are probably better off with a good partitioning scheme than an indexing scheme. In particular, indexes have a space overhead, and the smaller each row is, the closer the cost of using the index comes to that of scanning the entire table.

Upvotes: 4
