pratpor

Reputation: 2104

PostgreSQL default_statistics_target not improving row estimation

I am trying to optimize queries on Postgres that sometimes take minutes on huge tables. I started looking at the query plans and noticed close to a 1000x difference between the estimated number of rows and the actual rows when running with EXPLAIN ANALYZE.

This led me to the parameter default_statistics_target, which controls the number of rows sampled by the ANALYZE command to collect the statistics used by the query planner. As a few blogs suggested, I experimented by increasing the value, setting it to 1000 and even to the maximum allowed value of 10000.
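
For reference, this is roughly what I ran each time (the table name events here is a placeholder for the real table):

    -- Raise the sampling target for this session, then refresh the stats.
    SET default_statistics_target = 1000;
    ANALYZE events;

    -- Alternatively, raise it for a single column only:
    ALTER TABLE events ALTER COLUMN event_name_id SET STATISTICS 1000;
    ANALYZE events;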

I ran ANALYZE every time to ensure the stats were updated. But surprisingly, this did not improve the row estimation at all. In fact it reduced the estimate a bit further, which is hard to understand.

I also tested reducing the value to 10, which seems to have improved the estimate a bit. So I am confused whether this parameter actually does what I thought it does, or whether there is some other way to improve row estimation. Any help would be much appreciated.

Postgres version: 9.6

Query plan: At the last index scan step, the estimate is 462 rows but the actual count is 1.9M. https://explain.depesz.com/s/GZY

After changing default_statistics_target = 1000, the rows at the index scan step were:

->  (cost=0.57..120.42 rows=114 width=32) (actual time=248.999..157947.395 rows=1930518 loops=1)

And on setting default_statistics_target = 10, the counts were:

->  (cost=0.57..2610.79 rows=2527 width=32) (actual time=390.437..62668.837 rows=1930518 loops=1)

P.S. The table under consideration has more than 100M rows.

Upvotes: 1

Views: 2426

Answers (1)

jjanes

Reputation: 44157

This looks like a correlation problem. The planner assumes that the conditions on project_id, event_name_id, and "timestamp" are independent, and so multiplies their estimated selectivities. If they are not independent, then no amount of traditional statistics is going to help. Maybe you need extended statistics.
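
In PostgreSQL 10 and later that could look something like the sketch below (CREATE STATISTICS does not exist in 9.6, so this would mean upgrading; the table and column names are guessed from your plan):

    -- Tell the planner these columns are correlated (PostgreSQL 10+).
    -- 'dependencies' records functional dependencies between the columns.
    CREATE STATISTICS events_proj_event_stats (dependencies)
        ON project_id, event_name_id FROM events;

    -- Rebuild statistics so the new statistics object is populated.
    ANALYZE events;

Note that dependency statistics only help equality conditions, so the range condition on "timestamp" would still be estimated the old way.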

Also, at the time it makes the plan, it doesn't even know what value event_name_id will be compared to, as $0 is not determined until run time, so it can't use value-specific statistics for that. You could execute the subquery manually, then hard-code the resulting value into that spot in the query, so the planner knows what the value is while it is planning.
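
For illustration, assuming the filter looks something like event_name_id = (SELECT ...), the idea is (all names here are guesses, not from your actual query):

    -- Step 1: run the subquery by itself to get the concrete value.
    SELECT id FROM event_names WHERE name = 'some_event';
    -- suppose it returns 42

    -- Step 2: substitute the literal into the main query, so the planner
    -- can use value-specific statistics for event_name_id.
    SELECT *
    FROM events
    WHERE project_id = 123
      AND event_name_id = 42   -- was: (SELECT id FROM event_names WHERE ...)
      AND "timestamp" >= '2018-01-01';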

Upvotes: 2
