Hari

PostgreSQL search is slow on a text[] column

I have a product_details table with 30+ million records. Product attribute text data is stored in the column Value1.

Front-end (web) users search for product details, and the search is run against column Value1.

create table product_details(
    key serial primary key,
    product_key int,
    attribute_key int,
    Value1 text[],
    Value2 int[],
    status text);

I created a GIN index on column Value1 to improve search performance, and execution time improved a lot for many queries.

Tables and indexes are here
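
For reference, the index is an expression GIN index over value1_search(value1), roughly like this (simplified sketch; value1_search is shown here as a plain array-flattening helper, the actual definitions may differ):

create extension if not exists pg_trgm;

-- assumption: value1_search just flattens the array into one searchable string
create or replace function value1_search(text[]) returns text
    language sql immutable
    as $$ select array_to_string($1, ' ') $$;

create index product_details_value1_gin
    on product_details using gin (value1_search(value1) gin_trgm_ops);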

Below is one of the queries used by the application for search.

select p.key
from (select x.product_key,
             x.value1,
             x.attribute_key,
             x.status
      from product_details x
      where value1 is not null) as pr_d
join attribute_type at on at.key = pr_d.attribute_key
join product p on p.key = pr_d.product_key
where value1_search(pr_d.value1) ilike '%B s%'
  and at.type = 'text'
  and at.status = 'active'
  and pr_d.status = 'active'
  and 1 = 1
  and p.product_type_key = 1
  and 1 = 1
group by p.key

The query executes in 2 or 3 seconds if we search for %B % or any single- or two-character word; the query plan is below.

Group  (cost=180302.82..180302.83 rows=1 width=4) (actual time=49.006..49.021 rows=65 loops=1)
  Group Key: p.key
  ->  Sort  (cost=180302.82..180302.83 rows=1 width=4) (actual time=49.005..49.009 rows=69 loops=1)
        Sort Key: p.key
        Sort Method: quicksort  Memory: 28kB
        ->  Nested Loop  (cost=0.99..180302.81 rows=1 width=4) (actual time=3.491..48.965 rows=69 loops=1)
              Join Filter: (x.attribute_key = at.key)
              Rows Removed by Join Filter: 10051
              ->  Nested Loop  (cost=0.99..180270.15 rows=1 width=8) (actual time=3.396..45.211 rows=69 loops=1)
                    ->  Index Scan using products_product_type_key_status on product p  (cost=0.43..4420.58 rows=1413 width=4) (actual time=0.024..1.473 rows=1630 loops=1)
                          Index Cond: (product_type_key = 1)
                    ->  Index Scan using product_details_product_attribute_key_status on product_details x  (cost=0.56..124.44 rows=1 width=8) (actual time=0.026..0.027 rows=0 loops=1630)
                          Index Cond: ((product_key = p.key) AND (status = 'active'))
                          Filter: ((value1 IS NOT NULL) AND (value1_search(value1) ~~* '%B %'::text))
                          Rows Removed by Filter: 14
              ->  Seq Scan on attribute_type at  (cost=0.00..29.35 rows=265 width=4) (actual time=0.002..0.043 rows=147 loops=69)
                    Filter: ((value_type = 'text') AND (status = 'active'))
                    Rows Removed by Filter: 115
Planning Time: 0.732 ms
Execution Time: 49.089 ms

But if I search for %B s%, the query took 75 seconds; the query plan is below (a second execution took 63 seconds).

In the plan below, the DB engine did not use the same indexes it used in the plan above. I'm not sure why.

Group  (cost=8057.69..8057.70 rows=1 width=4) (actual time=62138.730..62138.737 rows=12 loops=1)
  Group Key: p.key
  ->  Sort  (cost=8057.69..8057.70 rows=1 width=4) (actual time=62138.728..62138.732 rows=14 loops=1)
        Sort Key: p.key
        Sort Method: quicksort  Memory: 25kB
        ->  Nested Loop  (cost=389.58..8057.68 rows=1 width=4) (actual time=2592.685..62138.710 rows=14 loops=1)
              ->  Hash Join  (cost=389.15..4971.85 rows=368 width=4) (actual time=298.280..62129.956 rows=831 loops=1)
                    Hash Cond: (x.attribute_type = at.key)
                    ->  Bitmap Heap Scan on product_details x  (cost=356.48..4937.39 rows=681 width=8) (actual time=298.117..62128.452 rows=831 loops=1)
                          Recheck Cond: (value1_search(value1) ~~* '%B s%'::text)
                          Rows Removed by Index Recheck: 26168889
                          Filter: ((value1 IS NOT NULL) AND (status = 'active'))
                          Rows Removed by Filter: 22
                          Heap Blocks: exact=490 lossy=527123
                          ->  Bitmap Index Scan on product_details_value1_gin  (cost=0.00..356.31 rows=1109 width=0) (actual time=251.596..251.596 rows=2846970 loops=1)
                                Index Cond: (value1_search(value1) ~~* '%B s%'::text)
                    ->  Hash  (cost=29.35..29.35 rows=265 width=4) (actual time=0.152..0.153 rows=269 loops=1)
                          Buckets: 1024  Batches: 1  Memory Usage: 18kB
                          ->  Seq Scan on attribute_type at  (cost=0.00..29.35 rows=265 width=4) (actual time=0.010..0.122 rows=269 loops=1)
                                Filter: ((value_type = 'text') AND (status = 'active'))
                                Rows Removed by Filter: 221
              ->  Index Scan using product_pkey on product p  (cost=0.43..8.39 rows=1 width=4) (actual time=0.009..0.009 rows=0 loops=831)
                    Index Cond: (key = x.product_key)
                    Filter: (product_type_key = 1)
                    Rows Removed by Filter: 1
Planning Time: 0.668 ms
Execution Time: 62138.794 ms

Any suggestions, please, to improve this query for the %B s% search?

thanks

Upvotes: 0

Answers (1)

jjanes

ilike '%B %' has no usable trigrams in it. The planner knows this, and penalizes the pg_trgm index plan so heavily that it goes with an entirely different plan instead.

But ilike '%B s%' does have one usable trigram in it, ' s'. It turns out that this trigram sucks because it is extremely common in the searched data, but the planner currently has no way to accurately estimate how much it sucks.
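
If you want to see which trigrams pg_trgm derives from a piece of text, show_trgm() (part of pg_trgm) lists them; for a wildcard pattern, only trigrams guaranteed by the fixed characters between the % signs are usable by the index:

select show_trgm('B s');   -- trigrams pg_trgm sees in the literal text between the wildcards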

Even worse, this large number of matches means your full bitmap can't fit in work_mem, so it goes lossy. Then it needs to recheck all the tuples in any page which contains even one tuple that has the ' s' trigram in it, which looks like it is most of the pages in your table.

The first thing to do is to increase your work_mem to the point you stop getting lossy blocks. If most of your time is spent in the CPU applying the recheck condition, this should help tremendously. If most of your time is spent reading the product_details from disk (so that the recheck has the data it needs to run) then it won't help much. If you had done EXPLAIN (ANALYZE, BUFFERS) with track_io_timing turned on, then we would already know which is which.
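
A minimal sketch of how to test that (the work_mem value is only an illustrative starting point, track_io_timing needs suitable privileges, and the query is a stripped-down version of yours):

set track_io_timing = on;   -- lets EXPLAIN (ANALYZE, BUFFERS) report time spent on I/O
set work_mem = '256MB';     -- illustrative; raise it until the "lossy" heap blocks disappear from the plan
explain (analyze, buffers)
select key from product_details
where value1_search(value1) ilike '%B s%';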

Another thing you could do is have the application inspect the search parameter, and if it looks like two letters (with or without a space between), then forcibly disable that index usage, or just throw an error if there is no good reason to do that type of search. For example, changing the part of the query to look like this will disable the index:

where value1_search(pr_d.value1)||'' ilike '%B s%'
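
To illustrate the kind of check the application could make first (a hypothetical guard, written as SQL only for illustration; in practice it would live in application code):

-- strip everything except letters and digits from the pattern and count what is left
select length(regexp_replace('%B s%', '[^a-zA-Z0-9]', '', 'g')) <= 2 as looks_like_two_letters;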

Another thing would be to rethink your data representation. '%B s%' is a peculiar thing to search for. Why would anyone search for that? Does it have some special meaning within the context of your data, which is not obvious to the outside observer? Maybe you could represent it in a different way that gets along better with pg_trgm.

Finally, you could try to improve the planning for GIN indexes generally by explicitly estimating how many tuples are going to fail recheck (due to inherent lossiness of the index, not due to overrunning work_mem). This would be a major undertaking, and you would be unlikely to see it in production for at least a couple years, if ever.

Upvotes: 1
