Combine elements of array into different array

Question

I need to split text elements in an array and combine the elements (array_agg) by index into different rows

E.g., input is

'{cat$ball$x... , dog$bat$y...}'::text[]

I need to split each element by '$' and the desired output is:

{cat,dog} - row 1
{ball,bat} - row 2
{x,y} - row 3
...

Sorry for not being clear the first time. I have edited my question. I tried similar options but unable to figure out how to get it with multiple text elements separated with '$' sysmbol

Erwin Brandstetter · Accepted Answer

Exactly two parts per array element (original question)

Use unnest(), split_part() and array_agg():

SELECT array_agg(split_part(t, '$', 1)) AS col1
     , array_agg(split_part(t, '$', 2)) AS col2
FROM   unnest('{cat$ball, dog$bat}'::text[]) t;

Split comma separated column data into additional columns

General solution (updated question)

For any number of arrays with any number of elements containing any number of parts.

Demo for a table tbl:

CREATE TABLE tbl (tbl_id int PRIMARY KEY, arr text[]);

INSERT INTO tbl VALUES
  (1, '{cat1$ball1, dog2$bat2}')  -- 2 parts per array element, 2 elements
, (2, '{cat$ball$x, dog$bat$y}')  -- 3 parts ...
, (3, '{a1$b1$c1$d1, a2$b2$c2$d2, a3$b3$c3$d3}');  -- 4 parts, 3 elements

Query:

SELECT tbl_id, idx, array_agg(elem ORDER BY ord) AS pivoted_array
FROM   tbl t
     , unnest(t.arr) WITH ORDINALITY a1(string, ord)
     , unnest(string_to_array(a1.string, '$')) WITH ORDINALITY a2(elem, idx)
GROUP  BY tbl_id, idx
ORDER  BY tbl_id, idx;

We are looking at two (nested) LATERAL joins here. LATERAL requires Postgres 9.3. Details:

What is the difference between LATERAL and a subquery in PostgreSQL?

WITH ORDINALITY for the the first unnest() is up for debate. A simpler query normally works, too. It's just not guaranteed to work according to SQL standards:

SELECT tbl_id, idx, array_agg(elem) AS pivoted_array
FROM   tbl t
     , unnest(t.arr) string
     , unnest(string_to_array(string, '$')) WITH ORDINALITY a2(elem, idx)
GROUP  BY tbl_id, idx
ORDER  BY tbl_id, idx;

Details:

PostgreSQL unnest() with element number

WITH ORDINALITY requires Postgres 9.4 or later. The same back-patched to Postgres 9.3:

SELECT tbl_id, idx, array_agg(arr2[idx]) AS pivoted_array
FROM   tbl t
     , LATERAL (
         SELECT string_to_array(string, '$') AS arr2  -- convert string to array
         FROM   unnest(t.arr) string  -- unnest org. array
         ) x
     , generate_subscripts(arr2, 1) AS idx  -- unnest 2nd array with ord. numbers
GROUP  BY tbl_id, idx
ORDER  BY tbl_id, idx;

Each query returns:

 tbl_id | idx | pivoted_array
--------+-----+---------------
      1 |   1 | {cat,dog}
      1 |   2 | {bat,ball}
      1 |   3 | {y,x}
      2 |   1 | {cat2,dog2}
      2 |   2 | {ball2,bat2}
      3 |   1 | {a3,a1,a2}
      3 |   2 | {b1,b2,b3}
      3 |   3 | {c2,c1,c3}
      3 |   4 | {d2,d3,d1}

SQL Fiddle (still stuck on pg 9.3).

The only requirement for these queries is that the number of parts in elements of the same array is constant. We could even make it work for a varying number of parts using crosstab() with two parameters to fill in NULL values for missing parts, but that's beyond the scope of this question:

PostgreSQL Crosstab Query

Combine elements of array into different array

Answers (2)

Exactly two parts per array element (original question)

General solution (updated question)

Related Questions