Grouping multiple rows into one string postgres for each position in select

Question

I have a table education that has a column university. For each of the rows in the table I want to find 3 most similar universities from the table.

Here is my query that finds 3 most similar universities to a given input:

select distinct(university),
               similarity(unaccent(lower(university)),
                          unaccent(lower('Boston university')))
from education
order by similarity(unaccent(lower(university)),
                    unaccent(lower('Boston university'))) desc
limit 3;

It works fine. But now I would like to modify this query so that I get two columns and a row for each existing university in the table: the first column would be the university name and the second would be the three most similar universities found in the database (or if its easier - four columns where the first is the university and the next 3 are the most similar ones).

What should this statement look like?

GMB · Accepted Answer

You could use an inline aggregated query:

with u as (select distinct university from education)
select 
    university,
    (
        select string_agg(u.university, ',')
        from u 
        where u.university != e.university
        order by similarity(
            unaccent(lower(u.university)), 
            unaccent(lower(e.university))
        ) desc
        limit 3
    ) similar_universities
from education e

This assumes that a given university may occur more than once in the education table (hence the need for a cte).

Grouping multiple rows into one string postgres for each position in select

Answers (2)

Related Questions