PostgreSQL - "Ten most frequent entries"

Question

We've got a table with two colums: USER and MESSAGE
An USER can have more than one message.
The table is frequently updated with more USER-MESSAGE pairs.

I want to frequently retrieve the top X users that sent the most messages. What would be the optimal (DX and performnce wise) solution for it?

The solutions I see myself:

I could GROUP BY and COUNT, however it doesn't seem like the most performant nor clean solution.
I could keep an additional table that'd keep count of every user's messages. On every message insertion into the main table, I could also update the relevant row here. Could the update be done automaticaly? Perhaps I could write a procedure for it?
For the main table, I could create a VIEW that'd have an additional "calculated" column - it'd GROUP BY and COUNT, but again, it's probably not the most performant solution. I'd query the view instead.

Please tell me whatever you think might be the best solution.

Laurenz Albe · Accepted Answer

The first and third solutions are essentially the same, since a view is nothing but a “crystallized” query.

The second solution would definitely make for faster queries, but at the price of storing redundant data. The disadvantages of such an approach are:

You are running danger of inconsistent data. You can reduce that danger somewhat by using triggers that automatically keep the data synchronized.
The performance of modifications of message will be worse, because the trigger will have to be executed, and each modification will also modify users (that is the natural place to keep such a count).

The decision should be based on the question whether the GROUP BY query will be fast enough for your purposes. If yes, use it and avoid the above disadvantages. If not, consider storing the extra count.

PostgreSQL - "Ten most frequent entries"

Answers (2)

Related Questions

PostgreSQL - &quot;Ten most frequent entries&quot;

Answers (2)

Related Questions

PostgreSQL - "Ten most frequent entries"