Get most commonly occurring value for each user id

Question

I have a table with userIds and product categories prod. I want to get a table of unique userIds and associated most occurring product categories prod. In other words, I want to know what item categorys each customer is buying the most. How can I achieve this in PL/SQL or Oracle SQL?

|userId|prod|
|------|----|
|123544|cars|
|123544|cars|
|123544|dogs|
|123544|cats|
|987689|bats|
|987689|cats|

I have already seen SO questions for getting the most common value of a column, but how do I get the most common value for each unique userId?

David Faber · Accepted Answer

SELECT user_id, prod, prod_cnt FROM (
    SELECT user_id, prod, prod_cnt
         , RANK() OVER ( PARTITION BY user_id ORDER BY prod_cnt DESC ) AS rn
      FROM (
        SELECT user_id, prod, COUNT(*) AS prod_cnt
          FROM mytable
         GROUP BY user_id, prod
    )
) WHERE rn = 1;

In the innermost subquery I am getting the COUNT of each product by user. Then I rank them using the analytic (window) function RANK(). Then I simply select all of those where the RANK is equal to 1. Using RANK() instead of ROW_NUMBER() ensures that ties will be returned.

Get most commonly occurring value for each user id

Answers (2)

Related Questions