Count of the max value in an aggregate select

Question

I want to count the number of the number of the maximum value of the minimum values in a particular field.

This select is broken if I were to uncomment the lines:

select clientnumber
      ,count(distinct clientname) as NumberUnique
      ,max(clientname) as UniqueSample1
      -- ,sum(case clientname when min(clientname) then 1 else 0 end) as UniqueCount1
      ,min(clientname) as UniqueSample2
      -- ,sum(case clientname when min(clientname) then 1 else 0 end) as UniqueCount2
  FROM dbo.InvoicesSent dt
 group by clientnumber
 having count(distinct clientname) > 1

Background: We are reviewing records that have two different names on the invoices for the same clientnumber. Knowing which one of the two has the higher count is handy for the analysis.

Note: I know there are other solutions that can take into account more than 2 unique values but at this point, I want to solve this problem for its academic value. I also know I could solve this with a subselect but I am holding out for some graceful solution.

Gordon Linoff · Accepted Answer

If you know that there are two names, then you could do this without a subselect:

select top (1) with ties clientnumber, 
       min(clientname) over (partition by clientnumber) as min_name,
       max(clientname) over (partition by clientnumber) as max_name,
       (case when clientname = min(clientname) over (partition by clientnumber)
             then count(*)
             else sum(count(*)) over (partition by clientnumber) - count(*)
        end) as min_count,
       (case when clientname = max(clientname) over (partition by clientnumber)
             then count(*)
             else sum(count(*)) over (partition by clientnumber) - count(*)
        end) as max_count
from InvoicesSent i
group by clientnumber, clientname
order by row_number() over (partition by clientnumber order by clientname) +
         (case when min(clientname) over (partition by clientnumber) = max(clientname) over (partition by clientnumber) then 1 else 0 end);

I am not arguing that this is elegant. Merely that it is possible for your conditions.

Here is a slightly simpler form:

select top (1) with ties clientnumber, 
       clientname as min_name,
       max(clientname) over (partition by clientnumber) as max_name,
       count(*) as min_count,
       sum(count(*)) over (partition by clientnumber) - count(*) max_count
from InvoicesSent i
group by clientnumber, clientname
order by row_number() over (partition by clientnumber order by clientname) +
         (case when count(*) over (partition by clientnumber) >= 2 then 0 else 1 end)

And a db<>fiddle.

Actually, this isn't so bad. It is aggregating by the name column and the client, putting all the data on one row using window functions and then selecting the one row. There is a little trick in the order by to only select clients with more than one name.

Count of the max value in an aggregate select

Answers (2)

Related Questions