SQL Server GROUP BY duplicate values

Question

I have a NOTOK table that containts idproductype and idcause columns and a query that looks like this

SELECT pt.Name, c.Description 
FROM NOTOK n
JOIN ProductType pt ON n.IDProductType = pt.ID
JOIN Cause c ON n.IDCause=c.ID

That returns the following dataset

PRODUCTTYPE CAUSE
productType1 cause1
productType1 cause1
productType1 cause1
productType1 cause2
productType1 cause2
productType1 cause3
productType2 cause1
productType2 cause1
productType3 cause3
productType3 cause3
productType3 cause1

Now, i'd like to group this by the ProductType-column and put the most common two causes in different column, so the results is as following

PRODUCTTYPE CAUSE1 CAUSE2
productType1 cause1 cause2
productType2 cause1 NULL
productType3 cause3 cause1

How would i do that?

T I · Accepted Answer

You can use a cte to get the most common and use ROW_NUMBER to provide a way of getting the top 2 causes per product. This can then be pivoted to the result set you require.

WITH cte AS
(
 SELECT *, 
   ROW_NUMBER() OVER ( PARTITION BY product_type ORDER BY n DESC ) o
 FROM (
  SELECT product_type, cause, COUNT(1) n
  FROM notok
  GROUP BY product_type, cause
 ) t
)

SELECT product_type, MAX([1]), MAX([2])
FROM cte
PIVOT (
  MAX(cause)
  FOR o IN ([1],[2])
) p
GROUP BY product_type

demo

SQL Server GROUP BY duplicate values

Answers (2)

Related Questions