Royi Namir
Royi Namir

Reputation: 148624

Row_number over (Partition by xxx) in Linq?

I have a DataTable which has a structure and data:

id |   inst   |   name
------------------------
 1 |  guitar  |  john
 2 |  guitar  |  george
 3 |  guitar  |  paul
 4 |  drums   |  ringo
 5 |  drums   |  pete

I can retrieve the records via:

IEnumerable <Beatle>...

class Beatle
{
  int id;
  string inst;
  string name;
}

I'd like to get the internal order of those who play the different instruments. In MSSQL I'd use

SELECT 
    *
    ,Row_Number() OVER (PARTITION BY inst ORDER BY id) AS rn
FROM Beatles

This query returns

id |   inst   |   name  | rn
-----------------------------
 1 |  guitar  |  john   | 1
 2 |  guitar  |  george | 2
 3 |  guitar  |  paul   | 3
 4 |  drums   |  ringo  | 1
 5 |  drums   |  pete   | 2

Question:
How can I do that in Linq?

Upvotes: 56

Views: 67351

Answers (9)

T.Y. Kucuk
T.Y. Kucuk

Reputation: 477

Based on @Jon Comtois answer you can use the following extension method if you need the specific row number gets filtered out;

    /// <summary>
    /// Groups and orders by the data partitioning and returns the list of data with provided rownumber
    /// It is the equivalent of SQL's ROW_NUMBER() OVER (PARTITION BY ... ORDER BY ...)
    /// </summary>
    /// <typeparam name="TSource">Source type</typeparam>
    /// <typeparam name="TGroupingKey">Generic type for grouping property</typeparam>
    /// <typeparam name="TOrderKey">Generic type for ordering property</typeparam>
    /// <param name="source">Source list to be partitioned</param>
    /// <param name="groupingProperty">Grouping property</param>
    /// <param name="orderProperty">Ordering property</param>
    /// <param name="orderByAsc">Order direction</param>
    /// <param name="rowNumber">Rows to be filtered out finally</param>
    /// <returns>Partitioned list</returns>
    public static List<TSource> FilterByPartitioning<TSource, TGroupingKey, TOrderKey>(this List<TSource> source, Func<TSource, TGroupingKey> groupingProperty, Func<TSource, TOrderKey> orderProperty, bool orderByAsc = true, int rowNumber = 1)
    {
        var orderedData = orderByAsc ? source.OrderBy(orderProperty) : source.OrderByDescending(orderProperty);

        return orderedData.GroupBy(groupingProperty)
            .Select(g => new { g, count = g.Count() })
            .SelectMany(t => t.g.Select(b => b)
                .Zip(Enumerable.Range(1, t.count), (source, i) => new { source, row = i }))
            .Where(x => x.row == rowNumber)
            .Select(x => x.source).ToList();
    }

    //Usage
    var result = myList.FilterByPartitioning(group => group.PropertyToGroup, order => order.PropertyToOrder, orderByAsc: false, rowNumber: 1);

Upvotes: 0

jurio
jurio

Reputation: 21

https://github.com/jurioli/Applied

var result = data.GroupBy(a => new { a.inst }).AsPartition()
.Over(p => p.RowNumber(), (a, value) => new { a.inst, a.name, RowNumber = value })
.ToList();

Upvotes: 1

Rabbi
Rabbi

Reputation: 4722

B"H

I know this is old. But why isn't the solution simply?

var o = beatles.GroupBy(x => x.inst)
               .SelectMany(g =>
                   g.Select((j, i) => new { j.inst, j.name, rn = i + 1 })
               );

Upvotes: 38

Tomasz Maj
Tomasz Maj

Reputation: 1539

Some may find useful using in your code to get proper index

.Select((item, i) => new { Item = item, Index = i })

Upvotes: 2

shA.t
shA.t

Reputation: 16968

Another solution for Linq to objects is:

var result = beatles
            .GroupBy(g => g.inst)
            // PARTITION BY ^^^^
            .Select(c => c.OrderBy(o => o.id).Select((v, i) => new { i, v }).ToList())
            //                   ORDER BY ^^
            .SelectMany(c => c)
            .Select(c => new { c.v.id, c.v.inst, c.v.name, rn = c.i + 1 })
            .ToList();

[ C# Demo ]

Upvotes: 4

Jon Comtois
Jon Comtois

Reputation: 1854

Try this one liner:

var o = beatles
    .OrderBy( x => x.id )
    .GroupBy( x => x.inst )
    .Select( group => new { Group = group, Count = group.Count() } )
    .SelectMany( groupWithCount =>
        groupWithCount.Group.Select( b => b)
        .Zip(
            Enumerable.Range( 1, groupWithCount.Count ),
            ( j, i ) => new { j.inst, j.name, RowNumber = i }
        )
    );

foreach (var i in o)
{
    Console.WriteLine( "{0} {1} {2}", i.inst, i.name, i.RowNumber );
}

Output:

Guitar John 1
Guitar George 2
Guitar Paul 3
drums Ringo 1
drums Pete 2

Upvotes: 35

Benjamin Fourgeaud
Benjamin Fourgeaud

Reputation: 21

Another solution to do the equivalent of RANK() OVER (PARTITION BY "partitionBy" ORDER BY "orderBy" DESC):

 DataTable Rank(DataTable dt, string partitionBy, string orderBy, int whichRank)
   {

        DataView dv = new DataView(dt);
        dv.Sort = partitionBy + ", " + orderBy + " DESC";

        DataTable rankDt = dv.ToTable();
        rankDt.Columns.Add("Rank");
        int rank = 1;

        for (int i = 0; i < rankDt.Rows.Count - 1; i++)
        {
            rankDt.Rows[i]["Rank"] = rank;
            DataRow thisRow = rankDt.Rows[i];
            DataRow nextRow = rankDt.Rows[i + 1];

            if (thisRow[partitionBy].ToString() != nextRow[partitionBy].ToString())
                rank = 1;
            else
                rank++;
        }

        DataView selectRankdv = new DataView(rankDt);
        selectRankdv.RowFilter = "rank = " + whichRank;
        return selectRankdv.ToTable();
  }

Upvotes: 2

Ethan Brown
Ethan Brown

Reputation: 27282

As @The_Smallest points out, row number is not supported by LINQ. Here's how you can get what you're looking for, though:

var grouped = beatles.OrderBy( x => x.id )
  .ToList()   // required because SelectMany below doesn't evaluate to SQL
  .GroupBy( x => x.inst );
var rns = grouped.ToDictionary( x => x.Key, x => 1 );
var result = grouped
  .SelectMany( x => x.Select( 
    y => new { inst = y.inst, name = y.name, rn = rns[y.inst]++ } ) );

Upvotes: 3

leppie
leppie

Reputation: 117280

Another idea is using a view, if possible.

Upvotes: 6

Related Questions