how to remove duplicate data for group query in Linq

Question

I'm trying to find a distinct list of filenames related to each bugid, and I used linq to group all filenames related to each bug id. I don't know how I can remove duplicate filenames related to each bugid,in file ouput I have multiple rows like this: bugid filename1 filename2 filename3 filename4 ............. there are multiple rows with the same bugid and also there duplicate filenames for each bug id, this is my code:

using System;
using System.Collections.Generic;
using System.Text;
using System.Linq;


namespace finalgroupquery
{
    class MainClass
{
        public static void Main (string[] args)
        {

            List  list2=new List  ();
             using(System.IO.StreamReader reader1= new System.IO.StreamReader( @"/home/output"))
                using (System.IO.StreamWriter file = new System.IO.StreamWriter( @"/home/output1")) 
                        {string line1;
                         while ((line1=reader1.ReadLine())!=null) 
                            { string[] items1=line1.Split('	');        
                                    bug bg=new bug();
                                      bg.bugid=items1[0];
                                for (int i=1; i<=items1.Length -1;i++)
                                    { bg.list1.Add(items1[i]);}
                                            list2.Add(bg);
                            }

                            var bugquery= from c in list2 group c by c.bugid into x select
                                            new Container { BugID = x.Key, Grouped = x };



                            foreach (Container con in bugquery)
                            {
                                StringBuilder files = new StringBuilder();
                                files.Append(con.BugID);
                                files.Append("	");

                                foreach(var x in con.Grouped)
                                {
                                    files.Append(string.Join("	", x.list1.ToArray()));
                                }

                                file.WriteLine(files.ToString());       }


            }
        }
    }

    public class Container
    {
        public string BugID {get;set;}
        public IGrouping Grouped {get;set;}
    }

    public class bug
    { 
        public List list1{get; set;}
        public string bugid{get; set;}

        public bug()
        {
            list1=new List();
        }       


    }
}


}

Dweeberly · Accepted Answer

From your description it sounds like you want to do this:

        List  bugs = new List();
        var lines = System.IO.File.ReadLines(@"/home/bugs");
        foreach (var line in lines) {
            string[] items = line.Split('	');
            bug bg=new bug();
            bg.bugid = items[0];
            bg.list1 = items.Skip(1).OrderBy(f => f).Distinct().ToList();
            bugs.Add(bg);
            }

This will produce a list of objects, where each object has a unique list of filenames.

how to remove duplicate data for group query in Linq

Answers (2)

Related Questions