6:[["$","$Le",null,{}],["$","div",null,{"className":"min-h-screen bg-gray-100 p-6","children":[["$","$Lf",null,{}],["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"{\"@context\":\"https://schema.org\",\"@type\":\"QAPage\",\"mainEntity\":{\"@type\":\"Question\",\"name\":\"Finding unique Values from a file\",\"text\":\"

I have a 6 MB CSV file. I want to filter the data by column A and column C and remove any duplicates. What is the easiest way to do this, and how? Any help is very much appreciated.

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"mousey\"},\"upvoteCount\":3,\"answerCount\":2,\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"

Use cut or awk to select fields, then sort and uniq to remove duplicates. For example:

\\n\\n

awk -F\\\",\\\" '{print $1}' A.csv|sort|uniq\\n

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"Navi\"},\"upvoteCount\":6}}}"}}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mb-6 relative","children":[["$","div",null,{"className":"absolute top-4 right-4 flex flex-wrap space-x-2","children":[["$","span","csv",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/csv/1","children":"csv"}]}]]}],["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/d53ea849b871db7a92c128121633f290?s=256&d=identicon&r=PG","alt":"mousey","className":"w-16 h-16 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/2605095/mousey","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"mousey"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",11901]}]]}]]}],["$","h1",null,{"className":"text-2xl font-bold text-gray-800 mb-4","children":"Finding unique Values from a file"}],["$","p",null,{"className":"text-gray-700 mt-4","dangerouslySetInnerHTML":{"__html":"

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm mt-4","children":[["$","p",null,{"children":["Upvotes: ",3]}],["$","p",null,{"children":["Views: ",7101]}]]}]]}],["$","div",null,{"className":"container mx-auto","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-6","children":["Answers (",2,")"]}],[["$","div","10118283",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/5f783b574e38dc79cfa03908f190c4e4?s=256&d=identicon&r=PG","alt":"shweta","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/1328294/shweta","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"shweta"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",101]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

cat foo.csv | cut -f2 -d , | sort | uniq\n

\n\n

This prints the unique values from the 2nd column.

\n\n

cat foo.csv | cut -f1 -d , | sort | uniq\n

\n\n

This prints the unique values from the 1st column.

\n\n

-f < number > : field (column) number to select\n\n-d < delimiter > : field delimiter (a comma in the commands above)\n
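Since the question asks about column A and column C together, the same pipeline can be pointed at two fields at once. The following is only a sketch: sample.csv and its rows are made-up illustration data, and it assumes a simple CSV in which no field contains a quoted comma.

```shell
# Hypothetical sample data, created here so the example runs as-is.
cat > sample.csv <<'EOF'
a,1,x
a,2,x
b,3,y
EOF

# Select fields 1 and 3, then sort and drop repeated pairs.
# sort -u does the same job as sort | uniq.
cut -d , -f1,3 sample.csv | sort -u > unique_pairs.txt

cat unique_pairs.txt
```

Note that this keeps only the two selected columns and discards the rest of each row.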

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",10]}]}]]}],["$","div","4696783",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/0967001e6bcda72d17e40d0f7e452ff1?s=256&d=identicon&r=PG&f=y&so-version=2","alt":"Navi","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/287727/navi","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"Navi"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",8736]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

Use cut or awk to select fields, then sort and uniq to remove duplicates. For example:

\n\n

awk -F\",\" '{print $1}' A.csv|sort|uniq\n
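If whole rows should be kept and duplicates are judged by columns 1 and 3 together, as the question describes, awk can do this in one pass without sorting. This is a sketch rather than a definitive solution: sample.csv and its contents are hypothetical, the array name seen is arbitrary, and it assumes no field contains a quoted comma.

```shell
# Hypothetical sample data, created here so the example runs as-is.
cat > sample.csv <<'EOF'
a,1,x
a,2,x
b,3,y
a,4,y
EOF

# Build a key from fields 1 and 3. A row is printed only the first time
# its key is seen, so the original row order is preserved.
awk -F, '!seen[$1 FS $3]++' sample.csv > deduped.csv

cat deduped.csv
```

The second row is dropped because its key (a and x) repeats the first row; the last row is kept because a paired with y is a new combination.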

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",6]}]}]]}]]]}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mt-6","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-4","children":"Related Questions"}],["$","ul",null,{"className":"list-disc list-inside","children":[["$","li","74637028",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/74637028","className":"text-blue-600 hover:underline","children":"Getting unique values from csv file, output to new file"}]}],["$","li","72313594",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/72313594","className":"text-blue-600 hover:underline","children":"Extracting unique values from multiple rows in csv file"}]}],["$","li","29226073",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/29226073","className":"text-blue-600 hover:underline","children":"Distinct column values in csv file"}]}],["$","li","59230506",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/59230506","className":"text-blue-600 hover:underline","children":"Check for unique elements of csv"}]}],["$","li","59164259",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/59164259","className":"text-blue-600 hover:underline","children":"Finding duplicates, and uniques of the duplicates in a csv"}]}],["$","li","58522549",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/58522549","className":"text-blue-600 hover:underline","children":"How can I find unique fields in a CSV file?"}]}],["$","li","53048002",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/53048002","className":"text-blue-600 hover:underline","children":"Unique elements in columns in csv file using 
python"}]}],["$","li","38846109",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/38846109","className":"text-blue-600 hover:underline","children":"Get unique rows from csv"}]}],["$","li","30156098",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/30156098","className":"text-blue-600 hover:underline","children":"How to get unique values from a csv file"}]}],["$","li","29261616",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/29261616","className":"text-blue-600 hover:underline","children":"Get unique values from a column using Python"}]}]]}]]}]]}],["$","$L11",null,{}],["$","$L12",null,{}],["$","$L13",null,{}],["$","$L14",null,{}],["$","$L15",null,{}]]
