Reputation: 37
I have a text file with various codes (one code per line) in a single column, and some of them appear more than once (the duplicates are always consecutive). How can I remove the lines with repeated values?
Example: File1.dat
84578
84581
84627
84761
84761
84792
84792
84792
84886
84886
84905
84905
84905
I would like the output to be:
84578
84581
84627
84761
84792
84886
84905
Note: there are no blank lines between the values in my file. Any solution would do: a script, a terminal command, etc. Thanks in advance.
Upvotes: 0
Views: 64
Reputation: 79
# Read every line, then use a set to drop the duplicates
file = open("FileWithDuplicates.txt", "r")
lines = file.readlines()
lines = set(lines)
file.close()

# Overwrite the file with the unique lines
file = open("FileWithDuplicates.txt", "w")
for line in lines:
    file.write(line)
file.close()
This should do the trick. Note, however, that a set does not preserve the original order of the lines, and if the file contained blank lines only one of them would survive.
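If the original order matters (as it does in the example data), here is a minimal sketch of an order-preserving variant using dict.fromkeys, which keeps first-occurrence order on Python 3.7+ (the filename is the same assumed one as above):

# Order-preserving de-duplication: dict.fromkeys keeps each line's first occurrence
with open("FileWithDuplicates.txt") as f:
    unique_lines = list(dict.fromkeys(f))

with open("FileWithDuplicates.txt", "w") as f:
    f.writelines(unique_lines)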
Upvotes: -1
Reputation: 140186
Since the duplicate lines are consecutive, with Linux/MSYS you can simply use uniq.
Output with your data:
$ uniq lines.txt
84578
84581
84627
84761
84792
84886
84905
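To save the result instead of printing it to the terminal, uniq also accepts an output file as a second argument (uniq.txt is just an assumed output name here):

$ uniq lines.txt uniq.txt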
A Python solution, using a generator comprehension that emits a line only if it is the first line or differs from the previous one, and writes it to the output file:
with open("lines.txt") as fr, open("uniq.txt", "w") as fw:
    # Read all lines so each one can be compared with its predecessor
    lines = fr.readlines()
    for line in (x for i, x in enumerate(lines) if i == 0 or lines[i - 1] != x):
        fw.write(line)
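Another short option, not from the answer itself but a common idiom for collapsing consecutive duplicates, is itertools.groupby (same assumed filenames):

from itertools import groupby

with open("lines.txt") as fr, open("uniq.txt", "w") as fw:
    # groupby clusters consecutive equal lines; write one line per cluster
    for line, _group in groupby(fr):
        fw.write(line)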
Upvotes: 2