Finding a duplicate line within a text file

Question

I am newbie in python and trying to do something with it. Taken a task as a mini project at my work place. It has many parts. Currently I am stuck in following situation.

I have a text file which i Need to do as follows:

What I am trying to do is

take this file as input
read through all the lines
Find the duplicate element in the file and print it out and delete it
print the whole file after printing and removing the duplicate entry
If there is no duplicate entry then print no duplicate entry found and also print the entire list

As of now I could only read the lines and print it. Also the total number of lines in that file

f = open('ABC.txt')
count = 0

for line in f:
    count=count+1
    print(line)
    a=line
    print(a)


print(count)

But after that I a stuck in there.. I am a network administrator and trying to use my bachelor days that I need to compare the line with the proceeding lines. But t do that I am not able to find get it worked like any array or something.. Can anybody please help...

DYZ · Accepted Answer

Typically, you would keep a set of previously seen lines. If a new line is not in the set, add it to the set and print it. If it is in the set, then it is a duplicate.

seen = set()
with open('ABC.txt') as f:
    for line in f:
        if line not in seen:
            seen.add(line)
            print(line)
        else:
            # a dupe

Finding a duplicate line within a text file

Answers (2)

Related Questions