Reputation: 1189
Hi, I want to check which of the words (alphanumeric ids) listed in one file also appear in another file of words.
For example, I have a file f1.txt (20K in size):
w1
w2
w3
w4
.. //more ids like this
and another file f2.txt (120K in size):
q1
q2
q3
q4
q5
q6
q7
q8
w2
So I want to check how many, and which, ids from f1.txt are present in f2.txt.
I want the output to look like this:
1
w2
I know this is easy and can be done with loops, but I want to know whether it can be done with bash scripting and tools like grep, since that would be fast; I mainly want to analyze the data. Python would also do.
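For example, I am wondering whether something along the lines of this sketch would do it with grep (just a guess on my part; it assumes one id per line in both files and exact whole-line matches):
# which ids from f1.txt appear as whole lines in f2.txt (each listed once)
grep -Fxf f1.txt f2.txt | sort -u
# how many of them
grep -Fxf f1.txt f2.txt | sort -u | wc -l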
Any leads appreciated.
Upvotes: 1
Views: 74
Reputation: 297
You can use count().
From the Python docs for str.count(sub[, start[, end]]):
Return the number of non-overlapping occurrences of substring sub in the range [start, end]. Optional arguments start and end are interpreted as in slice notation.
list.count(x) works the same way for lists (returning the number of times x appears in the list), which is what the snippet below uses:
# read both files, then count how many times each word from f1 appears in f2
with open("f1.txt") as f1, open("f2.txt") as f2:
    f1_lines = [line.strip("\n") for line in f1.readlines()]
    f2_lines = [line.strip("\n") for line in f2.readlines()]
for w in f1_lines:
    print(w, f2_lines.count(w))
Upvotes: 1
Reputation: 195079
Since the files are not that big, we can put the first one in memory (an awk hashtable) and compare:
awk 'NR==FNR{a[$0];next}$0 in a{a[$0]++}
END{for(x in a)if(a[x])print x, a[x]}' f1 f2
It outputs:
w2 1
(The output above is just an example; the format can easily be adjusted.)
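For instance, here is a minimal variant (just a sketch, assuming exact whole-line matches and that each matching id should be listed only once) that prints the count first and then the matching ids, as asked for in the question:
awk 'NR==FNR{a[$0];next}                    # load the ids from f1 into hashtable a
     ($0 in a) && !seen[$0]++{hit[n++]=$0}  # record each f1 id found in f2, once
     END{print n+0;                         # how many ids matched
         for(i=0;i<n;i++)print hit[i]       # which ids matched
     }' f1 f2
For the sample files above this prints 1 followed by w2.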
awk                      # the awk command
'NR==FNR{a[$0];next}     # read the first file (f1): store each word as a key in hashtable a
$0 in a{a[$0]++}         # read the second file (f2): if the word is a key in a, increment its count
END{                     # after both files have been processed, print the results
for(x in a)              # go through the hashtable
if(a[x])                 # if the value is > 0 (i.e. the word appears in f2)
print x, a[x]}'          # print which word (the key) and how many times (the value)
f1 f2                    # the two input files
Upvotes: 3