compare two csv files and do something if there is a match between two fields

Question

So I have a nested for loop

for rowst in student:
    for rowtu in tutor:
        if rowst['RegGroup'][-3:] in rowtu['StaffCode']:
            print (rowst['RegGroup'][-3:],rowtu['StaffCode'])
            print("----------------------------------------")

student is student = csv.DictReader(fr) tutor is tutor = csv.DictReader(fr2)

What I am trying to achieve is to compare the two files to check if the last 3 characters in student RegGroup match the tutors staffcode. Then print something as shown in the code.

the result I get is:

FLI FLI
----------------------------------------

This suggests it is only working for the first or last value of the first for loop which isn't what I want. I have checked and Yes there are more than one reg group's that matches because I used the reg groups in the student file to populate a unique list of tutors staff codes.

can anyone tell me where i'm going wrong as my friend seems to think that my implementation should work?

as requested, some of the csv data (can't share for gdpr reasons but can show the two fields im comparing)

tutors.csv

StaffCode
FLI
RTH
POD
DFI
LNO
VAI
HPI
LNE
SLA
ASP
HST
RCO
WKI
GBA
RKI
BPE
SMI
NRY
CSC

subset of students.csv (the XX represents a yeargroup)

RegGroup
XXFLI
XXRTH
XXPOD
XXDFI
XXLNO
XXVAI
XXFLI
XXLNO
XXHPI
XXLNO
XXPOD
XXHPI
XXLNE
XXLNO
XXRTH
XXHPI
XXRTH
XXLNO
XXVAI
XXDFI
XXVAI
XXFLI
XXRTH
XXFLI
XXLNE
XXDFI
XXVAI
XXLNE

Harpe · Accepted Answer

The dictReader is an iterator that goes over the file once and has to be re-initiated after reading the file once.

Here is a code example that works, but is not really elegant:

with open("tutor.csv") as stu:
    student = csv.DictReader(stu)
    for rowst in student:
        with open("student.csv") as tu:
            tutor = csv.DictReader(tu)
            for rowtu in tutor:
                if rowst['RegGroup'][-3:] in rowtu['StaffCode']:
                    print (rowst['RegGroup'][-3:],rowtu['StaffCode'])
                    print("----------------------------------------")

the line "with open..." creates a context in which the file is available and is automatically closed afterwards. However for large files, this is not something you want to repeat and you should store your data in an appropriate object.

For that, you can use something like numpy.loadtxt.

compare two csv files and do something if there is a match between two fields

Answers (2)

Related Questions