Python Dictionary Multiple Keys, Search Function

Question

I have the following sample CSV named results1111.csv:

Master #,Scrape,Date of Transaction
2C7E4B,6854585658,5/2/2007
2C7E4B,8283876134,5/8/2007
2C7E4B,4258586585,5/18/2007
C585ED,5554541212,5/18/2004
585868,5555551214,8/16/2012

I have the following code which opens the CSV and then puts the data into multiple dictionaries:

with open('c:\results1111.csv', "r") as f:
    f.next()
    reader = csv.reader(f)
    result = {}
    for row in reader:
        key = row[0]
        result[key] = row[1:]
        values = row[1:]
        telnumber = row[1]
        transdate = row[2]
#print key
#print values
#print telnumber
#print transdate
#print result

        d = {}
        d.setdefault(key, []).append(values)
        print d

The output of the above code is:

{'2C7E4B': [['6854585658', '5/2/2007']]}
{'2C7E4B': [['8283876134', '5/8/2007']]}
{'2C7E4B': [['4258586585', '5/18/2007']]}
{'C585ED': [['5554541212', '5/18/2004']]}
{'585868': [['5555551214', '8/16/2012']]}

I would like to search the dictionaries for any instance where the same key has multiple phone numbers tied to it, such as the first three entries in the output above. When that happens, I would then like to remove the dictionary with the earliest date. I would then like to output all of the remaining dictionaries back into a CSV. The output should look like this:

2C7E4B,8283876134,5/8/2007
2C7E4B,4258586585,5/18/2007
C585ED,5554541212,5/18/2004
585868,5555551214,8/16/2012

Since there are thousands of keys (in the real input csv), I am not sure how to write a statement to do this. Any help is appreciated.

tdelaney · Accepted Answer

You'll need to sort all of the reocrds for a single master by date, which is more easily done with a list than a dict. Since month/day/year date doesn't sort correctly without some sort of conversion, I create a datetime object as the first item of the record. Now the list will sort by date (and if two records have the same date, by telephone number) so it's just a question of finding, sorting and deleting items from the list.

import csv
import collections
import datetime as dt

open('temp.csv', 'w').write("""Master #,Scrape,Date of Transaction
2C7E4B,6854585658,5/2/2007
2C7E4B,8283876134,5/8/2007
2C7E4B,4258586585,5/18/2007
C585ED,5554541212,5/18/2004
585868,5555551214,8/16/2012
""")

with open('temp.csv') as f:
    f.next()
    reader = csv.reader(f)
    # map master to list of transactions
    result = collections.defaultdict(list)
    for row in reader:
        key = row[0]
        # make date sortable
        sortable_date = dt.datetime.strptime(row[2], '%m/%d/%Y')
        result[key].append([sortable_date, row[1], row[2]])

for value in result.values():
    # discard old records
    if len(value) > 1:
        value.sort()
        del value[0]
        # or to delete all but the last one
        # del value[:-1]

keys = result.keys()
keys.sort()

for key in keys:
    transactions = result[key]
    for transaction in transactions:
        print key, transaction[1], transaction[2]

Python Dictionary Multiple Keys, Search Function

Answers (2)

Related Questions