Karanpreet Singh

Reputation: 429

Update a nested dict with a Manager from the multiprocessing module

I've been trying to update a nested dictionary using multiprocessing.

I'm able to add the information I want if the dictionary's values are lists of elements, but if the value is a nested dictionary I don't see any changes.

I'm aware that the multiprocessing docs say it's a DictProxy and not a dict, and I've tried adapting the example from the module documentation, but I haven't had any luck.

import socket
import csv
from pprint import pprint
from multiprocessing import Pool,Process,current_process,Manager

def dns_lookup(aggregate,ip):
    try:
        hostname=socket.gethostbyaddr(ip)[0]
        proc_name = current_process().name
        #print(str(hostname) + " extracted from ip " + str(ip) + " by process id: " + str(proc_name) )
        aggregate[ip]+=hostname
    except Exception as e:
        pass

if __name__=='__main__':
    procs=[]
    manager=Manager()
    aggregate=manager.dict()
    with open("ip_list.csv","r") as ipfile:
        csv_reader=csv.reader(ipfile)
        for index,row in enumerate(csv_reader):
            if index == 0:
                pass
            else:
                aggregate[row[0]]=row[1:]
                #ips.append((row[0]))
                proc = Process(target=dns_lookup, args=(aggregate,row[0],))
                procs.append(proc)
                proc.start()

    for proc in procs:
        proc.join()
    pprint(dict(aggregate))

The above code works, but if I try to change the original dict to

aggregate[row[0]]={'Other Items':row[1:]}

and then try to update it like this:

d['hostname']=hostname
aggregate[ip]=d
#aggregate[ip]+=d

it doesn't have any effect.

I need the values to be dictionaries, not lists of elements.

The current file is small, but I will have to scale this up to about 10k lookups, so multiprocessing is required.

Any help is much appreciated.

Thanks, Karan

Upvotes: 2

Views: 1190

Answers (1)

hansaplast

Reputation: 11593

Yes, it seems that the update to the nested dict is not propagated back through the manager. Even making the inner dict a manager.dict() doesn't solve the problem. What does work is building a new dict from scratch and assigning it to aggregate[ip]:

aggregate[ip] = {"hostname": hostname, "Other Items": aggregate[ip]['Other Items']}

This may be a bug, but I would advise you to make bigger changes to your code anyway. There are two shortcomings:

  1. You use aggregate both as a "queue" of IPs that still need to be looked up and as a result container that the processes write to. Splitting this into a queue and a dict that only holds results avoids the problem you have: then you only read from the queue and only write into the result container aggregate.
  2. If you have 1,000 lines in your CSV file, you'd end up with 1,000 processes, whereas your computer can only serve number-of-cores processes at a time. On Linux you'd waste a lot of memory; on Windows you would start 1,000 Python interpreters from scratch. Use a Pool instead and let Python figure out the number of cores and distribute the work across that many processes.

I have reworked your code into this:

import socket
import csv
from pprint import pprint
from queue import Empty  # multiprocessing queues raise queue.Empty when drained
from multiprocessing import Pool, Queue, current_process, Manager

def dns_lookup(aggregate, queue):
    while True: # live as long as there are items in the queue
        try:
            row = queue.get_nowait()
        except Empty: # queue is drained, this worker is done
            break
        ip = row[0]
        other_items = row[1:]
        try:
            hostname = socket.gethostbyaddr(ip)[0]
        except OSError: # lookup failed, skip this ip (like the try/except in your original code)
            continue
        aggregate[ip] = {
            "hostname": hostname,
            "other items": other_items,
            "process_name": current_process().name}

if __name__=='__main__':
    procs=[]
    manager=Manager()
    aggregate=manager.dict()
    queue = Queue()
    with open("ip_list.csv","r") as ipfile:
        csv_reader=csv.reader(ipfile)
        next(csv_reader) # skip the header row (instead of the "if index == 0: pass")

        for row in csv_reader: # fill queue before starting any processes
            queue.put(row)

        # start x processes, where None says to take x = the number of cpus returned by `cpu_count()`
        pool = Pool(None, dns_lookup, (aggregate, queue))
        pool.close() # signal that we won't submit any more tasks to pool
        pool.join() # wait until all processes are done
        pprint(dict(aggregate))

Besides: you would be better off using threads instead of multiprocessing, since your workers are blocked by networking and not by the CPU. Multiprocessing only makes sense if you need to keep a CPU core busy at 100%.
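
For illustration, a minimal sketch of that thread-based variant (assuming the same ip_list.csv layout as above; max_workers=50 is an arbitrary pick you would tune). Since threads share memory, a plain dict is enough and no Manager is needed:

import csv
import socket
from concurrent.futures import ThreadPoolExecutor
from pprint import pprint

def dns_lookup(row):
    ip, other_items = row[0], row[1:]
    try:
        hostname = socket.gethostbyaddr(ip)[0]
    except OSError:
        hostname = None  # lookup failed
    return ip, {"hostname": hostname, "other items": other_items}

if __name__ == '__main__':
    with open("ip_list.csv", "r") as ipfile:
        csv_reader = csv.reader(ipfile)
        next(csv_reader)  # skip the header row
        rows = list(csv_reader)

    results = {}
    with ThreadPoolExecutor(max_workers=50) as executor:
        # threads share memory, so results can be collected in a normal dict
        for ip, info in executor.map(dns_lookup, rows):
            results[ip] = info
    pprint(results)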

Upvotes: 2
