Giving a composite score for merged dictionary keys

Question

I am working on a project that needs to say that a certain ID is most likely. Let me explain using example. I have 3 dictionaries that contain ID's and their score

Ex: d1 = {74701: 3, 90883: 2}

I assign percentage score like this,

d1_p = {74701: 60.0, 90883: 40.0} , here the score is the (value of key in d1)/(total sum of values)

Similarly i have 2 other dictionaries

d2 = {90883: 2, 74701: 2} , d2_p = {90883.0: 50.0, 74701.0: 50.0}

d3 = {75853: 2}, d3_p = {75853: 100.0}

My task is to give a composite score for each ID from the above 3 dictionaries a decide a winner by taking the highest score. How would i mathematically assign a composite score between 0-100 for each of these ID's??

Ex: in above case 74701 needs to be the clear winner.

I tried giving average, but it fails, because I need to give more preference for the ID's that occur in multiple dictionaries. Ex: lets say 74701 was majority in d1 and d2 with 30,40 values. then its average will be (30+40+0)/3 = 23.33 , while 75853 which occurs only once with 100% will get (100+0+0)/3 = 33.33 and it will be given as winner, which is wrong.

Hence can somone suggest a good mathematical way in python with maybe code to give such score and decide majority?

luizbarcelos · Accepted Answer

Instead of trying to create a global score from different dictionaries, since your main goal is to analyze frequency I would suggest to summarize all the data into a single dictionary, which is less error prone in general. Say I have 3 dictionaries:

a = {1: 2, 2: 3}
b = {2: 4, 3: 5}
c = {3: 4, 4: 9}

You could summarize these three dictionaries into one by summing the values for each key:

result = {1: 2, 2: 7, 3: 9, 4: 9}

That could be easily achieved by using Counter:

from collections import Counter

result = Counter(a)
result.update(Counter(b))
result.update(Counter(c))
result = dict(result)

Which would yield the desired summary. If you want different weights for each dictionary that could also be done in a similar fashion, the takeaway is that you should not be trying to obtain information from the dictionaries as separate entities, but instead merge them together into one statistic.

Giving a composite score for merged dictionary keys

Answers (2)

Related Questions