Reputation: 337
What's the easiest and most efficient way to reduce the duplication of data?
I tried to write an algorithm myself, but it quickly got way too complicated.
I have the data kept in an array like this: [[date, 'country_code', value], [date, 'country_code', value], ...]
For example, I have [[2019-01-23, "GER", 200],[2019-01-23,"USA",300],[2019-01-23,"GER", 301]].
And I need:
[[2019-01-23,"GER", 501],[2019-01-23,"USA",300]]
Upvotes: 2
Views: 53
Reputation: 3170
The most idiomatic way to do that is to use a Counter from the collections module:
>>> from collections import Counter
>>> data = [
... ['2019-01-23', 'GER', 200],
... ['2019-01-23', 'USA', 300],
... ['2019-01-23', 'GER', 301],
... ]
>>> counter = Counter()
>>> for date, country_code, count in data:
... counter[(date, country_code)] += count
...
>>> counter
Counter({('2019-01-23', 'GER'): 501, ('2019-01-23', 'USA'): 300})
>>> output_data = [[date, country_code, count] for (date, country_code), count in counter.items()]
>>> output_data
[['2019-01-23', 'GER', 501], ['2019-01-23', 'USA', 300]]
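As a side note (not part of the original answer): if you also want the rows ordered by the aggregated value, Counter.most_common() returns the (key, count) pairs sorted by count, descending:

```python
from collections import Counter

# The counter built above, repeated here so the snippet runs on its own.
counter = Counter({('2019-01-23', 'GER'): 501, ('2019-01-23', 'USA'): 300})

# most_common() yields (key, count) pairs sorted by count, descending.
rows = [[date, code, n] for (date, code), n in counter.most_common()]
print(rows)  # [['2019-01-23', 'GER', 501], ['2019-01-23', 'USA', 300]]
```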
Upvotes: 1
Reputation: 362786
Accumulate with a defaultdict, and use a list comprehension to collect the results:
>>> from collections import defaultdict
>>> L = [
...     ['2019-01-23', 'GER', 200],
...     ['2019-01-23', 'USA', 300],
...     ['2019-01-23', 'GER', 301],
... ]
>>> d = defaultdict(int)
>>> for date, code, n in L:
... d[date, code] += n
...
>>> [[date, code, n] for (date, code), n in d.items()]
[['2019-01-23', 'GER', 501], ['2019-01-23', 'USA', 300]]
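For comparison, the same aggregation can also be sketched with itertools.groupby after sorting (this is an alternative, not the answer's approach; the input list is repeated so the snippet runs on its own):

```python
from itertools import groupby
from operator import itemgetter

L = [
    ['2019-01-23', 'GER', 200],
    ['2019-01-23', 'USA', 300],
    ['2019-01-23', 'GER', 301],
]

# groupby only merges *adjacent* equal keys, so sort by (date, code) first.
key = itemgetter(0, 1)
result = [
    [date, code, sum(row[2] for row in rows)]
    for (date, code), rows in groupby(sorted(L, key=key), key=key)
]
print(result)  # [['2019-01-23', 'GER', 501], ['2019-01-23', 'USA', 300]]
```

The dict-based accumulators above are O(n) and usually simpler; groupby mainly pays off when the data is already sorted.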
Upvotes: 4