Reputation: 49
The data read from the files is stored in a dictionary whose keys are the file names; each value is another dictionary that maps words to their number of occurrences in that file. An example of the structure is below:
data = {
    'file1.txt': {
        'word1': 45,
        'word2': 23,
        ...
    },
    'file2.txt': {
        'word3': 25,
        'word4': 10,
        ...
    },
    ...
}
I want to work out the time complexity of the Python code below.
from collections import Counter

def most_common_words(data):  # wrapper added so the bare return is valid; the name is made up
    words = Counter()
    for file in data:
        words.update(data[file])
    return words.most_common(5)
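Called on the sample data above, this returns the five most frequent words across all the files combined, e.g. print(most_common_words(data)).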
Some thoughts: iterating over the outer dictionary is O(N), where N is the number of files. But the Counter is also updated with every word inside each file, so the complexity also depends linearly on the number of words per file. It can also make a difference whether a file's words are repeated or not: the code spends more time on a file with 5 different words than on a file with the same word occurring 5 times, because the per-file dict stores one entry per distinct word.
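A minimal sketch of that last point (the words and counts are made up): since each per-file dict already collapses repeats into a single entry, Counter.update touches one key per distinct word, not one per occurrence.

from collections import Counter

words = Counter()
# A file with 5 different words: 5 keys to merge into the Counter.
words.update({'a': 1, 'b': 1, 'c': 1, 'd': 1, 'e': 1})
# A file where one word occurred 5 times: only 1 key to merge.
words.update({'a': 5})
print(words['a'])  # 6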
Also, the most_common() method has O(K log N) complexity, with K the number of files and N the number of words.
Is my assumption correct? Or am I missing something?
So, what is the total complexity of the code above?
Upvotes: 0
Views: 171
Reputation: 5860
Big O notation represents the worst-case complexity of the code. If I denote the files as F1, F2, F3, ..., Fn and write Wi for the number of words in the i-th file, the loop's running time behaves like

W1 + W2 + ... + Wn
So if the number of files is n and the maximum number of words in any file is k, the worst-case complexity (when every file has the maximum number of words) is

O(n * k), where k is max(W1, W2, ..., Wn)
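As a toy illustration (the per-file word counts here are invented):

# Hypothetical distinct-word counts W1..Wn for n = 3 files
W = [3, 5, 2]
total_work = sum(W)       # 10 update operations in the loop
bound = len(W) * max(W)   # n * k = 3 * 5 = 15, an upper bound on total_work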
And in the case of most_common: if the word counts were already kept in sorted order (for example in a sorted dict), which they are not in a normal dictionary, you would get a complexity of O(k). Since they are not sorted, the complexity is the same as above, because it has to iterate over the words collected from each file.
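For what it's worth, in current CPython Counter.most_common(n) with a fixed n is (as far as I know) implemented with heapq.nlargest, so a sketch of what the call effectively does is below (the counts are just taken from the example data):

import heapq
from collections import Counter
from operator import itemgetter

words = Counter({'word1': 45, 'word2': 23, 'word3': 25, 'word4': 10})
# Scan every distinct word once, keeping a small heap of the 5 largest counts.
top5 = heapq.nlargest(5, words.items(), key=itemgetter(1))
print(top5)  # same result as words.most_common(5)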
Upvotes: 1