Find files bigger than 300MB using os.walk in Python?

Question

I have written this code to walk a directory and find files bigger than 300MB.

However, I get a lot of duplicate values and the number of duplicates varies between the files. Can anyone explain this or improve the code for me?

import os

path = 'C:\Users\brentond\Desktop\Lower Thames Crossing'
for foldername, subfolders, filenames in os.walk(path):
    for subfolder in subfolders:
        for filename in filenames:
            if os.path.getsize(os.path.join(foldername, filename))>300000000:
                print(foldername + '\' + filename)

Thierry Lathuille · Accepted Answer

You don't have to explore the subfolders yourself, walk does it for you.

From the doc:

os.walk(top, topdown=True, onerror=None, followlinks=False)

Generate the file names in a directory tree by walking the tree either top-down or bottom-up. For each directory in the tree rooted at directory top (including top itself), it yields a 3-tuple (dirpath, dirnames, filenames).

(emphasis mine)

So, just do:

import os

path = 'C:\Users\brentond\Desktop\Lower Thames Crossing'
for foldername, subfolders, filenames in os.walk(path):
    for filename in filenames:
        if os.path.getsize(os.path.join(foldername, filename))>300000000:
            print(foldername + '\' + filename)

Find files bigger than 300MB using os.walk in Python?

Answers (2)

Related Questions