sort images based on a cluster correspondances list

Question

I have the following working code to sort images according to a cluster list which is a list of tuples: (image_id, cluster_id).
One image can only be in one and only one cluster (there is never the same image in two clusters for example).

I wonder if there is a way to shorten the "for+for+if+if" loops at the end of the code as yet, for each file name, I must check in every pairs in the cluster list, which makes it a little redundant.

    import os
    import re
    import shutil

    srcdir  = '/home/username/pictures/' # 
    if not os.path.isdir(srcdir):
        print("Error, %s is not a valid directory!" % srcdir)
        return None

    pts_cls # is the list of pairs (image_id, cluster_id)

    filelist    = [(srcdir+fn) for fn in os.listdir(srcdir) if  
                  re.search(r'\.jpg$', fn, re.IGNORECASE)]
    filelist.sort(key=lambda var:[int(x) if x.isdigit() else  
                  x for x in re.findall(r'[^0-9]|[0-9]+', var)])

    for f in filelist:
        fbname  = os.path.splitext(os.path.basename(f))[0]

        for e,cls in enumerate(pts_cls): # for each (img_id, clst_id) pair
            if str(cls[0])==fbname: # check if image_id corresponds to file basename on disk)
                if cls[1]==-1: # if cluster_id is -1 (->noise)
                    outdir = srcdir+'cluster_'+'Noise'+'/'
                else:
                    outdir = srcdir+'cluster_'+str(cls[1])+'/' 

                if not os.path.isdir(outdir):
                    os.makedirs(outdir)

                dstf = outdir+os.path.basename(f)
                if os.path.isfile(dstf)==False:
                    shutil.copy2(f,dstf)

Of course, as I am pretty new to Python, any other well explained improvements are welcome!

sort images based on a cluster correspondances list

Answers (1)

Related Questions