python boto3 s3 client filter subdirectories and depth

Question

I am Trying to fetch list of subdirectories in S3 bucket without returning any filenames.

My S3 bucket have following structure.

s3://my-bucket/databases/mysql--    # host-2022-09-09-10
s3://my-bucket/databases/mysql--/tarfiles.tar.gz

I am trying to return only directory names like mysql--. I don't need any more sub directories or filenames inside mysql-xx.

As everything is stored as objects, I couldn't find any solution like setting depth-level etc.

my code:

        s3 = boto3.resource('s3')
        my_bucket = s3.Bucket(S3_BUCKET)
        prefix = 'databases/mysql-'
        for item in my_bucket.objects.filter(Prefix=prefix):
            st.write(item.key)

Other option is to do pythonic grep/filtering the filenames. But it won't help as every request will scan all the files and return and entire list has to be filtered. Unnecessarily gets expensive.

Thank you!

python boto3 s3 client filter subdirectories and depth

Answers (1)

Related Questions