shodanshok

Reputation: 167

Fastest method to recognize an all-NULL string in Python

Suppose a performance-critical code section reads equally-sized data blocks from a disk file. How can I detect an all-null string/data block in the shortest possible time?

This is my current code:

# options.blocksize = 1024*1024
f, dummy = do_open(dev, 'r')
zeroblock = '\0'*options.blocksize
while True:
    block = f.read(options.blocksize)
    if not block:
        break
    if block == zeroblock:
        csum = "0000"

As you can see, I am comparing an all-null block against the ones read from the file. This method works but, for large blocks, it spends considerable time in the comparison.

I also tried counting NULL occurrences:

# options.blocksize = 1024*1024
f, dummy = do_open(dev, 'r')
zeroblock = '\0'*options.blocksize
while True:
    block = f.read(options.blocksize)
    if not block:
        break
    if block.count('\0') == options.blocksize:
        csum = "0000"

but it is even slower than the first method.

Any suggestions on how to improve performance? Thanks.

Upvotes: 0

Views: 134

Answers (1)

gilch

Reputation: 11681

Instead of if block == zeroblock: try if not sum(block):. Adding zeros together should be very fast.

if not any(block): should be about as fast and, for sufficiently large blocks, may even perform better. (It short-circuits on the first nonzero byte.)
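If you want to check the tradeoff on your own machine, a quick timeit sketch like this works (Python 3; the 1 MiB size and iteration count here are arbitrary choices, not from the question):

import timeit

zeros = bytes(1024 * 1024)                 # all-zero block: any() must scan everything
dirty = b'\x01' + bytes(1024 * 1024 - 1)   # nonzero first byte: any() exits immediately

for name, blk in (('zeros', zeros), ('dirty', dirty)):
    t_sum = timeit.timeit(lambda: not sum(blk), number=100)
    t_any = timeit.timeit(lambda: not any(blk), number=100)
    print('%s  sum: %.4fs  any: %.4fs' % (name, t_sum, t_any))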

Note that this doesn't work on normal Unicode strings, only on bytestrings (b''), because bytestring iterators return ints instead of 1-character strings. This means you have to open() the file in binary mode, with 'rb' instead of just 'r'.
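Put together, a minimal Python 3 sketch of the question's loop could look like this (do_open, dev and options.blocksize are the question's names, assumed to behave as shown there):

f, dummy = do_open(dev, 'rb')  # binary mode: read() returns bytes, not str
while True:
    block = f.read(options.blocksize)
    if not block:              # empty bytes object means EOF
        break
    if not any(block):         # iterating bytes yields ints in Python 3
        csum = "0000"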


Python 2 doesn't have bytes. The old str type it used instead was byte-based, but its iterator returns length-1 strings instead of the integers we want. So in Python 2 you'll want to use the array module instead. Either upgrade your Python, or try something like this.

from array import array

f, dummy = do_open(dev, 'rb')
while True:
    block = array('B')  # 'B' means bytes. (Actually "unsigned char" in C.)
    try:
        block.fromfile(f, options.blocksize)
    except EOFError:  # Fewer bytes were left than blocksize.
        pass  # Remaining bytes were still appended, though.
    if not block:
        break
    if not any(block):  # sum() might be faster depending on blocksize.
        csum = "0000"

You don't need the try/except part if you know the file divides into blocksize evenly.


You might also try array('L') to load the data as unsigned longs instead of as bytes. The array then contains fewer (bigger) elements, so sum or any would need roughly a quarter as many iterations, or an eighth where a C long is 8 bytes, but you'd have to make sure your blocksize is a multiple of the item size.
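A rough Python 2 sketch of that idea, reusing do_open and options.blocksize from the question and assuming both the blocksize and the file size are multiples of the platform's unsigned-long size:

from array import array

itemsize = array('L').itemsize             # typically 4 or 8, platform-dependent
assert options.blocksize % itemsize == 0   # fromfile() counts items, not bytes

f, dummy = do_open(dev, 'rb')
while True:
    block = array('L')
    try:
        block.fromfile(f, options.blocksize // itemsize)
    except EOFError:  # short read at the end of the file
        pass
    if not block:
        break
    if not any(block):  # each zero item stands for itemsize zero bytes
        csum = "0000"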

Upvotes: 2
