dd: reading binary file as blocks of size N returned less data than N

Question

I need to process large binary files in segments. Conceptually this is similar to split, but instead of writing each segment to a file, I need to send each segment as the input to another process. I thought I could use dd to read/write the file in chunks, but the results aren't at all what I expected. For example, if I try:

dd if=some_big_file bs=1M |
    while : ; do
        dd bs=1M count=1 | processor
    done

... the output sizes are actually 131,072 bytes and not 1,048,576.

Could anyone tell me why I'm not seeing output blocked into 1M chunks, and how I could better accomplish what I'm trying to do?

Thanks.

pynexj · Accepted Answer

According to dd's manual:

bs=bytes

[...] if no data-transforming conv option is specified, input is copied to the output as soon as it's read, even if it is smaller than the block size.

So try with dd iflag=fullblock:

fullblock

Accumulate full blocks from input. The read system call may return early if a full block is not available. When that happens, continue calling read to fill the remainder of the block. This flag can be used only with iflag. This flag is useful with pipes for example as they may return short reads. In that case, this flag is needed to ensure that a count= argument is interpreted as a block count rather than a count of read operations.
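
For illustration, here is a minimal sketch of the loop from the question with iflag=fullblock applied. It assumes GNU dd (iflag=fullblock is not available in every dd). The function name process_chunks is hypothetical, and the temporary file is just one possible way to notice end of input so the loop stops instead of running forever:

```shell
#!/bin/sh
# process_chunks FILE CMD [ARGS...]
# Hypothetical helper: runs CMD once per 1M chunk of FILE, chunk on stdin.
process_chunks() {
    file=$1; shift
    tmp=$(mktemp) || return 1
    dd if="$file" bs=1M 2>/dev/null |
    while :; do
        # fullblock: keep calling read until a whole 1M block is
        # accumulated (or EOF), so count=1 really means one block.
        dd bs=1M count=1 iflag=fullblock of="$tmp" 2>/dev/null
        # An empty chunk means the pipe is exhausted.
        [ -s "$tmp" ] || break
        # Feed the chunk from the temp file so CMD cannot consume
        # the rest of the pipe.
        "$@" < "$tmp"
    done
    rm -f "$tmp"
}
```

Called as process_chunks some_big_file processor, this should hand processor full 1M segments, with only the last one possibly shorter. If GNU split is available, split -b 1M --filter='processor' some_big_file may be a simpler way to get the same effect.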
