Mohit Jain

Reputation: 377

Default size of input split in Hadoop

What is the default input split size in Hadoop? As far as I know, the default block size is 64 MB. Is there any file in the Hadoop jars where we can see the default values of all such settings, like the default replication factor and anything else that has a default in Hadoop?

Upvotes: 2

Views: 3031

Answers (2)

Marco99

Reputation: 1659

Remember these two parameters: mapreduce.input.fileinputformat.split.minsize and mapreduce.input.fileinputformat.split.maxsize. I refer to these as minSize and maxSize respectively. By default, minSize is 1 byte and maxSize is Long.MAX_VALUE. The block size can be 64 MB, 128 MB, or more.
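
If you want to force a particular split size, here is a minimal driver sketch using the new mapreduce API; the job name and the 128 MB / 256 MB values are just examples:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "split-size-demo");

    // Equivalent to setting the two properties on the command line:
    //   -D mapreduce.input.fileinputformat.split.minsize=...
    //   -D mapreduce.input.fileinputformat.split.maxsize=...
    FileInputFormat.setMinInputSplitSize(job, 128L * 1024 * 1024); // 128 MB
    FileInputFormat.setMaxInputSplitSize(job, 256L * 1024 * 1024); // 256 MB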

The input split size is calculated at runtime with this formula: max(minSize, min(maxSize, blockSize))

Courtesy: Hadoop: The Definitive Guide.
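
To see how the defaults play out, here is a small self-contained sketch in plain Java (no Hadoop dependency) that mirrors what FileInputFormat.computeSplitSize() does internally; the class name is just illustrative:

    public class SplitSizeDemo {
        // Mirrors FileInputFormat.computeSplitSize(blockSize, minSize, maxSize)
        static long computeSplitSize(long blockSize, long minSize, long maxSize) {
            return Math.max(minSize, Math.min(maxSize, blockSize));
        }

        public static void main(String[] args) {
            long minSize = 1L;                  // default: 1 byte
            long maxSize = Long.MAX_VALUE;      // default
            long blockSize = 64L * 1024 * 1024; // 64 MB block

            // With the defaults, the split size equals the block size: 67108864
            System.out.println(computeSplitSize(blockSize, minSize, maxSize));
        }
    }

So with the default minSize and maxSize, the split size is simply the block size.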

Upvotes: 1

Nishu Tayal

Reputation: 20840

Yes, you can see all these default configurations. There are four files: core-default.xml, hdfs-default.xml, yarn-default.xml and mapred-default.xml. They ship inside the Hadoop jars and contain all the default configuration for a Hadoop cluster, which can be overridden via the corresponding *-site.xml files in the etc/hadoop folder. You can refer to the following links:
https://hadoop.apache.org/docs/r2.4.1/hadoop-project-dist/hadoop-hdfs/hdfs-default.xml
https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/core-default.xml
https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/mapred-default.xml

And if you have not set any input split size in your MapReduce program, the HDFS block size will be used as the input split size by default.
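
If you would rather check the effective defaults programmatically than read the XML, here is a rough sketch; it assumes the hadoop-hdfs jar (which bundles hdfs-default.xml) is on the classpath, and the class name is just an example:

    import org.apache.hadoop.conf.Configuration;

    public class ShowDefaults {
        public static void main(String[] args) {
            Configuration conf = new Configuration(); // loads core-default.xml + core-site.xml
            conf.addResource("hdfs-default.xml");     // HDFS defaults, bundled in the hadoop-hdfs jar

            System.out.println("dfs.replication = " + conf.get("dfs.replication")); // 3 by default
            System.out.println("dfs.blocksize   = " + conf.get("dfs.blocksize"));   // 134217728 (128 MB) in Hadoop 2.x
        }
    }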

Upvotes: 1
