Reputation: 359
I have a huge dataset stored on my disk. Since the dataset is about 1.5 TB, I divide it into 32 chunks so that I can save each one with numpy.save('data_1.npy', data_1) in Python 2.7. Here is a sample of 9 sub-datasets; each one is about 30 GB. The shape of each .npy file is (number_of_examples, 224, 224, 19) and the values are floats.
data_1.npy
data_2.npy
data_3.npy
data_4.npy
data_5.npy
data_6.npy
data_7.npy
data_8.npy
data_9.npy
Saved with np.save('*.npy'), my dataset occupies 1.5 TB on my disk.
1) Is there an efficient way to compress my dataset in order to gain some free disk space?
2) Is there an efficient way of saving the files that takes up less space than np.save()?
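For reference, the saving pattern described above looks roughly like this; a minimal sketch where small placeholder arrays stand in for the real ~30 GB chunks:

import numpy as np

# Tiny placeholder chunks; in practice each chunk has shape
# (number_of_examples, 224, 224, 19) and is about 30 GB.
chunks = [np.random.rand(2, 224, 224, 19) for _ in range(3)]

for i, chunk in enumerate(chunks, start=1):
    # np.save takes the file name and the array to write;
    # this produces data_1.npy, data_2.npy, ...
    np.save('data_{}.npy'.format(i), chunk)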
Thank you
Upvotes: 4
Views: 4552
Reputation: 43
You might want to check out xz compression, mentioned in this answer. I've found it to be the best compression method while saving hundreds of thousands of .npy files adding up to a few hundred GB. The shell command for a directory called dataset containing your .npy files would be:
tar -vcJf dataset.tar.xz dataset/
Or with long arguments:
tar --verbose --create --xz --file=dataset.tar.xz dataset/
This only saves disk space while storing and moving the dataset; the archive has to be decompressed before the files can be loaded back into Python.
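A minimal sketch of unpacking and loading, assuming Python 3 (whose tarfile module reads xz archives natively; on Python 2.7, extract with tar -xJf dataset.tar.xz in the shell instead):

import tarfile
import numpy as np

# Extract the xz-compressed archive back into the dataset/ directory.
# ('r:xz' requires Python 3.3+; on Python 2.7 extract in the shell first.)
with tarfile.open('dataset.tar.xz', 'r:xz') as archive:
    archive.extractall()

# Load one of the restored chunks exactly as before.
data_1 = np.load('dataset/data_1.npy')
print(data_1.shape)  # (number_of_examples, 224, 224, 19)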
Upvotes: 1