Modify only a few bytes of a NumPy .npz file without rewriting the whole file

Question

The following code writes a NumPy array plus metadata to a compressed .npz file and loads it back (the compression gains nothing here because the data is random, but anyway):

import numpy as np

# save
D = {"x": np.random.random((10000, 1000)), "metadata": {"date": "20221123", "user": "bob", "name": "abc"}}
with open("test.npz", "wb") as f:
    np.savez_compressed(f, **D)

# load
D2 = np.load("test.npz", allow_pickle=True)
print(D2["x"])
print(D2["metadata"].item()["date"])

Let's say we want to change only one metadata field:

D["metadata"]["name"] = "xyz"

Is there a way to rewrite only D["metadata"] to test.npz on disk, rather than the whole file, given that D["x"] has not changed?

In my case, the .npz file can be anywhere from 100 MB to 4 GB, which is why rewriting only the metadata would be worthwhile.
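
For context, a .npz file is a ZIP archive in which each key is stored as its own .npy member, so the metadata sits in a separate entry from the large array. Below is a minimal sketch that lists those members with the standard zipfile module. It assumes a small stand-in file named layout_demo.npz (a hypothetical name, with a much smaller array than in my real case) and only inspects the archive; it does not modify anything.

```python
import zipfile

import numpy as np

# Small stand-in for the data above (array shape reduced so this runs quickly).
# "layout_demo.npz" is a hypothetical file name used only for this illustration.
data = {
    "x": np.random.random((100, 10)),
    "metadata": {"date": "20221123", "user": "bob", "name": "abc"},
}
np.savez_compressed("layout_demo.npz", **data)

# An .npz file is a ZIP archive: each key becomes a "<key>.npy" member.
with zipfile.ZipFile("layout_demo.npz") as zf:
    members = {info.filename: info.compress_size for info in zf.infolist()}

for name, size in members.items():
    print(f"{name}: {size} compressed bytes")
```

The metadata member is tiny compared with the array member, which is why replacing just that entry would be attractive.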
