Storing numpy arrays as PyTables cell element

Question

I have 4 files with data in the following format: 3 files contain numpy arrays with different dimensions, say, 20, 30 and 25. The number of records in each file is the same, say 10000. The fourth file contains 1000 floats (as many as arrays in each file). I attempt to create a table based on these files with the following structure:

+-----------------------------------------------------------+
| VecsFile #0   | VecsFile #1   | VecsFile #2   | FloatFile |
+-----------------------------------------------------------+
|np.ndarray(20,)|np.ndarray(30,)|np.ndarray(25,)|   0.1     |
+-----------------------------------------------------------+
|np.ndarray(20,)|np.ndarray(30,)|np.ndarray(25,)|   0.2     |
                               ...

By I encountered that PyTables doesn't receive numpy array as valid type for cell data.

Code: import tables import numpy as np

def create_table_def(n_files):
    table_def = dict()
    for rnum in range(n_files):
        table_def['VecsFile #'+str(rnum)] = tables.Col.from_atom(tables.Float64Atom())
    table_def['FloatFile'] = tables.Col.from_atom(tables.Float64Atom())

    return table_def

r0 = np.load('file0.npy')
r1 = np.load('file1.npy')
r2 = np.load('file2.npy')
s = np.random.rand(*r0.shape)


with tables.open_file('save.hdf', 'w') as saveFile:
    table_def = create_table_def(3)
    table = saveFile.create_table(saveFile.root, 'que_vectors', table_def)
    tablerow = table.row
    for i in range(r0.shape[0]):
        print(r0[i])
        tablerow['VecsFile #0'] = r0[i]
        tablerow['VecsFile #1'] = r1[i]
        tablerow['VecsFile #2'] = r2[i]
        tablerow['FloatFile'] = s[i]
        tablerow.append()
    table.flush()

And I get the following traceback:

    Traceback (most recent call last):
  File "C:/scratch_6.py", line 27, in 
    tablerow['VecsFile #0] = r0[i]
  File "tables	ableextension.pyx", line 1591, in tables.tableextension.Row.__setitem__
TypeError: invalid type () for column ``VecsFile #0``

Am I doing something wrong? Or is this way to store such vectors and column with floats as one file without appending all these vectors to a numpy matrix? I want to use it for appending rows with vectors and one float in future, ranging them and delete them.

Storing numpy arrays as PyTables cell element

Answers (1)

Related Questions