handling a huge file with python and pytables

Question

simple problem, but maybe tricky answer:

The problem is how to handle a huge .txt file with pytables.

I have a big .txt file, with MILLIONS of lines, short lines, for example:

line 1  23458739
line 2  47395736
...........
...........

The content of this .txt must be saved into a pytable, ok, it's easy. Nothing else to do with the info in the txt file, just copy into pytables, now we have a pytable with, for example, 10 columns and millions of rows.

The problem comes up when, with the content in the txt file, 10 columns x millions lines are directly generated in the paytable BUT, depending on the data on each line of the .txt file, new colums must be created on the pytable. So how to handle this efficiently??

Solution 1: first copy all the text file, line by line into pytable (millions), and then iterate over each row on pytable (millions again) and, depending on the values, generate the new columns needed for the pytable.

Solution 2: read line by line the .txt file, do whatever needed, calculate the new needed values, and then send all the info to a pyrtable.

Solution 3:.....any other efficient and faster solution???

handling a huge file with python and pytables

Answers (1)

Related Questions