NumPy loadtxt()

Question

I have a text file that I want to load into a NumPy array with loadtxt(). The file is tab-delimited, and sometimes there is a value after the last tab instead of an empty field:

Value1\tValue2\tvalue3\t
Value4\tValue5\tvalue6\tvalue7
Value8\tValue9\tvalue10\t
Value11\tValue12\tvalue13\t

However, NumPy raises an error on the line that has the extra value:

ValueError: Wrong number of columns

Is it possible to load such a data structure directly into a NumPy array (with None where the value is missing)? Or do I have to open the file, insert a None wherever there is no value, and then load the modified text file into a NumPy array?

Thanks

tmdavison · Accepted Answer

Neither numpy.genfromtxt nor numpy.loadtxt can deal with an uneven number of columns in a file. If you have access to pandas, pandas.read_table can do what you need.

import pandas as pd

# header=None: the file has no header row; sep='\t': fields are tab-delimited
df = pd.read_table('myfile.txt', header=None, sep='\t')

# to get the data in a numpy ndarray:
myarray = df.values

By default, missing values are assigned NaN, but you can change that with df.fillna(value).
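
Since the question asks for None rather than NaN, here is a minimal sketch of one way to get there. It assumes the file looks like the sample in the question (every row has three tabs, and the last field is sometimes empty). The in-memory `sample` string stands in for the real file, and the variable names are only illustrative. It converts to an object array and swaps the NaN entries for None using pd.isna:

```python
import io

import pandas as pd

# Hypothetical stand-in for the tab-delimited file from the question:
# each row has three tabs, and the last field is sometimes empty.
sample = (
    "Value1\tValue2\tvalue3\t\n"
    "Value4\tValue5\tvalue6\tvalue7\n"
    "Value8\tValue9\tvalue10\t\n"
)

# Empty trailing fields are read as NaN.
df = pd.read_csv(io.StringIO(sample), sep="\t", header=None)

# Convert to an object array so it can hold None, then replace NaN with None.
arr = df.to_numpy(dtype=object)
arr[pd.isna(arr)] = None

print(arr)
```

An object array is needed here because a float or string NumPy array cannot store None. If a placeholder string is acceptable instead, df.fillna('') before the conversion should be simpler.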
