Michael B. Currie

Reputation: 14668

numpy vs list for non-numerical data

Numpy arrays for numerical data clearly work great, but is it slower to use them for non-numerical data?

For instance, say I have some nested lists of text data:

mammals = ['dog', 'cat', 'rat']
birds = ['stork', 'robin', 'penguin']

animals1 = [mammals, birds]

When accessing and manipulating this data is this list of nested lists going to be faster than the numpy array equivalent?

import numpy as np
animals2 = np.array(animals1)

Since numpy arrays are implemented as "strided" arrays in which every element has the same fixed size, a list of mostly short strings with a few long ones will use a disproportionate amount of memory when converted to a numpy array: each slot is padded to the length of the longest string. But what about speed?
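To illustrate the question's memory point, this sketch (with an illustrative word list, not the animals above) shows how numpy sizes a string dtype to the longest element:

```python
import numpy as np

# NumPy stores strings in a fixed-width dtype sized to the longest element,
# so every slot reserves as much space as the longest string needs.
words = np.array(['a', 'b', 'supercalifragilistic'])
print(words.dtype)     # <U20: every element reserves 20 unicode code points
print(words.itemsize)  # 80: bytes per element (4 bytes per code point)
```

Here the one-character strings 'a' and 'b' each occupy 80 bytes, the same as the 20-character string.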

Upvotes: 3

Views: 2867

Answers (1)

Justin O Barber

Reputation: 11601

As @JoshAdel has pointed out, you should become familiar with the timeit module. I believe you are asking about this comparison:

>>> import timeit
>>> timeit.timeit('[[x.upper() for x in y] * 10000 for y in animals1]', setup="mammals = ['dog', 'cat', 'rat']\nbirds = ['stork', 'robin', 'penguin']\nanimals1 = [mammals, birds]", number=10000)
1.7549941045438686
>>> timeit.timeit("numpy.char.upper(animals2)", setup="import numpy\nmammals = ['dog', 'cat', 'rat']\nbirds = ['stork', 'robin', 'penguin']\nanimals1 = [mammals, birds] * 10000\nanimals2=numpy.array(animals1)", number=10000)
221.09816223832195

I updated the test based on your comment. It's a good question, but to see how numpy.char performs you may just need to try some other operations with it yourself. Its source points to a compiled .pyd (DLL-type) extension module that exposes a _vec_string function.
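Note that the two statements above do slightly different amounts of string work (the list version replicates its result with `* 10000`, while the numpy version uppercases the full replicated array). A like-for-like sketch, in which both sides uppercase the same 20,000 × 3 block, might look like this (timings vary by machine, so none are shown):

```python
import timeit

# Shared setup: the same replicated data for both timings.
setup = """
import numpy
mammals = ['dog', 'cat', 'rat']
birds = ['stork', 'robin', 'penguin']
animals1 = [mammals, birds] * 10000
animals2 = numpy.array(animals1)
"""

# Pure Python: call str.upper() on every element.
t_list = timeit.timeit('[[x.upper() for x in y] for y in animals1]',
                       setup=setup, number=100)

# NumPy: numpy.char.upper still loops over Python-level strings internally.
t_numpy = timeit.timeit('numpy.char.upper(animals2)',
                        setup=setup, number=100)

print(t_list, t_numpy)
```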

Clearly there is a difference between the two snippets above, with numpy taking over 100 times longer to execute a numpy.char.upper() operation than Python takes to execute the built-in .upper() string method.

timeit is very simple to use for small snippets of code like this.

Upvotes: 5
