Reputation: 21682
I want to create a series of lists, all of varying lengths. Each list will contain the same element e
, repeated n
times (where n
= length of the list).
How do I create the lists, without using a list comprehension [e for number in range(n)]
for each list?
Upvotes: 813
Views: 1058852
Reputation: 394785
Create List of Single Item Repeated n Times in Python
Depending on your use-case, you want to use different techniques with different semantics.
For immutable items, like None, bools, ints, floats, strings, tuples, or frozensets, you can do it like this:
[e] * 4
For example:
>>> [None] * 4
[None, None, None, None]
Note that this is usually only used with immutable items (strings, tuples, frozensets, etc) in the list, because they all point to the same item in the same place in memory.
For an example use-case, I use this when I have to build a table with a schema of all strings, so that I don't have to give a highly redundant one to one mapping.
schema = ['string'] * len(columns)
Multiplying a list gives us the same elements over and over. The need for this is infrequent:
[iter(iterable)] * 4
This is sometimes used to map an iterable into a list of lists:
>>> iterable = range(12)
>>> a_list = [iter(iterable)] * 4
>>> [[next(l) for l in a_list] for i in range(3)] # uninteresting usage
[[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11]]
We can see that a_list
contains the same range iterator four times:
>>> from pprint import pprint
>>> pprint(a_list)
[<range_iterator object at 0x7f9fe3b58420>,
<range_iterator object at 0x7f9fe3b58420>,
<range_iterator object at 0x7f9fe3b58420>,
<range_iterator object at 0x7f9fe3b58420>]
I've used Python for a long time now, and I have seen very few use-cases where I would do the above with mutable objects.
Instead, to have repeated, say, a mutable empty list, set, or dict, you should do something like this:
list_of_lists = [[] for _ in iterator_of_needed_length]
The underscore is simply a throwaway variable name in this context.
If you only have the number, that would be:
list_of_lists = [[] for _ in range(4)]
The _
as the throwaway name is not really special, but a static code analyzer will probably complain if you don't intend to use the variable and use any other name.
Beware doing this with mutable objects, when you change one of them, they all change because they're all the same object:
foo = [[]] * 4
foo[0].append('x')
foo now returns:
[['x'], ['x'], ['x'], ['x']]
But with immutable objects, you can make it work because you change the reference, not the object:
>>> l = [0] * 4
>>> l[0] += 1
>>> l
[1, 0, 0, 0]
>>> l = [frozenset()] * 4
>>> l[0] |= set('abc')
>>> l
[frozenset(['a', 'c', 'b']), frozenset([]), frozenset([]), frozenset([])]
But again, mutable objects are no good for this, because in-place operations change the object, not the reference:
l = [set()] * 4
>>> l[0] |= set('abc')
>>> l
[set(['a', 'c', 'b']), set(['a', 'c', 'b']), set(['a', 'c', 'b']), set(['a', 'c', 'b'])]
Upvotes: 132
Reputation: 280
import numpy as np
a = np.full(fill_value=1, shape=3)
print(a)
b = np.repeat(a=[1,2,3],repeats=3)
print(b)
c = np.repeat(a=[[1,2,3]],repeats=3,axis=0).flatten()
print(c)
Upvotes: -1
Reputation: 23011
If you're seeking
[1, 1, 1, 2, 2, 2, 3, 3, 3]
without numpy, you can use the builtin itertools module
from itertools import chain
list(chain.from_iterable(zip(*[[1,2,3]]*3)))
With a simple list comprehension (without even itertools)
[e for x in zip(*[[1,2,3]]*3) for e in x]
Upvotes: 3
Reputation: 837916
You can also write:
[e] * n
You should note that if e is for example an empty list you get a list with n references to the same list, not n independent empty lists.
Performance testing
At first glance it seems that repeat is the fastest way to create a list with n identical elements:
>>> timeit.timeit('itertools.repeat(0, 10)', 'import itertools', number = 1000000)
0.37095273281943264
>>> timeit.timeit('[0] * 10', 'import itertools', number = 1000000)
0.5577236771712819
But wait - it's not a fair test...
>>> itertools.repeat(0, 10)
repeat(0, 10) # Not a list!!!
The function itertools.repeat
doesn't actually create the list, it just creates an object that can be used to create a list if you wish! Let's try that again, but converting to a list:
>>> timeit.timeit('list(itertools.repeat(0, 10))', 'import itertools', number = 1000000)
1.7508119747063233
So if you want a list, use [e] * n
. If you want to generate the elements lazily, use repeat
.
Upvotes: 1188
Reputation: 395
Sorry for my really late answer
You can use numpy.repeat
easily. Just by writing the value that you would like to produce and the number of repetition.
import numpy as np
x = [1,2,3]
y = np.linspace(0,1000,10000)
for i in x:
new_x = np.repeat(i,len(y))
print(new_x)
Upvotes: 4
Reputation: 453
If you are looking for a simple repeat like:
[1, 2, 3, 1, 2, 3, 1, 2, 3]
simply use:
[1, 2, 3] * 3
But if you are seeking for:
[1, 1, 1, 2, 2, 2, 3, 3, 3]
This one is better while takes more time:
numpy.concatenate([([i]*3) for i in [1,2,3]], axis=0)
Upvotes: 9
Reputation: 107588
Itertools has a function just for that:
import itertools
it = itertools.repeat(e,n)
Of course itertools
gives you a iterator instead of a list. [e] * n
gives you a list, but, depending on what you will do with those sequences, the itertools
variant can be much more efficient.
Upvotes: 32
Reputation: 17026
As others have pointed out, using the * operator for a mutable object duplicates references, so if you change one you change them all. If you want to create independent instances of a mutable object, your xrange syntax is the most Pythonic way to do this. If you are bothered by having a named variable that is never used, you can use the anonymous underscore variable.
[e for _ in xrange(n)]
Upvotes: 18
Reputation: 16553
>>> [5] * 4
[5, 5, 5, 5]
Be careful when the item being repeated is a list. The list will not be cloned: all the elements will refer to the same list!
>>> x=[5]
>>> y=[x] * 4
>>> y
[[5], [5], [5], [5]]
>>> y[0][0] = 6
>>> y
[[6], [6], [6], [6]]
Upvotes: 226