Normal Distribution using Numpy

Question

I want to generate a dataset of m random data points with k dimensions each, giving an array of shape (m, k). The points should be i.i.d. draws from a normal distribution with mean 0 and standard deviation 1. I see two ways of generating them.

First way:

import numpy as np

m, k = 100, 3  # example sizes; not specified in the question

# Initialize the array
Z = np.zeros((m, k))

# Draw every coordinate of every point with its own call
for datapoint in range(m):
    z = [np.random.standard_normal() for _ in range(k)]
    Z[datapoint] = z

Second way:

import numpy as np

m, k = 100, 3  # example sizes; not specified in the question

# Sample all m * k values in a single call
Z = np.random.normal(0, 1, (m, k))

My understanding is that the second way produces points that are not independent of each other, while the first way produces an i.i.d. dataset. Is that really the difference between the two pieces of code?
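One way to probe this empirically is to seed the generator identically before each approach and compare the results. The sketch below assumes the legacy global `np.random` state that both snippets use. The sizes `m = 5`, `k = 3` and the names `Z_loop`, `Z_vec`, and `same` are made up for illustration. If the two arrays match, both approaches are consuming the same underlying stream of independent standard normal draws, and the only difference is that the second fills the whole array in one call.

```python
import numpy as np

m, k = 5, 3  # hypothetical sizes, chosen only for this check

# First way: one scalar draw per coordinate
np.random.seed(0)
Z_loop = np.zeros((m, k))
for i in range(m):
    Z_loop[i] = [np.random.standard_normal() for _ in range(k)]

# Second way: one call for the whole (m, k) array, same seed
np.random.seed(0)
Z_vec = np.random.normal(0, 1, (m, k))

# Compare the two arrays element by element
same = np.allclose(Z_loop, Z_vec)
print(same)
```

With matching seeds the comparison is a direct check that passing a shape to `np.random.normal` does not introduce any dependence between entries. The vectorized call is typically much faster for large m and k.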
