Numpy - How do you normalize specific features in a dataset?

Question

There is set of data that has a mixture of continuous and symbolic data, such as the following:

data = [[duration, protocol, bytes, rate],
        [0,        tcp,      215,   0.45],
        [4,        udp,      1474,  0.63],
        [63,       icmp,     30,    0.07]]

The 1st, 3rd, and 4th columns are continuous features while the 2nd column is symbolic.

Is there a way to normalize the 1st, 3rd, and 4th columns without touching the 2nd, and without having to remove the second from the set of data?

Edit: For this problem, I want to normalize the data by making each column between 0 and 1 based on the min and max of each column.

Numpy - How do you normalize specific features in a dataset?

Answers (1)

Related Questions