Conditions in NumPy in Python

Question

I have an array, data, created with NumPy in Python.

I want to calculate the mean of column 1 but only for those rows whose second column value is greater than 350.

I think it's something like

data[ > 350][:,1].mean()

I know that > 350 is not correct, but I don't know how to specify that it should check the second column.

Carsten · Accepted Answer

You're almost there. You can select all the rows where the second column is greater than 350 by using:

data[:,1] > 350

This creates a NumPy array of booleans (print it to see what it looks like: it is just True and False values in the shape of data[:,1], depending on whether each element satisfies the condition), which you can use to index data:

data[ data[:,1] > 350 ][:,1].mean()
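
To see the pieces separately, here is a small sketch. The array contents are made up for illustration (the question's actual data isn't shown), and the names mask, selected, result and result_direct are just placeholders:

```python
import numpy as np

# Hypothetical sample data: column 0 is an id, column 1 holds the values to test.
data = np.array([
    [1, 300],
    [2, 400],
    [3, 500],
    [4, 350],
])

mask = data[:, 1] > 350          # boolean array with one entry per row
selected = data[mask]            # keeps only the rows where mask is True
result = selected[:, 1].mean()   # mean of column 1 over the kept rows

# Same result in one indexing step: pick rows by mask and column 1 together.
result_direct = data[mask, 1].mean()

print(mask)
print(result)
```

Note that 350 itself is excluded because the comparison is strict. If no row satisfies the condition, the selection is empty and mean() returns nan with a RuntimeWarning, so you may want to check mask.any() first.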
