Does pandas != 'a value' return NaNs?

Question

When I use x['test'] = df['a_variable'].str.contains('some string'), I get:

True
NaN
NaN
True
NaN

If I use x[x['test'] != True], should I get back the rows where the value is NaN?

Thanks.

EdChum · Accepted Answer

Yes, this is expected behaviour:

In [3]:
import numpy as np
import pandas as pd

df = pd.DataFrame({'some_string':['asdsa','some',np.nan, 'string']})
df

Out[3]:
  some_string
0       asdsa
1        some
2         NaN
3      string

In [4]:
df['some_string'].str.contains('some')

Out[4]:
0    False
1     True
2      NaN
3    False
Name: some_string, dtype: object

Using the above as a mask, the comparison != False keeps the NaN row, because NaN compares as not equal to any value:

In [13]:
df[df['some_string'].str.contains('some') != False]

Out[13]:
  some_string
1        some
2         NaN

So yes, x[x['test'] != True] will return the NaN rows along with the False rows.
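To see why the NaN rows survive either comparison, here is a minimal sketch. It assumes a hand-built object-dtype Series standing in for the str.contains result shown above; the names not_true, not_false and only_false are only for illustration.

```python
import numpy as np
import pandas as pd

# Hand-built stand-in for the str.contains result above (object dtype with a NaN)
mask = pd.Series([False, True, np.nan, False], dtype=object)

not_true = mask != True    # NaN != True evaluates to True
not_false = mask != False  # NaN != False also evaluates to True

print(not_true.tolist())
print(not_false.tolist())

# To select only the real non-matches, rule out the missing values explicitly
only_false = mask.notna() & (mask != True)
print(only_false.tolist())
```

Both comparisons produce a plain boolean Series, so they are safe to use for indexing, but the NaN position is True in each of them.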

If you pass na=value to str.contains, missing entries take that value in the result instead of NaN:

In [6]:
df['some_string'].str.contains('some', na=False)

Out[6]:
0    False
1     True
2    False
3    False
Name: some_string, dtype: bool

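This matters because using a mask that still contains NaN directly for boolean indexing, e.g. df[mask], raises an error (a ValueError in current pandas versions) instead of silently dropping those rows.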
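As a rough sketch of how the na=False result can be used directly as a mask (the column is forced to object dtype here so it matches the session above, and the names matches and non_matches are made up for the example):

```python
import numpy as np
import pandas as pd

# Same data as above; dtype=object keeps the column comparable to the session output
df = pd.DataFrame(
    {'some_string': pd.Series(['asdsa', 'some', np.nan, 'string'], dtype=object)}
)

# na=False turns missing entries into False, giving a clean boolean mask
mask = df['some_string'].str.contains('some', na=False)

matches = df[mask]        # rows whose string contains 'some'
non_matches = df[~mask]   # everything else, including the NaN row

print(matches)
print(non_matches)
```

Whether the NaN row should count as a non-match is a judgment call; passing na=True instead would put it with the matches.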
