Serdia
Serdia

Reputation: 4418

How to deal with missing values in Azure Machine Learning Studio

Looks like I have 672 mission values, according to statistics. There are NULL value in QuotedPremium column.

enter image description here

I implemented Clean Missing Data module where it should substitute missing values with 0, but for some reason I'm still seeing NULL values as QuotedPremium, but...it says that missing values are = 0

enter image description here

enter image description here

Here you see it tells me that missing values = 0, but there are still NULLs

enter image description here

So what really happened after I ran Clean Missing Data module? Why it ran succesfully but there are still NULL values, even though it tells that number of missing values are 0.

Upvotes: 0

Views: 2162

Answers (2)

akashperfect
akashperfect

Reputation: 291

Since they are not really missing values, its a string NULL which is added to all these cells. So, in order to substitute these values with 0 you can use this below:

Use Execute R Script module, and add this code in it.

dataset1 <- maml.mapInputPort(1); # class: data.frame
dataset1[dataset1 == "NULL"] = 0; # Wherever cell's value is "NULL", replace it with 0
maml.mapOutputPort("dataset1"); # return the modified data.frame

Image for same:

enter image description here

Upvotes: 1

desertnaut
desertnaut

Reputation: 60321

NULL is indeed a value; entries containing NULLs are not missing, hence they are neither cleaned with the 'Clean Missing Data' operator nor reported as missing.

Upvotes: 2

Related Questions