Natural language classifier returns classifications for untrained items

Question

I am confused as to how NLC works. My expectation is that when it is asked to classify text that it should have no relation or training data to learn from it should return no results or results with very low confidence scores.

I have trained a model with a set of training data and when I attempt to classify text that is outside of the training data I am getting results with high confidence values (~60%).

Here's an example of my training data:

foo,1,2,3,4
bar,1,2,3,4
baz,1,2,3,4

When I try to classify the text "This should not exist" I receive a high confidence that this text is "1".

Is my assumption correct in that I should be returned values in this case? Am I training the data to classify foo, bar, and baz incorrectly? If not what should I expect from the NLC service?

Natural language classifier returns classifications for untrained items

Answers (1)

Related Questions