Is it possible to do pixel-wise classification where each pixel has multiple labels?

Question

Using a CNN, I would like to take an image where each pixel is annotated with 3 labels. Something like this:

0,1 (not object/object)
0,1,2,3... (Class of object, eg cat,dog)
0,1,2,3...(Object Number of given class eg, 2nd instance of cat)

In other words given a picture of multiple cats and dogs the CNN would output that a given pixel is from an object, that object is a cat and it belongs to the second instance of cat in the image (counting from the top left hand corner for example).

Is this possible to do with a single CNN or would I have to combine multiple CNN's to achieve this result?

EDIT: I should note I understand I would initially have to train the CNN with annotated images where each pixel already has 2 or 3 labels as above.

Is it possible to do pixel-wise classification where each pixel has multiple labels?

Answers (1)

Related Questions