Reputation: 83
I've been exploring neural networks and have successfully trained a network, even on my own images, to label individual pictures as certain things. However, I don't know how to use that trained network to identify, and ideally locate, multiple objects in one image. For example, if you trained on cats and dogs and one image contains several cats and dogs, how would you apply the trained network and return each one's location in the image?
Here is the main tutorial I followed for implementation in Python: http://machinelearningmastery.com/object-recognition-convolutional-neural-networks-keras-deep-learning-library/
A general answer would suffice: is a sliding window over the image the best solution, or is there something easier?
A specific example (particularly in Python) would be appreciated. I've been using matplotlib for most of the image work, so I'd prefer to stay away from PIL slicing.
Thanks!
Upvotes: 3
Views: 1521
Reputation: 329
As you want to use your existing trained network:
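One option, which the question itself raises, is a sliding window: cut the image into overlapping patches, classify each patch with the network you already have, and keep the positions of the confident ones. Below is a minimal sketch using only NumPy slicing (no PIL). The names `sliding_windows`, `detect`, `classify_patch`, and `toy_classifier` are made up for illustration, as are the window size, stride, and threshold; `toy_classifier` is only a stand-in for your trained model.

```python
import numpy as np

def sliding_windows(image, window=(32, 32), stride=16):
    """Yield (x, y, patch) for each window position, using NumPy slicing."""
    win_h, win_w = window
    img_h, img_w = image.shape[:2]
    for y in range(0, img_h - win_h + 1, stride):
        for x in range(0, img_w - win_w + 1, stride):
            yield x, y, image[y:y + win_h, x:x + win_w]

def detect(image, classify_patch, window=(32, 32), stride=16, threshold=0.9):
    """Return (x, y, width, height, label, score) for each confident window.

    classify_patch is a hypothetical callable wrapping your trained network:
    it takes one patch and returns (label, score).
    """
    win_h, win_w = window
    detections = []
    for x, y, patch in sliding_windows(image, window, stride):
        label, score = classify_patch(patch)
        if score >= threshold:
            detections.append((x, y, win_w, win_h, label, float(score)))
    return detections

def toy_classifier(patch):
    """Stand-in for a trained network: scores a patch by its mean intensity."""
    return "bright", float(patch.mean())

# Synthetic 64x64 grayscale image containing one bright 32x32 square.
demo_image = np.zeros((64, 64), dtype=np.float32)
demo_image[32:64, 0:32] = 1.0
demo_detections = detect(demo_image, toy_classifier)
```

With a real model, `classify_patch` would reshape the patch to the network's input shape, call something like `model.predict`, and return the top class and its probability. An image loaded with `matplotlib.pyplot.imread` is already a NumPy array, so it can be passed in directly. In practice you would probably also run several window sizes and merge overlapping hits (for example with non-maximum suppression), since neighbouring windows tend to fire on the same object.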
If you are open to other networks, check out Conditional Random Fields as Recurrent Neural Networks (CRF-RNN), which labels every pixel rather than whole images. Example: https://github.com/torrvision/crfasrnn
Hope this helps.
Upvotes: 2