Reputation: 274
I want to detect more objects than coco dataset which detects only 80 objects , I want to detect as many as possible actions also like hugging ,swimming.....etc.
I don't care about the size and I do not want to train myself ... So is there a dataset(weights) big enough already available that I can download and use or I do have to train and label for yolo?
Upvotes: 2
Views: 350
Reputation: 556
What you are trying to classify is represented as Action Recognition. Here [1] is a good repo that lists a lot of out-of-the-box models for this task.
An explanation: Models (like YOLO) contain two main blocks: feature extraction (CNN stuff) and classification (linear layers). When training from scratch, both feature extraction and classification will be trained from scratch. It is easy to train classification to what you want, but it is hard to train the feature extraction part (as it takes a lot of time). Hence, we typically use pre-trained models on generalized datasets (like YOLO is trained on COCO), so our feature extraction part starts from a somewhat good generalized place. Later, we replace the classification part will our own to be trained from scratch for our task.
TL;DR, you can use a pre-trained YOLO model on COCO for your task by replacing the last linear layers to classify what you want. Here are some resources for different frameworks [2, 3].
Here [4] is a simple walkthrough of how to do this.
Upvotes: 0