Reputation: 359
I am working on image detection where I am detecting and classifying an image into one of 14 different thoric diseases (multi-label classification problem).
The model is trained on NIH dataset
with which I get 80% AUC
. Now I want to improve the model by training on a second dataset. But the main problem is both dataset's classes are not matched.
The second dataset contains 10 classes that overlap with the first dataset with which I trained the model.
Questions:
Is it possible to retrain a model on fewer classes.
Will retraining my model on a new dataset impact the AUC of other non-similar classes?
How big is the chance that this will improve the model?
The model and code are based on fast.ai
and PyTorch
.
Upvotes: 3
Views: 1030
Reputation: 677
Based on discussion in the comments:
Upvotes: 2