Django ORM filter queryset for duplicates

Question

I'm looking for a way to get a queryset for possible duplicates in the database.

Simplified models:

class Artist(models.Model):
    name = models.CharField(max_length=255, db_index=True)

class Track(models.Model):
    name = models.CharField(max_length=255, db_index=True)
    artist = models.ForeignKey(related_name='tracks')

What is working so far is a query to get 'Tracks' with equal names:
(not so elegant, however the speed does not matter much as the query is only used in infrequent maintenance work):

qs = Track.objects.all()
duplicates = Track.objects.values('name')\
    .annotate(Count('id'))\
    .filter(id__count__gt=1)    
qs = qs.filter(name__in=[item['name'] for item in duplicates])

Any ideas how to extend this to get a queryset where Track.name and the related Artist.name are possible duplicates?

Django ORM filter queryset for duplicates

Answers (1)

Related Questions