在Django中,如何找到违反唯一的together约束的行



对于以下API端点,

import json
from django.contrib.auth.decorators import login_required
from django.http import JsonResponse, HttpResponseBadRequest
from django.views.decorators.http import require_POST
from lucy_web.models import UserApn

@login_required
@require_POST
def save_apn(request, version):
player_id = json.loads(request.body).get('player_id')
if player_id:
UserApn.objects.get_or_create(user=request.user, player_id=player_id)
return JsonResponse({'status': 'success'})
else:
return HttpResponseBadRequest()

以下是基本模型:

from django.contrib.auth.models import User
from django.db import models
from .timestamped_model import TimeStampedModel

class UserApn(TimeStampedModel):
user = models.ForeignKey(User)
player_id = models.CharField(max_length=255)

get_or_create()的调用引发了一些MultipleObjectsReturned错误。为了解决这个问题,我想对userplayer_id施加unique_together约束。然而,首先,我必须编写一个数据迁移,以消除违反这种唯一的together约束的行。

如何编写选择这些的查询?目前已提出以下建议:

def remove_duplicate_apns(apps, schema_editor):
UserApn = apps.get_model('lucy_web', 'UserApn')
previous_user_id = None
previous_player_id = None
for apn in UserApn.objects.all().order_by('user_id', 'player_id'):
if apn.user_id == previous_user_id and apn.player_id == previous_player_id:
print(f'deleting {apn} (id: {apn.id})')
apn.delete()
else:
previous_user_id = apn.user_id
previous_player_id = apn.player_id

不过,这似乎也可以在单个查询中完成。

更新

我发现可以将两个字段userplayer_id传递给.values(),然后使用.distinct()检查重复项。例如,以下测试通过:

from django.test import TestCase
from django.contrib.auth.models import User
from myapp.models import UserApn

class UserApnTest(TestCase):
def test_1(self):
user = User.objects.create_user(username='jayz')
apn1 = UserApn.objects.create(user=user, player_id='foo')
apn2 = UserApn.objects.create(user=user, player_id='foo')
apn3 = UserApn.objects.create(user=user, player_id='bar')
self.assertEqual(
len(UserApn.objects.values('user', 'player_id')) -
len(UserApn.objects.values('user', 'player_id').distinct()), 1)

然而,问题仍然存在,它的输出是带有user_idplayer_id的字典,但原始的id丢失了,所以我不能随后get()重复的对象并删除它们。如何做类似的事情,但保留对重复对象的引用?

我设法将重复的UserApn分组到以下查询集中:

UserApn.objects.all().difference(UserApn.objects.distinct('user', 'player_id'))

请注意,向distinct()传递多个参数仅适用于PostgreSQL。

最新更新