32
How I used Case..When in Django
I was working on a multi-tenant project and encountered a bug when using Django’s GenericForeignKey
with django-tenants
. It was using the public schema’s contenttype_id
instead of the tenant schema’s contenttype_id
.
So, if I have a model of Comment
, my django_content_type
table would have something like
public.django_content_type
id | app_label | model |
---|---|---|
15 | comments | comment |
tenant.django_content_type
id | app_label | model |
---|---|---|
19 | comments | comment |
There shouldn’t be a problem here since django-tenants
should handle this because it chooses the id
of the tenant first and then only use the public
as a fall back value. But for some reason, it was sometimes using the public id so comments aren’t appearing at all!
In order to fix this, I opted to remove django_content_type
table from all of my tenants and should only use the public’s django_content_type
values.
What I had to do was to update the contents inside my models that were using GenericForeignKey
s, which in my case is the comments
table.
Here’s the model:
# comments/models.py
class Comment(models.Model):
user = models.ForeignKey(settings.AUTH_USER_MODEL, on_delete=models.CASCADE)
parent = models.ForeignKey("self", null=True, blank=True, on_delete=models.SET_NULL)
path = models.CharField(max_length=350)
text = models.TextField()
timestamp = models.DateTimeField(auto_now_add=True)
updated = models.DateTimeField(auto_now=True)
active = models.BooleanField(default=True)
flagged = models.BooleanField(default=False)
target_content_type = models.ForeignKey(ContentType, null=True, blank=True, on_delete=models.SET_NULL)
target_object_id = models.PositiveIntegerField(null=True, blank=True)
target_object = GenericForeignKey("target_content_type", "target_object_id")
I need to update the target_content_type
so that it uses the public id which is 15
instead of 19
.
What I needed to do was:
- Determine which target ids need to be updated. For example, a
comment
can be in anAnnouncement
or in aPost
. So, we’d have to determine thecontent_type_id
forAnnouncement
andPost
in the tenant’s schema and update its value so it uses the one in public. - Update the values using
Case..When
. - Drop the
tenant.django_content_type
table so it would always usepublic.django_content_type
.
For number 1, I had to do a GROUP BY
to determine which id’s I need to update then get its equivalent in the public schema.
For number 2, I had to use the Case..When
syntax. So, for example when
the target_content_type_id
is 19
, then
update its value to 15
.when
the target_content_type_id
is 20
, then
update its value to 12
.
I think I need not explain number 3 since it only drops
the table.
I created a management command for this so it can be easily executed in production. Here’s the code:
from django.apps import apps
from django.contrib.contenttypes.models import ContentType
from django.core.management.base import BaseCommand
from django.db import connection
from django.db.models import Case, F, Value, When
from django_tenants.utils import schema_context
from tenant.models import Tenant
def group_by_sql(schema, table, column):
sql = f"""
SELECT {column} FROM {schema}.{table}
GROUP BY {column}
"""
print(sql)
return sql
class Command(BaseCommand):
help = "One time management command execution to update tenant's content_type_ids"
def handle(self, *args, **options):
has_gfk_models = [
{
'app_label': 'comments',
'model': 'comment',
'col': 'target_content_type_id'
},
{
'app_label': 'notifications',
'model': 'notification',
'col': 'target_content_type_id'
},
{
'app_label': 'notifications',
'model': 'notification',
'col': 'action_content_type_id',
},
{
'app_label': 'prerequisites',
'model': 'prereq',
'col': 'parent_content_type_id',
},
]
for tenant in Tenant.objects.exclude(schema_name='public'):
for has_gfk_model in has_gfk_models:
app_label, model, col = has_gfk_model.values()
# Number 1
with connection.cursor() as cursor:
cursor.execute(group_by_sql(
schema=tenant.schema_name,
table=f"{app_label}_{model}",
column=col))
# Remove null ids
tenant_target_content_type_ids = [_id[0] for _id in cursor.fetchall() if _id[0]]
# print(tenant_target_content_type_ids)
# tenant content_type_id : public content_type_id
ct_ids_map = {}
for ct_id in tenant_target_content_type_ids:
# Get what kind of model the given ID is
with schema_context(tenant.schema_name):
ct_tenant_app = ContentType.objects.get(id=ct_id)
# ... then fetch its equivalent in the public tenant
try:
ct_public = ContentType.objects.get(app_label=ct_tenant_app.app_label, model=ct_tenant_app.model)
ct_ids_map[ct_id] = ct_public.id
except ContentType.DoesNotExist:
# Just skip the apps that aren't installed anymore
print(f'{ct_tenant_app} has been removed from settings.APPS')
continue
# Number 2
Model = apps.get_model(app_label, model)
with schema_context(tenant.schema_name):
# Using CASE..WHEN is much faster compared to bulk_update in this case
# https://docs.djangoproject.com/en/dev/ref/models/conditional-expressions/#conditional-update
whens = []
for tenant_ct_id, public_ct_id in ct_ids_map.items():
# Build query
# when target_content_type_id is 19 then update it to 15
# When(target_content_type_id={tenant_ct_id}, then=Value({public_ct_id}))
when = {
col: tenant_ct_id,
'then': Value(public_ct_id),
}
whens.append(When(**when))
# If we are currently updating comments, the query would look something like
# Comment.objects.update(
# target_content_type_id=Case(
# When(target_content_type_id=17, then=Value(25)),
# When(...),
# default=F(target_content_type_id)))
# )
case_when = {
# When statements should be wrapped in a `Case` so we need to unpack the list `*whens`
col: Case(*whens, default=F(col)),
}
# Filter out the queryset so we don't bother updating other target ids
# The `default` is useless in this case because we are only updating the ids that are needed
# so it's safe to remove the `default=F(col)`.
qs = Model.objects.filter(**{f'{col}__in': ct_ids_map.keys()})
qs.update(**case_when)
print(connection.queries)
# Drop the table so it only uses public.django_content_type
drop_contenttype_table = f"DROP TABLE IF EXISTS {tenant.schema_name}.django_content_type CASCADE"
with connection.cursor() as cursor:
cursor.execute(drop_contenttype_table)
The code above could still be improved but it did the job for me. Also, here’s a link to a GitHub gist: https://gist.github.com/yujinyuz/d257d3ce978deb0bc7a1fecbd3f8d101
32