On Postgres, ULID, and Django

PostgreSQL does not natively support the ULID type. Two common workarounds for this are using the UUID fields or strings. Both have implications and don’t look very neat.

Nevertheless, Postgres is an extensible fella. There are many ULID extensions on the Internet, and one of them brags its supremacy. Not to mention, it is written on Rust. Common sense tells us <irony>everything must be written on Rust</irony>.

The taming of Postgres

You have at least two options for integrating Rust-ULID support into Postgres. The first is very long and requires compiling the source code and doing many other unpleasant things. I’ve gone this road once, and it was nothing interesting. The second is easier.

Postgres ships an official Docker image. It’s Debian-based (there is also an Alpine one, but who cares). All that we need is to download the latest Debian package (0.2.0 at the moment of writing). The next step requires injecting this package into a new Docker image based on the official Postgres image.

FROM postgres:latest

COPY pgx_ulid-v0.2.0-pg17-amd64-linux-gnu.deb .

RUN apt update && \
    apt install /pgx_ulid-v0.2.0-pg17-amd64-linux-gnu.deb && \
    rm -rf pgx_ulid-v0.2.0-pg17-amd64-linux-gnu.deb && \
    apt clean && \
    rm -rf /var/lib/apt/lists/*

Build the image.

docker build -t postgre-ulid .

Start the container. I always prefer to cook some docker compose-wrapper, one can start it as a standalone container, whatever.

services:
    db:
        container_name: my_awesome_db
        image: postgre-ulid
        environment:
            - POSTGRES_DB=pgdb
            - POSTGRES_USER=pguser
            - POSTGRES_PASSWORD=pgpassword01

Start and jump into it.

docker-compose up -d
docker exec -it my_awesome_db psql -d pgdb -U pguser -W

If everything is ok, you will have access to the PSQL CLI which meets you with a friendly prompt pgdb=#.

To validate that the extension works we need to activate it. Then check it is not malfunctioning.

pgdb=# CREATE EXTENSION ulid;
pgdb=# SELECT gen_ulid();

I don’t want to discuss the monotonicity here, leave it to a motivated reader.

Django Unchained

So far, so good. We have our DB understanding of the ULID notation. Now, we need our backend to do the same trick.

Django does not also support ULID by default. I’ve found a project where authors bring this support. Cool? Not so fast.

In the description for the ULIDField you can find the next line:

This field type is natively stored in the DB as a UUID (when supported) and a string/varchar otherwise.

That’s exactly how I started this post. Have we spent our time in vain taming Postgres?

Postgres supports UUID natively, the Django ULID lib checks the support and translates your data from ULID to UUID:

return value.uuid if connection.features.has_native_uuid_field else str(value)

One day there will be a connection.features.has_native_ulid_field, but not today. Today we have to use a string value instead:

return str(value)

Given this, Django inserts a ULID value into a SQL query as a plain string. On the other hand, Postgres uses this value to fulfill the ULID column, not the VARCHAR one.

Django manages the automatic provisioning of primary keys. It implicitly creates them for you and tells your database to produce new values. The DB eventually returns these values thanks to the RETURNING SQL keyword. In the Django ULID lib, we have to provide primary key fields and values for them from Python.

Unchaining

The Django ULID library is very small. It only has three files, two of which may seem redundant. To make your code less convoluted and unwieldy, it may be desirable to completely rewrite the main class, skipping another dependency in your project.

For the sake of brevity, I’ll use the original lib, but here is an example of a standalone option.

Modifying the Django ULID machinery shouldn’t be too much of a lift, let’s start with the definition of our class:

import django_ulid

class PGULIDField(django_ulid.ULIDField):
    description = 'The native ULID support for Postrgres only'

Next, we need to change the get_internal_type method. It returns a string with the column type for future SQL queries. Change it from UUIDField to ulid. Thus we tell Django to make requests with the right column type. You can check a vendor type here as well:

    def get_internal_type(self, connection):
        # if connection.vendor != "postgresql":
        #    raise Exception(f"Incorrect vendor: {connection.vendor}")
        return 'ulid'

In my code, I changed the get_internal_type method with the db_type one, for the latter is more common (see the Django doc on that).

The get_db_prep_value method returns a value in the format the database backend understands. As Postgres supports UUID fields, we must change this method to use strings. Don’t be afraid, this is only conversion from ULID data to string in Python, the Postgres will store ULID in the end.

    def get_db_prep_value(self, value, *args, **kwargs):
        if value is None:
            return None

        if not isinstance(value, ulid.ULID):
            value = self.to_python(value)

        return str(value)

All things are done! I skipped everything about forms, if you need them, modifying the target class isn’t a big deal as well.

Wait, but by default, Django handles the ID creation for us! Yes, providing an analog of the AutoField with ULID is a complicated topic. I’m not sure it deserves our time, one day they bring the ULID support natively anyway.

To save our time now, I’ve made a mixin for that:

from django.db import models
from django.db.models import Func

class AutoPGULIDMixin(models.Model):
    class Meta:
        abstract = True

    id = PGULIDField(primary_key=True, db_default=Func(function='gen_ulid'))

When you use this mixin with your model, it not only provides a primary key for you, but it also tells Postgres to produce values for it!

Now, you can check these modifications by creating a test model and migrating it into your database:

from django.db import models

from myapp.mixins import AutoPGULIDMixin

class MyFirstULID(AutoPGULIDMixin, models.Model):
    name = models.CharField(max_length=24)

After the migration, there will be a table with two columns, id of the ulid type, and name of the varchar(24) one.

Postgres creates a B-Tree index for the ULID field automatically. If you don’t see it in the pgadmin tool, don’t worry, pgadmin does not show indexes for primary keys at all.

You can use ULID primary keys not bothering yourself with their management!

Misc

Django provides you with a way to activate third-party extensions. Please, follow this answer. Another more convenient option is to tell Postgres to activate it during the initialization:

services:
    db:
        container_name: my_awesome_db
        image: postgre-ulid
        environment:
            - POSTGRES_DB=pgdb
            - POSTGRES_USER=pguser
            - POSTGRES_PASSWORD=pgpassword01
        volumes:
            - ./init.sql:/docker-entrypoint-initdb.d/init.sql

All that you need is to provide the init.sql file with the right set of SQL queries.