Skip to content

Async index creation

Andreas Brandl requested to merge ab/async-index-creation into master

What does this MR do?

This introduces "async index creation". It aims to offload actual index creation to a more convenient time, e.g. during low-traffic times. We piggy-back on the reindexing cronjob that runs on the weekends and prioritize async index creation over any reindexing activity (this is a hack, potentially should move to a separate cronjob).

Async index creation works in two phases:

  1. Prepare - this is a regular migration to prepare the index, but it does not create it. Instead, it creates a record in postgres_async_indexes with the actual index definition
  2. Create index - this is a regular migration using add_concurrent_index to create the index. If it already exists (due to preceding async index creation), this is a no-op. If it doesn't, it creates the index normally as one would expect. In both cases, we remove the record from postgres_async_indexes.

In that sense, async index creation is "best effort" and optional. If it's not happening async, it'll happen with (2). This doesn't change the time for when you can start relying on the presence of the index - it's still only after add_concurrent_index has been run.

Scope of usage

This is a developer convenience tool aimed for use on GitLab.com only. It is only relevant in the GitLab.com environment, which sees highly frequent deploys and where we can't afford blocking the deploy for many hours to create some indexes.

The workflow is for developers only and requires a manual checkpoint: Has the index been created asynchronously already? If yes, we can merge the second migration that has the add_concurrent_index steps (which are then a no-op). If that goes wrong, i.e. if we merge too early, we would see the index creation block the deploy (as expected).

Background

In context of primary key int4 -> int8 migrations, we need to create a lot of indexes on large tables in %14.2. This can be a manual effort, e.g. doing through the change management process to create them - however, that causes quite a bit of overhead to coordinate.

This change introduces tooling to automate it: We can simply ship migrations to "prepare" indexes, wait for them to be created over the weekend and come back and continue with dependent changes the week after.

It is not a strict alternative to using the change management process, but rather an additional means to getting those indexes created - so whatever turns out to be quicker, we can use that. I expect we'll have more situations like this in the future, so I'd deem this useful in any case.

Status:

  • Works on my machine 😸

Does this MR meet the acceptance criteria?

Conformity

Availability and Testing

Edited by Patrick Bair

Merge request reports

Loading