Script-based Migrations #101

enobayram · 2022-12-05T13:58:34Z

We're currently using beam-automigrate + a number of custom migration commands at chainweb-data start up for managing our Postgres schema. This combination served us well so far, but going forward we're planning to use more and more advanced features of Postgres like partitioned tables, triggers etc. so it seems like we'd be better served by converting to the more conventional migration scheme of expressing the schema migrations as a sequence of SQL scripts to be executed at start up.

As part of this transition to the script-based migrations, we also need to consider the case of an existing chainweb-data deployment using a very old version of the chainweb-data binary. Here's how we're planning to perform this transition in a way that works for the existing chainweb-data deployments too:

We declare that the next release is a "transition version".
The transition version keeps migrating the database using beam-automigrate same as how been doing it so far.
The transition version also runs the MigrationInitialization command of postgresql-simple-migration seems like a good library to support this alternative way of managing schema migrations. Marking the deployment as having gone through the "transition migration". I.e. when this command is run, we know that the DB schema is in the known transition state.
We use pg_dump to take a snapshot of the Postgres DB schema as it appears at the transition version and manually go through the dump to make sure we don't have anything deployment-specific in there so that it can be run on any empty Postgres database. We call the resulting SQL dump init.sql.
Post-transition versions of chainweb-data performs the following logic:
- If blocks table doesn't exist) That means this is an entirely new deployment, so we run init.sql followed by the incremental migrations scripts.
- If blocks table exists) Then we check whether MigrationInitialization has already been performed. (I.e. we run the MigrationValidation MigrationInitialization MigrationCommand.)
  - If MigrationInitialization was run) This means this is a properly transitioned deployment, so we just apply any missing migration scripts.
  - If MigrationInitialization was not run) That means this is a pre-transition deployment, so we exit with a message instructing the user to run the transition release

The text was updated successfully, but these errors were encountered:

This commit extends the current schema migration with the transition release phase of the upcoming script-based schema migrations. The full transition plan can be found at: #101

* Prepare for the script based migrations This commit extends the current schema migration with the transition release phase of the upcoming script-based schema migrations. The full transition plan can be found at: #101 * Trim trailing whitespace * Use System.Exit.die for migration failure

This PR is a part of the incremental script-based migration transition #101. This PR adds a `migrate` subcommand to `chainweb-data` that can be used to just run migrations and exit. This will be useful for the post-script-based-migration version; When that version encounters a database that's too old, it'll ask the user to download the latest `beam-automigrate`-based CW-D version (the upcoming release) and run it in this PR's `migrate` mode so that the database can be used with future versions that know how do migrate it incrementally from the last `beam-automigrate`-based database state.

This PR implements the second and last step of the transition to script based migrations (#101). After this PR, new CW-D database migrations can be implemented by creating new scripts in the `haskell-src/db-schema/migrations` folder. The scripts in that folder must be named as `1.2.3.4_somename.sql`. This file names correspond to the version components `[1,2,3,4]` along with the step name `somename.sql`. The version components can contain an arbitrary number of elements, so `1.2.3_othername.sql` is also valid. The migration logic implemented by this PR aims to be fairly conservative in that it expects the existing migrations to be a perfect prefix of the incoming migrations with the correct order. The order of the migrations are defined by the version components. The condition that the existing migrations need to be a prefix all the desired migrations means that once a set of migrations are run, we can only append new migrations and those migrations have to have version components that are bigger than the existing migrations. The reason why we're being conservative like this is to avoid very subtle issues that occasionally arise due to migrations running in different orders in different deployments. It's also worth noting that the `--migrations-folder` introduced by this PR is optional and when that argument is not provided, CW-D uses the set of migrations that get embedded into the binary from the repository during compilation. The purpose is to avoid increasing the operational complexity of running CW-D from a compiled binary. The set of migrations associated with a CW-D release are tightly coupled with the Haskell code that comes with it anyway. Another point worth noting is that this migrations workflow also allows CW-D operators to interleave their own migrations with the official migrations that come with CW-D. If the operator of a particular CW-D node wants to include additional migrations, they can do so by maintaining a `migrations` folder of their own and including the official CW-D migrations side by side with their own migrations. In this setup, they need to name their own migration scripts to have version numbers that are compatible with this migration workflow. Resolves #101 * Implement script-based migrations * Return the migration steps in matchRecursive * Preorder steps and detect duplicates * Fix warnings * Base64 encode the migration checksum * Simplify matchSteps

We've recently (#101) moved away from `beam-automigrate` for handling our DB migrations, but we've mostly kept the DB schema compatibility checks that come with it as a safety measure. One of the important benefits of #101 was the leniency it brought towards external DB schema manipulations that might be useful to `chainweb-data` operators. To that end, our DB migration check was already ignoring `TableRemoved` diffs so that new tables can be created in the DB without causing issues. This PR extends that leniency towards other types of objects `beam-automigrate` looks for. Their omission was an oversight.

enobayram added the tech-debt Related to future development costs label Dec 5, 2022

enobayram self-assigned this Dec 5, 2022

enobayram added a commit that referenced this issue Dec 5, 2022

Prepare for the script based migrations

b33993b

This commit extends the current schema migration with the transition release phase of the upcoming script-based schema migrations. The full transition plan can be found at: #101

enobayram mentioned this issue Dec 5, 2022

Prepare for the script based migrations #102

Merged

enobayram mentioned this issue Dec 23, 2022

Add migrate command to chainweb-data CLI #115

Merged

enobayram mentioned this issue Mar 3, 2023

Search related continuations in tx search #66

Merged

enobayram mentioned this issue Mar 22, 2023

Implement script-based migrations #139

Merged

enobayram closed this as completed in #139 Mar 24, 2023

enobayram mentioned this issue Apr 12, 2023

Avoid checkTables errors due to unexpected DB objects #144

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Script-based Migrations #101

Script-based Migrations #101

enobayram commented Dec 5, 2022

Script-based Migrations #101

Script-based Migrations #101

Comments

enobayram commented Dec 5, 2022