Skip to content

Latest commit

 

History

History
81 lines (48 loc) · 3.64 KB

README.md

File metadata and controls

81 lines (48 loc) · 3.64 KB

domino-maintenance-mode

Easily place Domino in maintenance mode for upgrades and restore afterwards.

Test Docker Repository on Quay

How to report bugs

Please file in a ticket to the Platform Services team.

Create a Jira ticket

Warnings

  • It is important that the system is not being used while taking a snapshot until after the restore is complete. Changes to running executions during or after taking a snapshot of the system may result in executions not being fully stopped, or properly restored. If there are new Model API versions that are not fully updated (new version is in running state and old version is fully stopped), please allow this process to complete before using this tool.

  • It is possible for a running, functioning App to have a PVC mounted from an unregistered EDV that it depends on. Since the EDV no longer exists, the App cannot be restarted with this PVC mounted and can fail. It is not possible to detect this scenario automatically via the API. Please do not unregister EDVs while in maintenance mode.

  • If either stopping or starting fails because the API returned an error, or the operation timed out while waiting for the execution to enter at running state, the tool will emit a warning and save a record of the executions which failed to a log file. This is in JSON format and can be used to follow up manually.

  • Any error log levels or above require manual inspection to determine the state of an execution or executions.

Installation

pip install git+https://github.com/dominodatalab/domino-maintenance-mode.git

Note: Currently requires python < 3.11

Configuration

You must set some environment variables to configure access to the Domino deployment.

  • DOMINO_API_KEY - An administrator's Domino API key.
  • DOMINO_HOSTNAME - The URL to your Domino deployment, including protocol (and port if non-standard).
  • DOMINO_SSL_NO_VERIFY - Optional Set to "true" to disabled server certificate verification.

Usage

  • Take a snapshot of running executions:
dmm snapshot my-snapshot-file.json

This will create a timestamped snapshot file which you will need to use in subsequent steps.

  • Stop all running Apps, Model APIs, Restartable Workspaces, and Scheduled Jobs:
dmm shutdown my-snapshot-file.json
  • Perform Domino maintenance / upgrade.

  • Restore previously running Apps, Model APIs and Scheduled Jobs. Workspaces should be manually restarted by users.

dmm restore my-snapshot-file.json

Domino Version Support

Domino 4.4+, please report any issues that may arise due to API changes, as not all versions have been validated.

Classic Workspaces are not currently supported and should be shutdown manually.