Add ability to wait for an environment #1957

8W9aG · 2024-09-16T21:40:25Z

Add a wait.py containing utilities that allow for waiting for specific files to appear, and for loading imports. These are defined by the environment variables COG_WAIT_FILE and COG_EAGER_IMPORTS (CSV). If neither of these environment variables exist this is a no-op.
Separate out BasePredictor and BaseInput into their own class files now that they are accessed by multiple consumers (config and predictor).
Create a Config class which is an abstraction around cog.yaml allowing for environment variables to be used as a substitute for certain attributes of the cog.yaml, in theory allowing us to load the cog environment without relying on this file provided we have set the right environment variables.
Change create_app to take a Config class rather that a dictionary.
Add an env_property function decorator for reading from an environment variable before dropping into its wrapped function. This prevents the need for loading cog.yaml if the property is guarded by this.
Add a Mode enumeration that delineates between the different modes of predict and train in a type safe way.
Wait for the environment to setup right before the worker process calls setup.
Change the test stubs to use the Config class while injecting in some mock values if applicable.
Add some tests for code_xform to make sense of its inputs and outputs.
Increase test timeout to 20s, this is due to the wait tests using up the previous time allotment of 10s.

This allows cog to begin running and do as much work as possible with as little information as possible while it waits for a file system to appear around it. Currently cog requires cog.yaml and the prediction/training python to appear before setting up the HTTP client, in this case we can setup the HTTP client while we load these files in the background and get a signal that the file loading is finished and continue loading the predictors setup, in addition to this we perform any pre-emptive work we can while we are waiting for these files to load (such as eagerly importing modules).

* This allows us not to read the config for the app threads until absolutely necessary.

* Delay loading the cog yaml until it is read from in the code.

* Allow environment variables to control whether we wait for a file to appear before further processing * This allows the python interpreter to boot up while we wait for other files to become available

* Allow waiting for a general environment by importing select python packages while the system boots up around cog.

* I shouldn’t need to do this but want to see if it relieves the errors on linux.

* A small test to make sense of what the code stripper is doing.

* Create a class for accessing cog config * Only access variables from the config by properties * Gate those properties with environment variable function decorators to allow fetching the config from the environment rather than a file. * This allows the environment to begin running without needing the /src

* While waiting for the environment to boot, load The designated imports to speed up interpreter Time.

* In strip model source code we use an AST function that isn’t available prior to 3.9

* We aren’t waiting for these imports, we are loading them eagerly while we wait.

* We have so many that we need to increase this.

* Fix an issue with env_property where it could not handle Optional or Union

* Create these functions because they don’t exist in python 3.7

* Do not wait for a signal, just use a while loop with a 10 ms interval to check for presence of file.

* Some tests previously checked the debug logging * Make sure we conform to the same debug logging

* Confirm that this behaviour is consistent with functions.

* Allows better debugging of stderr et al

* Currently predict assumes that PredictionResponse is the serialisation target. * This isn’t the case if we are calling _predict from a training endpoint * Allow response_type to be fed into the _predict method to inform it of what kind of response it should expect.

* Use the proper endpoints to call training functions. * Log the command properly to the user.

Signed-off-by: Will Sackfield <[email protected]>

8W9aG · 2024-11-14T00:46:18Z

This has now been slimmed down to just the wait code, so is ready to be reviewed in its own right.

meatballhat

⏳📁

meatballhat · 2024-11-14T19:57:08Z

python/cog/wait.py

+    full_module_path = os.path.join(
+        pyenv_path,
+        "lib",
+        "python" + os.environ[PYTHON_VERSION_ENV_VAR],


Is it safe to assume PYTHON_VERSION_ENV_VAR is defined?

Yeh, or at the very least I'd like it to hard fail if it isn't present, it would mean something is very wrong in our system configuration

PR has been atomised as much as possible.

8W9aG added 28 commits September 16, 2024 12:22

Send in app threads directly from args

6075901

* This allows us not to read the config for the app threads until absolutely necessary.

Load config right before necessary

3d348cc

* Delay loading the cog yaml until it is read from in the code.

Add waiting for a wait file

1024cac

* Allow environment variables to control whether we wait for a file to appear before further processing * This allows the python interpreter to boot up while we wait for other files to become available

Add wait_for_imports ability

cc7fccf

* Allow waiting for a general environment by importing select python packages while the system boots up around cog.

Fix lint on src_path

19cc38d

Fix watchdog version

e79e25d

Remove load_config in openapi_schema cmd

6dfe3ec

Do not access root files on GHA workers

03f7011

Set recursive to true

4f75de4

* I shouldn’t need to do this but want to see if it relieves the errors on linux.

Watch the directory instead

949792c

Add code_xforms test

c83f2e8

* A small test to make sense of what the code stripper is doing.

Add http server to test to let it respond

7265582

Wait for environment before executing setup

5ff8030

* While waiting for the environment to boot, load The designated imports to speed up interpreter Time.

Fix Type on lower python versions

8f3f41d

Skip test_strip_model_source_code if < 3.9

1bebbc3

* In strip model source code we use an AST function that isn’t available prior to 3.9

Change COG_WAIT_IMPORTS to COG_EAGER_IMPORTS

d61d782

* We aren’t waiting for these imports, we are loading them eagerly while we wait.

Bump integration test timeout to 20 mins

e9464df

* We have so many that we need to increase this.

Add tests for Config class

adb04a4

* Fix an issue with env_property where it could not handle Optional or Union

Fix get_args and get_origin in python 3.7

25fb547

* Create these functions because they don’t exist in python 3.7

Add more tests for config

ef2a1a0

Check wait flag has fallen before eager import

061ff16

Add watch handler tests

33b4106

Remove watchdog and use SIGUSR2 for signalling

2c1f2b9

Fix no torch import in tests

f6d0e45

Merge branch 'main' into add-waiting-env

b480479

Do naive waiting for file

42b7306

* Do not wait for a signal, just use a while loop with a 10 ms interval to check for presence of file.

Merge branch 'main' into add-waiting-env

f79c597

8W9aG requested review from mattt and nevillelyh September 24, 2024 16:52

8W9aG added 8 commits October 21, 2024 12:01

Merge branch 'main' into add-waiting-env

8ea4663

Add consistent debug logging in config

3a29229

* Some tests previously checked the debug logging * Make sure we conform to the same debug logging

Add test_strip_model_source_code_keeps_referenced_class_from_function

7657c23

* Confirm that this behaviour is consistent with functions.

Explicitly check return code in test train

e3c39c3

* Allows better debugging of stderr et al

Make cog train a first class CLI function

a0477d8

* Use the proper endpoints to call training functions. * Log the command properly to the user.

Merge branch 'main' into add-waiting-env

c046ff0

Signed-off-by: Will Sackfield <[email protected]>

Add back missing imports from merge

94d1ff4

8W9aG mentioned this pull request Oct 22, 2024

Make cog train call trainings endpoint #2013

Merged

Merge branch 'main' into add-waiting-env

187eeea

Signed-off-by: Will Sackfield <[email protected]>

8W9aG mentioned this pull request Oct 23, 2024

Add Setup Logging #2018

Merged

8W9aG added 2 commits October 28, 2024 11:19

Merge branch 'main' into add-waiting-env

8ca482f

Signed-off-by: Will Sackfield <[email protected]>

Remove connection import

2840fc7

8W9aG force-pushed the add-waiting-env branch 2 times, most recently from 5cf3ace to 2840fc7 Compare November 1, 2024 19:24

Merge branch 'main' into add-waiting-env

200173c

Signed-off-by: Will Sackfield <[email protected]>

8W9aG mentioned this pull request Nov 1, 2024

Add Config class #2042

Merged

8W9aG added 3 commits November 6, 2024 11:31

Merge branch 'main' into add-waiting-env

bec26e7

Signed-off-by: Will Sackfield <[email protected]>

Fix lint

1398040

Merge branch 'main' into add-waiting-env

7de31a5

8W9aG mentioned this pull request Nov 8, 2024

Add environment variable backed properties to config #2051

Merged

Merge branch 'main' into add-waiting-env

bb57543

Signed-off-by: Will Sackfield <[email protected]>

8W9aG requested a review from nickstenning November 14, 2024 00:46

Add R8_ prefix to PYTHON_VERSION (#2058)

987b7ee

meatballhat approved these changes Nov 14, 2024

View reviewed changes

8W9aG enabled auto-merge (squash) November 14, 2024 20:36

8W9aG merged commit d714a70 into main Nov 14, 2024
19 checks passed

8W9aG deleted the add-waiting-env branch November 14, 2024 20:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to wait for an environment #1957

Add ability to wait for an environment #1957

8W9aG commented Sep 16, 2024 •

edited

Loading

8W9aG commented Nov 14, 2024

meatballhat left a comment

meatballhat Nov 14, 2024

8W9aG Nov 14, 2024

Add ability to wait for an environment #1957

Add ability to wait for an environment #1957

Conversation

8W9aG commented Sep 16, 2024 • edited Loading

8W9aG commented Nov 14, 2024

meatballhat left a comment

Choose a reason for hiding this comment

meatballhat Nov 14, 2024

Choose a reason for hiding this comment

8W9aG Nov 14, 2024

Choose a reason for hiding this comment

8W9aG commented Sep 16, 2024 •

edited

Loading