Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Loop until no eval master #55639

Merged
merged 21 commits into from
Jan 4, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
57803d7
Add loop.until_no_eval that provides similar functionality but withou…
github-abcde Jul 22, 2019
561153a
Added example of how to use loop.until_no_eval in faq.
github-abcde Jul 23, 2019
feb404f
salt/states/loop.py: Updated docstring.
github-abcde Jul 23, 2019
8acb261
Updated docstring for functions for multiple types of timeout,period,…
github-abcde Jul 23, 2019
435fa74
Changed versionchanged/versionadded from Neon to Natrium.
github-abcde Jul 23, 2019
9f03dde
Add explanation of datastructure returned by boto_elb.get_instance_he…
github-abcde Jul 23, 2019
75f6e9d
Fix missed rename during rebasing
github-abcde Dec 13, 2019
9c74220
Add pylint-inspired changes
github-abcde Dec 13, 2019
73765db
Remove unneccesary list coercion
github-abcde Dec 13, 2019
a7e6a06
Merge branch 'master' into loop_until-no-eval-master
github-abcde Dec 16, 2019
02a823d
Merge branch 'master' into loop_until-no-eval-master
github-abcde Dec 18, 2019
8457965
Shut up pylint some more
github-abcde Dec 18, 2019
76775ed
Increase interval and timeout for slow windows machines
github-abcde Dec 19, 2019
02fb92f
Merge branch 'master' into loop_until-no-eval-master
github-abcde Dec 19, 2019
258e77f
Increase interval and timeout once more for slow windows machines
github-abcde Dec 24, 2019
c357998
Merge branch 'master' into loop_until-no-eval-master
github-abcde Dec 24, 2019
d0caf60
Merge branch 'master' into loop_until-no-eval-master
waynew Dec 27, 2019
cb6b22c
Change version Natrium to 3000
github-abcde Jan 2, 2020
657be7f
Merge branch 'master' into loop_until-no-eval-master
github-abcde Jan 2, 2020
69329c8
Merge branch 'master' into loop_until-no-eval-master
github-abcde Jan 2, 2020
01ebfed
Merge branch 'master' into loop_until-no-eval-master
github-abcde Jan 3, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 29 additions & 0 deletions doc/faq.rst
Original file line number Diff line number Diff line change
Expand Up @@ -394,6 +394,35 @@ Restart the Minion from the command line:
salt -G kernel:Windows cmd.run_bg 'C:\salt\salt-call.bat service.restart salt-minion'
salt -C 'not G@kernel:Windows' cmd.run_bg 'salt-call service.restart salt-minion'

Waiting for minions to come back online
***************************************

A common issue in performing automated restarts of a salt minion, for example during
an orchestration run, is that it will break the orchestration since the next statement
is likely to be attempted before the minion is back online. This can be remedied
by inserting a blocking waiting state that only returns when the selected minions
are back up (note: this will only work in orchestration states since `manage.up`
needs to run on the master):

.. code-block:: jinja

Wait for salt minion:
loop.until_no_eval:
- name: saltutil.runner
- expected:
- my_minion
- args:
- manage.up
- kwargs:
tgt: my_minion
- period: 3
- init_wait: 3

This will, after an initial delay of 3 seconds, execute the `manage.up`-runner
targeted specifically for `my_minion`. It will do this every `period` seconds
until the `expected` data is returned. The default timeout is 60s but can be configured
as well.

Salting the Salt Master
-----------------------

Expand Down
203 changes: 152 additions & 51 deletions salt/states/loop.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,10 @@

.. versionadded:: 2017.7.0

In both examples below, the execution module function ``boto_elb.get_instance_health``
returns a list of dicts. The condition checks the ``state``-key of the first dict
in the returned list and compares its value to the string `InService`.

.. code-block:: yaml

wait_for_service_to_be_healthy:
Expand All @@ -26,12 +30,40 @@
This state allows arbitrary python code to be executed through the condition
parameter which is literally evaluated within the state. Please use caution.

.. versionchanged:: 3000

A version that does not use eval is now available. It uses either the python ``operator``
to compare the result of the function called in ``name``, which can be one of the
following: lt, le, eq (default), ne, ge, gt.
Alternatively, `compare_operator` can be filled with a function from an execution
module in ``__salt__`` or ``__utils__`` like the example below.
The function :py:func:`data.subdict_match <salt.utils.data.subdict_match>` checks if the
``expected`` expression matches the data returned by calling the ``name`` function
(with passed ``args`` and ``kwargs``).

.. code-block:: yaml

Wait for service to be healthy:
loop.until_no_eval:
- name: boto_elb.get_instance_health
- expected: '0:state:InService'
- compare_operator: data.subdict_match
- period: 5
- timeout: 20
- args:
- {{ elb }}
- kwargs:
keyid: {{ access_key }}
key: {{ secret_key }}
instances: "{{ instance }}"
'''

# Import python libs
from __future__ import absolute_import, print_function, unicode_literals
import logging
import time
import operator
import sys

# Initialize logging
log = logging.getLogger(__name__)
Expand All @@ -48,35 +80,22 @@ def until(name,
m_args=None,
m_kwargs=None,
condition=None,
period=0,
timeout=604800):
period=1,
timeout=60):
'''
Loop over an execution module until a condition is met.

name
The name of the execution module

m_args
The execution module's positional arguments

m_kwargs
The execution module's keyword arguments

condition
The condition which must be met for the loop to break. This
should contain ``m_ret`` which is the return from the execution
module.

period
The number of seconds to wait between executions

timeout
The timeout in seconds
:param str name: The name of the execution module
:param list m_args: The execution module's positional arguments
:param dict m_kwargs: The execution module's keyword arguments
:param str condition: The condition which must be met for the loop to break.
This should contain ``m_ret`` which is the return from the execution module.
:param period: The number of seconds to wait between executions
:type period: int or float
:param timeout: The timeout in seconds
:type timeout: int or float
'''
ret = {'name': name,
'changes': {},
'result': False,
'comment': ''}
ret = {'name': name, 'changes': {}, 'result': False, 'comment': ''}

if m_args is None:
m_args = ()
Expand All @@ -85,39 +104,121 @@ def until(name,

if name not in __salt__:
ret['comment'] = 'Cannot find module {0}'.format(name)
return ret
if condition is None:
elif condition is None:
ret['comment'] = 'An exit condition must be specified'
return ret
if not isinstance(period, int):
ret['comment'] = 'Period must be specified as an integer in seconds'
return ret
if not isinstance(timeout, int):
ret['comment'] = 'Timeout must be specified as an integer in seconds'
return ret
if __opts__['test']:
elif not isinstance(period, (int, float)):
ret['comment'] = 'Period must be specified as a float in seconds'
elif not isinstance(timeout, (int, float)):
ret['comment'] = 'Timeout must be specified as a float in seconds'
elif __opts__['test']:
ret['comment'] = 'The execution module {0} will be run'.format(name)
ret['result'] = None
else:
if m_args is None:
m_args = []
if m_kwargs is None:
m_kwargs = {}

timeout = time.time() + timeout
while time.time() < timeout:
m_ret = __salt__[name](*m_args, **m_kwargs)
if eval(condition): # pylint: disable=W0123
ret['result'] = True
ret['comment'] = 'Condition {0} was met'.format(condition)
break
time.sleep(period)
else:
ret['comment'] = 'Timed out while waiting for condition {0}'.format(condition)
return ret


def until_no_eval(
name,
expected,
compare_operator='eq',
timeout=60,
period=1,
init_wait=0,
args=None,
kwargs=None):
'''
Generic waiter state that waits for a specific salt function to produce an
expected result.
The state fails if the function does not exist or raises an exception,
or does not produce the expected result within the allotted retries.

:param str name: Name of the module.function to call
:param expected: Expected return value. This can be almost anything.
:param str compare_operator: Operator to use to compare the result of the
module.function call with the expected value. This can be anything present
in __salt__ or __utils__. Will be called with 2 args: result, expected.
:param timeout: Abort after this amount of seconds (excluding init_wait).
:type timeout: int or float
:param period: Time (in seconds) to wait between attempts.
:type period: int or float
:param init_wait: Time (in seconds) to wait before trying anything.
:type init_wait: int or float
:param list args: args to pass to the salt module.function.
:param dict kwargs: kwargs to pass to the salt module.function.

.. versionadded:: 3000

'''
ret = {'name': name, 'comment': '', 'changes': {}, 'result': False}
if name not in __salt__:
ret['comment'] = 'Module.function "{}" is unavailable.'.format(name)
elif not isinstance(period, (int, float)):
ret['comment'] = 'Period must be specified as a float in seconds'
elif not isinstance(timeout, (int, float)):
ret['comment'] = 'Timeout must be specified as a float in seconds'
elif compare_operator in __salt__:
comparator = __salt__[compare_operator]
elif compare_operator in __utils__:
comparator = __utils__[compare_operator]
elif not hasattr(operator, compare_operator):
ret['comment'] = 'Invalid operator "{}" supplied.'.format(compare_operator)
else:
comparator = getattr(operator, compare_operator)
if __opts__['test']:
ret['result'] = None
ret['comment'] = ('Would have waited for "{}" to produce "{}".'
''.format(name, expected))
if ret['comment']:
return ret
if not m_args:
m_args = []
if not m_kwargs:
m_kwargs = {}

def timed_out():
if time.time() >= timeout:
return True
return False
if init_wait:
time.sleep(init_wait)
if args is None:
args = []
if kwargs is None:
kwargs = {}

res_archive = []
current_attempt = 0
timeout = time.time() + timeout

while not timed_out():
m_ret = __salt__[name](*m_args, **m_kwargs)
if eval(condition): # pylint: disable=W0123
while time.time() < timeout:
current_attempt += 1
try:
res = __salt__[name](*args, **kwargs)
except Exception:
(exc_type, exc_value, _) = sys.exc_info()
ret['comment'] = 'Exception occurred while executing {}: {}:{}'.format(name, exc_type, exc_value)
break
res_archive.append(res)
cmp_res = comparator(res, expected)
log.debug('%s:until_no_eval:\n'
'\t\tAttempt %s, result: %s, expected: %s, compare result: %s',
__name__, current_attempt, res, expected, cmp_res)
if cmp_res:
ret['result'] = True
ret['comment'] = 'Condition {0} was met'.format(condition)
return ret
ret['comment'] = ('Call provided the expected results in {} attempts'
''.format(current_attempt))
break
time.sleep(period)

ret['comment'] = 'Timed out while waiting for condition {0}'.format(condition)
else:
ret['comment'] = ('Call did not produce the expected result after {} attempts'
''.format(current_attempt))
log.debug('%s:until_no_eval:\n'
'\t\tResults of all attempts: %s',
__name__, res_archive)
return ret
Loading