-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Memory regression with 3002 #58791
Comments
@cachedout as requested on slack... |
@ixs Thanks for the report. Can you provide an example of the state where you're seeing these slow times? |
@garethgreenaway not really. It's a highstate run with a bunch of basic states plus third party formulas and a lot of pillars. I'm trying to recreate the problem inside a venv right now so that I can come up with a test for git bisect... |
Deleted the last two comments because I was following down a wrong path there looking at runtime. I wrote a quick shell script that bascially does the following: python setup.py sdist Then compare the memory consumption of both. 3001 was always under 110MB and 3002 was always above. Git bisect says, 5c15def is the first bad commit. comparing the previous commit and that one shows a striking difference in memory consumption:
|
It might be worthwhile to check that #58785 and this issue are not about the same underlying issue. Increased memory consumption (90MB vs. 200MB) on a 10min runtime is pretty extreme and could be seen as a memory leak... |
Running a second
|
For reference: Test scripts, git bisect built RPMS and the logs containing runtime info from GNU time are at https://github.com/ixs/salt-58791 |
Reverting the following changes brings memory consumption back down to 90MB.
I'd recommend rolling a 3002.1 release ASAP. RPMS and a python sdist tarball and logs is at https://github.com/ixs/salt-58791/tree/master/archive/fba062ce8e96d9fe657fdc7cc8ce808cc7a4aacd |
@OrangeDog @whytewolf @tuxthepenguin84 @Arderos @jbouter Have a look at https://github.com/ixs/salt-58791/tree/master/archive/fba062ce8e96d9fe657fdc7cc8ce808cc7a4aacd and feel free to try out the centos7 RPMS there or the salt tarball. |
@garethgreenaway I agree this it is likely that this is the same issue as #58785.
As you can see, I was running the code with the fix from #58811 from 16:30 to about 22:30... 🤨 The problem is not just the maintenance process. In my case the processes being killed are all MWorkers... The maintenance process is fine. |
Isn't this just a string matching issue? Since you've changed the loaded_base_name but you were/are testing if modules |
@dwoz thanks for the headsup. Let me pull that change into my local tree and test... |
@dwoz my testsuite is super slow and times out with your code... |
@ixs Can you confirm this has been resolved with 3002.2? |
Closing this out with the assumption that the fix went into 3002.2. If the issue persists please let us know. |
Description
After upgrading 3001 to 3002 massively increased runtime for highstates has been observed.
A 3001 minion highstate:
The same host once the minion and the master have been upgraded to 3002:
The failure was a service error:
Possibly unrelated...
Setup
Master and Minion are using the same stack, when running on 3002 the libraries are unchanged.
Downgrading to 3001 resolves the issue.
The text was updated successfully, but these errors were encountered: