With 64 subsystems and 2048 namespaces, get_subsystems throws a traceback in the gateway log #306
Comments
As far as I can see, the exception was just in displaying a log message. No need to fail the command for that. We can just move the log message a little forward in the code, enclose it in a "try" block and ignore such errors.
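A minimal sketch of that suggestion, assuming a standard `logging` logger. The function name `get_subsystems_sketch`, the `fetch` callable and the logger name are hypothetical and are not taken from the gateway's grpc.py:

```python
import logging

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("gateway-sketch")


def get_subsystems_sketch(fetch):
    """Return fetch() even if logging the result fails.

    `fetch` is a hypothetical callable standing in for whatever
    retrieves the subsystem list in the real gateway code.
    """
    result = fetch()
    try:
        # The log call sits in its own try block, so an error raised
        # while logging cannot fail the command itself.
        logger.info("get_subsystems: %s", result)
    except Exception:
        # Deliberately ignored: the log line is informational only.
        pass
    return result
```

The trade-off is that a logging failure becomes silent, so this only makes sense for messages that are purely informational.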
@pcuzner could you send us the grpc.py file used here?
@baum , notice that the exception was in:
It's not really I/O, just writing to the log. The strange thing is that this line is inside a "try" block and still we didn't get into the "except" clause.
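One possible explanation, not confirmed anywhere in this thread: Python's standard `logging.StreamHandler` catches exceptions raised while formatting or writing a record and passes them to `Handler.handleError`, which prints a traceback to stderr instead of re-raising. If the gateway logs through such a handler, the caller's "except" clause would never run. The sketch below only demonstrates that standard-library behaviour; the logger name and function are hypothetical:

```python
import io
import logging


def except_reached_on_log_error():
    """Return True if a failing log call reaches the caller's except clause."""
    stream = io.StringIO()
    log = logging.getLogger("swallow-sketch")  # hypothetical logger name
    log.setLevel(logging.INFO)
    log.propagate = False
    handler = logging.StreamHandler(stream)
    log.addHandler(handler)
    try:
        # "%d" with a string argument makes message formatting fail
        # inside the handler's emit().
        log.info("%d subsystems", "not-a-number")
        return False
    except Exception:
        return True
    finally:
        log.removeHandler(handler)
```

With the default `logging.raiseExceptions` setting, the handler prints a "--- Logging error ---" traceback to stderr and the function returns `False`, meaning the "except" branch was not taken.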
This is against registry-proxy.engineering.redhat.com/rh-osbs/ceph-nvmeof:0.0.4-1
Note that this is also seen with 0.0.5, and with gateway configurations that have higher numbers on smaller subsystems. I noticed the error at 94 subsystems and 376 namespaces, for example, so this issue is not limited to configurations with large numbers of namespaces.
@pcuzner can you specify which CLI command exactly you used to create all these subsystems and namespaces? How many bdevs did you create?
I added the script I used to the downstream BZ: https://bugzilla.redhat.com/show_bug.cgi?id=2247718. The attachment is called "scaling script". I've been running the scale tests in multiple dimensions, from a small subsystem count with many namespaces per subsystem through to a high subsystem count with few namespaces per subsystem. The memory and open files issue is the same due to the librbd client creation.
@pcuzner I couldn't reproduce the issue, and in the meantime there was a major change to the CLI code, so I'm closing this for now. Please re-open if you see it again with the current code.
At this scale, when I issue a get_subsystems I can see the following in the gateway log