-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance degradation on tunneled connection #115
Comments
Any chance you can narrow down the version range? That would really help. |
@bAndie91 , @patduin: I did some investigation and found this: Test Case 1 - listTables on non-tunneled connection
Test Case 2 - listTables on tunneled connection
Summary IMPORTANT: The performance degradation does not seem to be specific to tunneled connection, the same trend can be observed in both cases. |
Excellent work! |
@bAndie91 can you please answer the question above? I also checked the spark logs if they can contain any unusual error. This might be also interesting. |
I think I know what is going on. A change I made related to #73 . Does an extra call to verify the connection is open. |
Can I ask you to try and build/run this branch: https://github.com/HotelsDotCom/waggle-dance/tree/issue-115 |
@patduin Sure, will check that branch and get back to you with the results. |
@patduin all the metastore connections are AVAILABLE during the tests run. |
Re-run the test cases on the version built from issue-115 branch and added the results to the charts. Test Case 1 - listTables on non-tunneled connection Test Case 2 - listTables on tunneled connection I can confirm that the fix resolves the performance degradation issue. Also the RetryingMetaStoreClient:184 - MetaStoreClient lost connection. Attempting to reconnect. warning disappeared from the logs. |
ok thanks really helpful! We'll need to find some other way to fix #73 without introducing the performance hit. Not sure yet how but at least we know what is going on :) |
cheers, let us know when the fix is available, we are happy to take a quick look at the performance. |
Will do and thanks! |
@rambrus I've updated the branch, I've managed to avoid the issue for normal connections but you'll see the degradation in tunneled connections still. I haven't found a way to work around this without sacrificing functionality. Would be great if you could test this. We could at least release this and if the performance is a big issue focus on that in some future PR. |
@patduin sure, will take a look and get back to you with results. |
yeah I can't really account for that. We merged the PR with the changes and try to make a release this week. |
This is adressed in 2.4.2 release, if the performance is still an issue please reopen or open a new ticket, closing this. |
I've experienced performance degradation when upgraded from 2.2.2 to 2.3.7.
see measurements in attachment which was made by Spark application calling
spark.catalog.listTables()
.newer WD is 3 times slower impacting the ssh-tunneled connections (see highlighted rows) the most.
how much can it be eliminated?
The text was updated successfully, but these errors were encountered: