-
Notifications
You must be signed in to change notification settings - Fork 28.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-3481] [SQL] Eliminate the error log in local Hive comparison test #2352
Conversation
QA tests have started for PR 2352 at commit
|
QA tests have finished for PR 2352 at commit
|
@chenghao-intel Actually this issue has bothered us for some time, and makes the Maven build on Jenkins fail. But we had never reproduce it locally... Would you mind to elaborate on the exact reproduction steps? Details like Maven profiles and other parameters would be greatly helpful. Thanks! |
I got the latest master and run
|
Hmm... I couldn't reproduce the Just to make sure I understand this correctly: so the
which overrides the random temporary directories specified by The weird thing is that this part of code has existed for a long time (ever since Spark SQL became part of Spark), but it never fails :( While debugging this "default" database missing issue, I also observed some race condition and execution order related issue, which maybe the reason that this bug has been covered for so long a time... Anyway, this PR LGTM. Thanks for fixing this! @marmbrus Let's see whether this can bring our Jenkins Maven build back! |
Yes, I think you understand correctly, but I am not sure why the unit test passed with Jenkins previously. Probably the multithreading stuff did the tricky. Let's see if this fix will help. |
Actually the SBT Jenkins build is still alright, it's the Maven build that is broken, that's even stranger, since you can easily reproduce it with SBT... |
Got more clue on this, which explains why
In a word, the TL;DR here is, previously, testing databases and testing tables created by test suites inherited from |
If you run |
Thanks for finding this! I've merge to master and 1.1 and 1.0. @JoshRosen I think this should fix the Jenkins errors. Please let me know if SQL is responsible for any more failures. |
Logically, we should remove the Hive Table/Database first and then reset the Hive configuration, repoint to the new data warehouse directory etc. Otherwise it raised exceptions like "Database doesn't not exists: default" in the local testing. Author: Cheng Hao <[email protected]> Closes #2352 from chenghao-intel/test_hive and squashes the following commits: 74fd76b [Cheng Hao] eliminate the error log (cherry picked from commit 8194fc6) Signed-off-by: Michael Armbrust <[email protected]>
This is a follow up of #2352. Now we can finally remove the evil "MINOR HACK", which covered up the eldest bug in the history of Spark SQL (see details [here](#2352 (comment))). Author: Cheng Lian <[email protected]> Closes #2377 from liancheng/remove-evil-minor-hack and squashes the following commits: 0869c78 [Cheng Lian] Removes the evil MINOR HACK
Thank you @liancheng for so detailed explanation. Actually I didn't know those while submitting this PR. :) |
a follow up of #2377 and #2352, see detail there. Author: wangfei <[email protected]> Closes #2505 from scwf/patch-6 and squashes the following commits: 4874ec8 [wangfei] removes the evil MINOR HACK
Logically, we should remove the Hive Table/Database first and then reset the Hive configuration, repoint to the new data warehouse directory etc.
Otherwise it raised exceptions like "Database doesn't not exists: default" in the local testing.