-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
random test errors (SSL) ... squeak-4.5 #303
Comments
The root cause of this problem possibly lies in the Gofer implementation on Squeak (i.e., not catching and propagating the error as a GoferRepositoryError) or something else ... of course the response to the error should be: RETRY... |
Well, mystery somewhat resolved as testIssue234a isn't the only test susceptible to the problem:
|
Now it seems that these Squeak-4.5 test failures are not random |
@timfel, @frankshearar, @krono, I'm getting consistent SSL errors running these tests against Squeak-4.5, that are not seen in Squeak4.4 or Squeak-Trunk, so it looks like some sort of SSL-related problem that is only affecting Squeak-4.5 and only for those tests ... there are other tests that make use of https: calls so it's a bit of a mystery to me ... For this one, I'll make these expected failures for Squeak-4.5 (if I can) as they appear to fail pretty reliably, but I'm concerned that folks actually using Squeak-4.5 may start running into these issues as well |
@timfel, @kronos, another note in case you missed the previous comment (spelling errors on my part)...anyway it looks like the webclient ssl interface is consistently failing for certain github references (and not for others) and the tests are only failing for Squeak-4.5 ... passing for Squeak-4.4 and Squeak-Trunk ... anyway I've put expectedFailures in so that I can at least make sure other tests are passing for Squeak-4.5 ... so green for 4.5 does not mean that things are in good shape:) |
…cumentationIssue196TestCase>>testLockCommandReference2)
Another case popped up:
So the SSL failures are random ... |
And the failures are randomly erroring on multiple Squeak-4.4 as well ...and of course sometimes these guys pass and the test fails with unexpected pass ... I will have to wire out these tests completely for the Squeak platform until this SSL flakiness is fixed |
…omly ... just ignore `SSL error, -5` errors
clean sheet ... with SSL test failure ignored |
Oh dear. Thanks for noticing. I noticed inconsistencies in the SSL code lately. Btw: not helping at all, but error code -5 is |
If SSL isn't stable, perhaps we should consider reverting back to using On Sun, Oct 19, 2014 at 4:14 AM, Tobias Pape [email protected]
|
What is used on Pharo? |
Hmmm, Pharo's using zinc/zodiac for https and presumably it uses the squeakSSL plugin, but perhaps the internal error handling is different? ... I recently renamed MetacelloSqueakPlatform>>downloadZipArchive:to:, but other than that haven't made any other chages. In retrospect, the random failures reported in Issue #286 are probably due to randome SSL failures and that bug report dates back to the time when we switched to using WebClient ... |
@dalehenrich while trying to reproduce the problems, using OS X I remembered that SqueakSSL on OS X is not yet able to verify certifictates resulting in errors like this:
Probably curl on Linux/OS X is the feasible option. I don't know why things work on Pharo, tho. Probably the errorhandling in Gofer is different there… |
PS: I just ran the test on OS X and they don't fail. Back to random :( |
The problem with SSL and certificates on the mac is a "known problem with mavericks" ... to get the ssl certificates registered you have to hit the site using safari first ... safari and the mavericks ssl client will do the right thing ... I recall having to do something like this to get curl to work on the mac ... The Ssl error I am seeing now occurs randomly ... Gofer has nothing to do with the error handling ... I'm assuming that when WebClient makes it's https calls it is not handling the ssl error the same way that Zinc/Zodiac on the web does ... perhaps the Pharo https code does an internal retry if it hits an error like this ... I don't quite understand why Metacello is not retrying this error, but the fact is I'm not seeing this kind of behavior on any other platform and I'm consistently (randomly) seeing this happen on all Squeak versions including Squeak-4.4 and the only big difference is WebClient ... and I didn't see this issue prior to introducing WebClient ... so if there is a bugfix I would think the fix needs to go into WebClient ... and if no fix is possible there, then I guess we'll have to revert back to curl? |
On 19.10.2014, at 21:02, Dale Henrichs [email protected] wrote:
Apart from that, SSL cert verification is just not implemented. See
Probably.
I think reverting back to curl for the time being is the viable option. |
Well I was hoping to put my Metacello work to bed today/tomorrow and move on to other things so if you guys could figure out which direction you wanted to go and give me a pull request ... I'm not even sure how much usage the github stuff gets in Squeak ... the "flakey" SSL code was released on the master branch at the end of August so perhaps it is better to stick with the flakey ssl rather than revert to curl which is know to be problematic ... |
The thing with curl is that that won't work for Windows users. Using WebClient and the fix in dalehenrich/filetree#130 allows me to have Babelsberg/S load on Windows without issues. |
I guess I'm inclined to have WebClient work (then I can release Metacello) ... agree that curl is a step backward .. but if random github failures are a problem then we're stuck between a rock and hard place:( |
I think that buidlerCI is using a fairly old squeak vm ... for pharo, the vm is downloaded and installed along with the images ... squeak is just downloading the image, changes and sources files ... That may explain the SSL issues (that only travis sees occasionally) ... builderCI supports Squeak-4.3 on so perhaps builderCI should download an officially supported vm? |
Interestingly, builderCI uses the same VM (and SqueakSSL plugin) the pharo image uses. There should not be an issue; there is no blessed vm for Squeak atm. For the all-in-one-images, we use the at that time current official Cog VM, which is naturally different from the Pharo VM. So, http://mirandabanda.org/files/Cog/VM/ is the right source for the VM. |
Well, Pharo2.0 and beyond are using the vm that is downloaded from the Pharo provides some fairly convenient download scripts for picking up the Dale [1] On Mon, Oct 20, 2014 at 3:37 PM, Tobias Pape [email protected]
|
I understand. My preference would be to (as it currently is) use the Pharo VM which would be used for pharo images |
I don't quite understand ... the is code (the reference that I linked) I'd prefer a pull request from you where you verify that things are working On Mon, Oct 20, 2014 at 3:49 PM, Tobias Pape [email protected]
|
Ah, now I get it. Sorry, I am slow today. You are perfectly right. Let's see what I can craft… |
I'm still fighting problems raised when I did the simple step of merging On Mon, Oct 20, 2014 at 3:56 PM, Tobias Pape [email protected]
|
Course I had a flat time in the middle of the day as well:) So I'm a bit On Mon, Oct 20, 2014 at 4:00 PM, Dale Henrichs <
|
Understandable. |
Just out of curiosity have you seen the SSL -5 errors coming from github On Mon, Oct 20, 2014 at 4:04 PM, Tobias Pape [email protected]
|
Its incomparable, as I currently only use OS X and travis runs linux. I have not encountered a -5 on my local machine… |
I just don't know where the bug lies: vm or webClient ssl support... both On Mon, Oct 20, 2014 at 4:11 PM, Tobias Pape [email protected]
|
turns out that Pharo-1.3 does not work properly with the newer pharo vms On Mon, Oct 20, 2014 at 4:19 PM, Dale Henrichs <
|
oddly enough Pharo1.2 works with newer pharo vms (at least it doesn't hit On Mon, Oct 20, 2014 at 4:23 PM, Dale Henrichs <
|
Merge last of bugfixes for 1.0.0-beta.32.16. I'm going to go ahead and merge this pull request despite the existence of Issues with SSL on Squeak platforms: Issue #305 and Issue #303 ... the final resolution for these particular issues are still up in the air and I don't want to dely the release of 1.0.0-beta32.16 any longer ... will push bugfixes for those issues when we got things characterized correctly
Just found out Zinc/Zodiac does not verify certs at all… |
They must be handling those errors internally ... along with possibly Error On Tue, Oct 21, 2014 at 4:54 PM, Tobias Pape [email protected]
|
This error is just not checked.
Zodiac does:
… |
Ah, now we are cooking with gas:) On Tue, Oct 21, 2014 at 5:35 PM, Tobias Pape [email protected]
|
I had the -5 once on linux but everytime I enabled logging, it disappeared. |
I tweaked on webclient. can you retry? |
testing as we speak |
clean sheet on travis ... no problems on this end ... |
🎈 |
MetacelloScriptingIssuesTestCase>>testIssue234a has failed again:
So the randomness is coming from a random ssl error when trying to connect to https ... I will have to try to understand when this error is not being treated like any other error ... which is supposed to lead to a retry?
Odd that only MetacelloScriptingIssuesTestCase>>testIssue234a seems susceptible.
The text was updated successfully, but these errors were encountered: