Submissions randomly go missing #194
Comments
Temporarily alleviates #194 by not hanging while waiting for the file to appear (it had entered race-condition avoidance, which blocked everything). We also need to do something about the HTTP 400 on compilation_update, because we don't want to recycle jobs that "fail" because of a resubmit.
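For illustration, "not hanging while waiting for the file to appear" could be done with a bounded poll instead of an open-ended wait. This is only a sketch, not the actual patch: `wait_for_file`, the `exists` callback, and the timeout values are all hypothetical names and numbers.

```python
import time
from typing import Callable


def wait_for_file(exists: Callable[[], bool],
                  timeout_s: float = 5.0,
                  poll_s: float = 0.1) -> bool:
    """Poll `exists` until it returns True or `timeout_s` elapses.

    `exists` is a hypothetical callback that checks whether the source
    file is present (for example, in the storage bucket).
    Returns True if the file appeared, False if we gave up.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if exists():
            return True
        if time.monotonic() >= deadline:
            # Give up instead of blocking every other job behind this one.
            return False
        time.sleep(poll_s)
```

The caller can then treat a `False` result as a failed job rather than stalling the whole worker.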
Is this not the race condition? (I'll keep digging.)
Thinking about it, it might be good to separate the bucket part and the pub/sub part of the compilation code into two try/catch blocks.
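A rough sketch of what that split could look like, so that a bucket failure and a pub/sub failure are reported separately. The three callbacks (`upload_source`, `publish_job`, `mark_failed`) and the return strings are assumptions for illustration, not names from the actual backend.

```python
from typing import Callable


def submit_for_compilation(upload_source: Callable[[], None],
                           publish_job: Callable[[], None],
                           mark_failed: Callable[[str], None]) -> str:
    """Run the bucket step and the pub/sub step in separate try blocks."""
    # Step 1: write the source to the bucket (hypothetical callback).
    try:
        upload_source()
    except Exception as exc:
        mark_failed(f"bucket upload failed: {exc}")
        return "upload_failed"

    # Step 2: only publish once the upload is known to have succeeded,
    # so the compile server is never handed a job with no source.
    try:
        publish_job()
    except Exception as exc:
        mark_failed(f"pub/sub publish failed: {exc}")
        return "publish_failed"

    return "queued"
```

With the two steps separated, the logs show which half failed, and nothing is published for a submission whose upload never landed.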
I think our race condition had been "fixed" infrastructure-side when this happened. Even so, I think the backend should be agnostic to the race condition, and doesn't it affect the compile server more? I hypothesise that this has something to do with submitting twice in a row; potentially it is another race condition inside the backend/db. E.g. 484 is missing and was submitted by the same team as 482-483.
This is causing many problems with the compilation queue and needs to be fixed as soon as possible.
Temporarily alleviates #194 by not allowing the queue to explode in size while jobs are continually recycled into it.
Cause appears to be HTTP 500 on |
A sketchy but effective fix:
|
Justify / display the reason as "can't submit while prev submission processing" (as opposed to compiling, etc.).
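The kind of guard described here (refusing a new submission while the same team's previous one is still in flight, and saying why) might look roughly like the following. This is not the patch that was applied; the status names and `can_submit` are hypothetical.

```python
from typing import Optional, Tuple

# Hypothetical status values for a submission that is still in flight.
IN_FLIGHT = {"queued", "processing"}


def can_submit(previous_status: Optional[str]) -> Tuple[bool, str]:
    """Decide whether a team may submit, given its latest submission's status.

    Returns (allowed, reason); `reason` is the message to show the user
    when the submission is refused.
    """
    if previous_status in IN_FLIGHT:
        return False, "Can't submit while the previous submission is processing."
    # No previous submission (None) or a finished one: allow.
    return True, ""
```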
@zoemarschner @arvid220u let's make the sketchy fix for now; a more robust solution is better in the long term and probably isn't difficult to carry out.
Looking at server logs from last night, sometimes it actually still does reach the pub/sub and compile server, so there may be more than one issue. We will likely need a more robust solution to deal with this one, because the sketchy patch doesn't save the compile server from being loaded with non-existent submissions.
The compile server is now configured to retry things that are bad and ignore things that are really bad, so this is no longer a big problem.
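As a sketch of the "retry bad, ignore really bad" policy, with a cap on attempts so that recycled jobs cannot pile up in the queue: the exception classes, the `Action` enum, and the `max_attempts` default below are all assumptions, not the compile server's actual configuration.

```python
from enum import Enum
from typing import Callable


class Action(Enum):
    ACK = "ack"      # job finished, remove it from the queue
    RETRY = "retry"  # put the job back on the queue
    DROP = "drop"    # give up on the job without requeueing


class TransientError(Exception):
    """Hypothetical: a 'bad' failure that may succeed on a later attempt."""


class PermanentError(Exception):
    """Hypothetical: a 'really bad' failure, e.g. the source does not exist."""


def handle_job(run: Callable[[], None], attempts: int, max_attempts: int = 3) -> Action:
    """Run one job; `attempts` counts the tries made before this one."""
    try:
        run()
    except PermanentError:
        return Action.DROP
    except TransientError:
        # Retry only while under the cap, so the queue cannot grow forever.
        return Action.RETRY if attempts + 1 < max_attempts else Action.DROP
    return Action.ACK
```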
@j-mao I saw some more submissions that went missing (unfortunately I forget the IDs now); these were also a bunch in a row by the same team, with only the last one making it. This may be an issue worth looking into more.
E.g. submissions 6250-6258 don't have data, with only 6259 having a bucket; these were all submissions by the same team, all within a couple of seconds of each other. Is this behavior intended? And even if not, is it acceptable?
For some reason, the source code for submissions 316, 318, 325, and 328 doesn't exist, but they are still in the database and cause trouble for the servers. We should investigate why these submissions disappeared.
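To help with that investigation, a small helper could compare the submission IDs in the database against the IDs that actually have source in the bucket, and group the missing ones into consecutive runs. This is a sketch only: `missing_runs` is a hypothetical name, and fetching the two ID lists is left to whatever the backend already uses.

```python
from typing import Iterable, List, Tuple


def missing_runs(db_ids: Iterable[int],
                 bucket_ids: Iterable[int]) -> List[Tuple[int, int]]:
    """Return inclusive (first, last) runs of IDs in the DB but not the bucket."""
    present = set(bucket_ids)
    missing = sorted(set(db_ids) - present)
    runs: List[Tuple[int, int]] = []
    for sid in missing:
        if runs and sid == runs[-1][1] + 1:
            # Extends the current consecutive run.
            runs[-1] = (runs[-1][0], sid)
        else:
            runs.append((sid, sid))
    return runs
```

Long runs that end just before a successful submission would be consistent with the rapid-resubmit pattern described in this thread.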