-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dynamic service change output file, file gets uploaded, but never gets downloaded by target service. #5741
Comments
@wvangeit there is another option to transfer files/number/strings/(maybe objects) via the ZeroMQ facilities that @GitHK added between these 2 services that you have that are actually. They are in a feedback loop right? |
Yes, I'm very aware of the zeromq facilities, I have several prototypes using these. But this system has its own issues. But oth, I also think that we have to make file communications more stable. This is basically affecting all the dynamic services (at least), so it's not just me seeing it. I remember Taylor already being happy I made this to make things slightly more deterministic: https://github.com/wvangeit/osparc-filecomms |
This issue has a partial workaround, in the sense that if the code that sends a file keeps writing the file in regular intervals, it eventually gets picked up by the system. However, this obviously doesn't work if one can't control the code writing the file (file pickers, other services, etc) |
I'm moving this to 'Error without workaround', because this is still happening too often. |
@GitHK If you need an example of this (incoming service is file picker, but behavior is the same) This same worked twice before, but in the run of around 11:02 25 Jun, suddenly the parallelrunner didn't receive it's configuration file, although it's clearly present in the filepicker: Service can't the file (parallelrunner.json) FYI @pcrespov |
(And also FYI, it just reran the exact same study again as the one above, and now it worked, so there was no issue with the study itself) |
@wvangeit I see that this started and received 3 calls to pull the inputs. Unfortunately there are no further logs of exactly what was pulled. Maybe that can be added for future debugging. |
So your log lines
Afterwards it just sits there as you described. |
Is the improved logging already available in staging? I just happened again around 10:50 Jul 1, for study ID 80471a7a-11d3-11ef-9cb8-02420a0803ee |
@wvangeit Yes, it was released to staging. Let me have a look and report back |
replied privately, looks like some path mismatch. |
Just investigated this with @GitHK . The very last issue I mentioned is apparently a currently known frontend issue on osparc staging, which is currently being investigated. None of the inputs of a service are not there, because the service is still in the 'connecting' state due to the bug. For reference, this 'only' affects the last example I provided. All the other examples from above are still valid. |
so since this is unrelated, let me know if you can find one where it is actually running and you have you see the issue. |
@wvangeit is this still an issue? |
It's a bit difficult for me to tell atm, because my osparc-filecomms lib tries to put a layer on top to hide this problem. I assume ESB input/output folders would also solve this issue at some point. For me you can close it for now, and i can reopen when i see another case of this happening. |
ok let's close it for now |
What happened rarely on my local machine deployment, became more problematic on osparc-master.
Sometimes when a service (notebook, python runner, etc) writes a file to the output folder (port), the log shows that the file got uploaded.
However, the file on the receiving end doesn't get downloaded (it's not present in the input folder, and there is no log message by the sidecar saying that the file got downloaded)
This is a flaky issue. The same scripts work many times, but get this particular issue from time to time, at different stages of file interactions.
The text was updated successfully, but these errors were encountered: