-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: eval-this workflow is not working #5106
Comments
The
I can confirm that the new
I felt like we need the integration tests in their new form to be back in working state. We have removed them at some point from ./tests, and refactored them as external scripts like evals, also using Deepseek, but they weren't working either. What do you think about this? Could we have a nightly for them - and maybe a label too, just in case needed -, also with Haiku? IMHO it would be cool if we can also have a nightly on Deepseek or something. Because
These integration tests are just like 6 tests currently (and I'm working on a seventh), but they do try to cover some things in real-like use that we just don't have coverage elsewhere. |
Oh, also: at this time, the Deepseek API key defined on this repo is depleted. I doubt it's the original reason why the eval workflow wasn't working, but it looks like the first reason currently. 😅 Cc: @neubig |
This is my proposal on these: (source) |
Is there an existing issue for the same bug?
Describe the bug and reproduction steps
Currently the eval-this workflow is not working, so we should fix it.
Idea:
@csmith49 will take a look at this
OpenHands Installation
Docker command in README
OpenHands Version
No response
Operating System
None
Logs, Errors, Screenshots, and Additional Context
No response
The text was updated successfully, but these errors were encountered: