-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integration tests (openhands fix issue 5076) #8
Conversation
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #8)
You can download the full evaluation outputs here. |
@openhands-agent Make the integration-runner workflow work also on schedule, a nightly schedule. |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly. |
Trigger by: Pull Request (integration-test label on PR #8)
Integration Tests Report (DeepSeek)
Download evaluation outputs (includes both Haiku and DeepSeek results): Download |
b37602a
to
0c22181
Compare
…nhands-fix-issue-5076
* Fix issue All-Hands-AI#5076: Integration test github action * Update integration-runner.yml * Update integration-runner.yml * update variables * use haiku * use base url * fix report name * Fix pr #8: Integration tests (openhands fix issue 5076) * Revert "Fix pr #8: Integration tests (openhands fix issue 5076)" This reverts commit dcd4681. * Fix pr #8: Integration tests (openhands fix issue 5076) * use haiku explicitly, in results too * remove duplicate * Update .github/workflows/integration-runner.yml * Revert "Update .github/workflows/integration-runner.yml" This reverts commit 7e7200e. * funny space * Fix pr #8: Integration tests (openhands fix issue 5076) * artifact fix * clean up remote runtimes * clean up runtimes more aggressively - a bit unexpected though * Fix pr #8: Integration tests (openhands fix issue 5076) * fix type issue that was preventing checking results * try with waiting time * add eval notes * increase timeouts * try with CI local builds * fix eval output * set debug * fix tests! * fix outputs * keep details in logs, not github comment * tweak schedule * lint-y --------- Co-authored-by: openhands <[email protected]>
End-user friendly description of the problem this fixes or functionality that this introduces
Give a summary of what the PR does, explaining any non-trivial design decisions
Link of any specific issues this addresses