Test eval pipeline - no merge #6879

Draft: wants to merge 1 commit into base: main

mamoodi (Collaborator) commented Feb 21, 2025

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

End-user friendly description of the problem this fixes or functionality that this introduces


Give a summary of what the PR does, explaining any non-trivial design decisions


Link of any specific issues this addresses


To run this PR locally, use the following command:

docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:83d31ee-nikolaik \
  --name openhands-app-83d31ee \
  docker.all-hands.dev/all-hands-ai/openhands:83d31ee
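The same tag, 83d31ee, appears three times in the command above: in the runtime image, the container name, and the app image. As a rough sketch only, the Python helper below assembles the command from a single tag so the three stay in sync. The function name `build_run_command` and the `runtime_suffix` parameter are hypothetical and not part of the project, and treating the tag as a build identifier shared by all three values is an assumption drawn from this one command.

```python
from __future__ import annotations

import shlex

# Registry prefix copied from the command in this PR description.
REGISTRY = "docker.all-hands.dev/all-hands-ai"


def build_run_command(tag: str, runtime_suffix: str = "nikolaik") -> list[str]:
    """Return the docker argv for running a PR build locally.

    Hypothetical helper: it only mirrors the command shown in this PR and
    assumes the same tag is reused for the runtime image, the container
    name, and the app image.
    """
    runtime_image = f"{REGISTRY}/runtime:{tag}-{runtime_suffix}"
    return [
        "docker", "run", "-it", "--rm",
        "-p", "3000:3000",
        "-v", "/var/run/docker.sock:/var/run/docker.sock",
        "--add-host", "host.docker.internal:host-gateway",
        "-e", f"SANDBOX_RUNTIME_CONTAINER_IMAGE={runtime_image}",
        "--name", f"openhands-app-{tag}",
        f"{REGISTRY}/openhands:{tag}",
    ]


if __name__ == "__main__":
    # Print a shell-quoted command line that can be pasted into a terminal.
    print(shlex.join(build_run_command("83d31ee")))
```

Returning an argument list rather than a single string lets the result go straight to `subprocess.run` without shell quoting concerns.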

mamoodi added the run-eval-xs label (Runs evaluation with 1 instance) Feb 22, 2025
Contributor

Running evaluation on the PR. Once eval is done, the results will be posted.

mamoodi (Collaborator, Author) commented Feb 22, 2025

Evaluation results (Auto Reply):

Summary

  • submitted instances: 1
  • empty patch instances: 0
  • resolved instances: 1
  • unresolved instances: 0
  • error instances: 0
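For readers who want to consume this summary programmatically, here is a minimal sketch that parses the five counts above into a dictionary and derives a resolve rate. The names `parse_summary` and `resolve_rate` are hypothetical. The line format is assumed from this single auto-reply, and defining the rate as resolved divided by submitted is also an assumption rather than the eval pipeline's documented metric.

```python
from __future__ import annotations

import re

# The counts exactly as they appear in the auto-reply above.
SUMMARY = """\
submitted instances: 1
empty patch instances: 0
resolved instances: 1
unresolved instances: 0
error instances: 0
"""

# Matches lines like "resolved instances: 1", with an optional leading bullet.
LINE_RE = re.compile(r"^\s*(?:[•*-]\s*)?([a-z ]+?) instances:\s*(\d+)\s*$")


def parse_summary(text: str) -> dict[str, int]:
    """Map each category name (e.g. 'resolved') to its integer count."""
    counts: dict[str, int] = {}
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            counts[match.group(1)] = int(match.group(2))
    return counts


def resolve_rate(counts: dict[str, int]) -> float | None:
    """Resolved / submitted (assumed definition); None if nothing was submitted."""
    submitted = counts.get("submitted", 0)
    if submitted == 0:
        return None
    return counts.get("resolved", 0) / submitted


if __name__ == "__main__":
    counts = parse_summary(SUMMARY)
    print(counts)
    print(resolve_rate(counts))
```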
