-
Notifications
You must be signed in to change notification settings - Fork 229
Fix missing ReplicaIndexLabel when using RunLauncherAsWorker #690
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix missing ReplicaIndexLabel when using RunLauncherAsWorker #690
Conversation
cc: @tenzen-y |
Thank you for fixing this. At first glance, fixing looks reasonable. Let me review this more deeply, later. |
cc: @mimowo |
@GonzaloSaez could you sign CLA? |
LGTM |
90a4857
to
686009e
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Everything looks great to me. Thank you!
@GonzaloSaez I fixed the CI errors in a separate PR.
So, could you rebase this PR?
After that we can merge this PR, thanks.
Signed-off-by: GonzaloSaez <[email protected]>
686009e
to
55ad4cb
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you!
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: tenzen-y The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Kueue's TAS expects all pods in a Pod Group to have the pod index label. Else it starts printing out annoying errors like
failed to read rank information from Pods
. The latter happens when using RunLauncherAsWorker. To fix this we need to add the pod index label to the launchers' pod.