KEP-4816 update for beta in 1.34 #5261

mortent · 2025-04-27T22:37:44Z

One-line PR description: Update KEP to prepare for beta in 1.34
Issue link: DRA: Prioritized Alternatives in Device Requests #4816
Other comments: Some of the material is already covered in KEP-4381: DRA Structure Parameters, so that KEP is referenced in some places.

k8s-ci-robot · 2025-04-27T22:37:50Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: mortent
Once this PR has been reviewed and has the lgtm label, please assign sanposhiho, soltysh for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/prod-readiness/OWNERS
keps/sig-scheduling/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mortent · 2025-04-28T15:22:20Z

keps/sig-scheduling/4816-dra-prioritized-list/README.md

-
-Scheduling a claim that uses this feature may take a bit longer, if it is
-necessary to go deeper into the list of alternative options before finding a
-suitable device. We can measure this impact in alpha.


I'm not sure we can easily measure this, as it largely depends on the structure of the ResourceClaim.

But in general each subrequest requires the same amount of work as a regular request, so a request with two subrequests will in the worst case take about twice as long as a single request. The maximum number of subrequests for each request is 8, so in the worst case, where none of the eight subrequests succeed, it would take about 8 times longer than a normal request.

Since the subrequests are tried in priority order, the extra work is only needed when the earlier subrequests cannot be satisfied, that is, in a situation where using just a single request would have failed to allocate devices for the request.
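To make the cost argument concrete, here is a minimal sketch of trying subrequests in priority order and counting how many had to be evaluated. It is an illustration only: `SubRequest`, `firstAvailable`, and the per-class free-device map are hypothetical simplifications, not the real `resource.k8s.io` types or the scheduler's allocator.

```go
package main

import "fmt"

// SubRequest is a hypothetical, simplified stand-in for one alternative in a
// prioritized list. It is NOT the real resource.k8s.io API type.
type SubRequest struct {
	Name        string
	DeviceClass string
	Count       int
}

// firstAvailable tries the subrequests in priority order against a map of
// free device counts per device class (an assumed, simplified inventory).
// It returns the first subrequest that fits and the number of subrequests
// that had to be evaluated. In the worst case, attempts == len(subs), which
// per the discussion above is capped at 8 subrequests per request.
func firstAvailable(subs []SubRequest, free map[string]int) (SubRequest, int, bool) {
	attempts := 0
	for _, s := range subs {
		attempts++
		if free[s.DeviceClass] >= s.Count {
			return s, attempts, true
		}
	}
	return SubRequest{}, attempts, false
}

func main() {
	subs := []SubRequest{
		{Name: "large-gpu", DeviceClass: "large", Count: 1},
		{Name: "small-gpu", DeviceClass: "small", Count: 2},
	}
	// No large devices are free, so the second alternative is needed.
	free := map[string]int{"large": 0, "small": 4}

	chosen, attempts, ok := firstAvailable(subs, free)
	fmt.Println(chosen.Name, attempts, ok) // prints: small-gpu 2 true
}
```

If the first subrequest fits, `attempts` is 1 and the cost matches a plain request; the extra evaluations only happen when a single plain request for the first alternative would have failed.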

jackfrancis · 2025-04-28T23:13:40Z

keps/sig-scheduling/4816-dra-prioritized-list/README.md

@@ -1010,7 +1017,6 @@ ensure they are handled by the scheduler as described in this KEP.
 #### Beta

 - Gather feedback
- Implement node scoring


Looks like we initially thought we'd pick this up in the beta cycle but now we're kicking it out of the Prioritized Alternatives scope altogether?

I'm thinking specifically about the comment above the DeviceRequest.FirstAvailable property.

// DRA does not yet implement scoring, so the scheduler will
// select the first set of devices that satisfies all the
// requests in the claim. And if the requirements can
// be satisfied on more than one node, other scheduling features
// will determine which node is chosen. This means that the set of
// devices allocated to a claim might not be the optimal set
// available to the cluster. Scoring will be implemented later.

[WIP] Update KEP-4816 for beta

12a43ae

k8s-ci-robot added the cncf-cla: yes label (Indicates the PR's author has signed the CNCF CLA.) Apr 27, 2025

k8s-ci-robot added the kind/kep label (Categorizes KEP tracking issues and PRs modifying the KEP directory) Apr 27, 2025

k8s-ci-robot requested review from dom4ha and jeremyrickard April 27, 2025 22:37

k8s-ci-robot added the sig/scheduling label (Categorizes an issue or PR as relevant to SIG Scheduling.) Apr 27, 2025

github-project-automation bot added this to SIG Scheduling Apr 27, 2025

github-project-automation bot moved this to Needs Triage in SIG Scheduling Apr 27, 2025

k8s-ci-robot added the size/L label (Denotes a PR that changes 100-499 lines, ignoring generated files.) Apr 27, 2025

