Skip to content

Conversation

@MenD32
Copy link
Contributor

@MenD32 MenD32 commented Nov 28, 2025

What type of PR is this?

/kind feature

What this PR does / why we need it:

This PR adds a cluster resource limits for DRA resources.

Which issue(s) this PR fixes:

Fixes #8184

Special notes for your reviewer:

The solution proposed here is limited only to Nvidia GPUs, but the adding additional DRA resources should not change the business logic for this.

Does this PR introduce a user-facing change?

DRA: Cluster limits support. A new flag --dra-total has been added, this flag can be called multiple times in order to set cluster limits for DRA resources, for example --dra-total="gpu.nvidia.com:nvidia l4:0:10" will set the limit as between 0 to 10 L4 nvidia GPUs. this feature is currently limited to the gpu.nvidia.com DRA driver.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
@k8s-ci-robot k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/feature Categorizes issue or PR as related to a new feature. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/needs-area area/cluster-autoscaler and removed do-not-merge/needs-area labels Nov 28, 2025
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: MenD32
Once this PR has been reviewed and has the lgtm label, please assign aleksandra-malinowska for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Nov 28, 2025
@MenD32
Copy link
Contributor Author

MenD32 commented Nov 28, 2025

/label tide/merge-method-squash

@k8s-ci-robot k8s-ci-robot added the tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges. label Nov 28, 2025
Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
Signed-off-by: MenD32 <amit.mendelevitch@gmail.com>
@MenD32 MenD32 changed the title Feat/dra cluster resource limit Feat: DRA cluster resource limit Dec 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/cluster-autoscaler cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. release-note Denotes a PR that will be considered when it comes time to generate release notes. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

CA DRA: integrate with cluster-wide resource limits

2 participants