Conversation
sync with main
…ring GPUs between deployments
…cy in the native go dns resolver (netgo) Had problems with resolving *.localhost subdomains.
…nt vendors gpu configuration with nvidia impl for now + code to generate nvidia opaque types for gpu config so we dont have to rely on importing nvidias code, it has a init func that fails unless a specific variable is set to a version at compile time (with ldflags), this made testing extremely hard. Left for full k8s-layer impl: - Actually impl CreateResourceClaimManifest - Add Tolerations to the ResourceClaimPublic requests - Figure out if / how Constraints from the resourcev1.ResourceClaimSpec should be added Left for broader (api) impl: - Model structs for this, with additional data for controlling rbac - DTOs - Routes
…g resource claims
…olled out simpler
…complete rbac on gpuclaims
add vm usage permission
add gpu usage for users
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR adds features related to allocating (as an admin) GPUs with a desired sharing strategy that users can consume from deployments.
This feature is controlled with the zone capability
draand should only be enabled for clusters with a version of k8s 1.34 or above. It also requires the cluster to have the nvidia dra driver installed.Additional changes
VM can be controlled with the role permission
useVms, if not set It will default to true. If set as:useVMs: falseit will deny that role from creating new VMs.GPU usage can be controlled with the role quota
gpus.