This repository was archived by the owner on Nov 20, 2024. It is now read-only.
I've been looking into some 'virtualized GPU' frameworks, but there are a few questions I cannot resolve.
If I've allocated some "virtualized GPUs" to instances, how is the fraction of CUDA cores per instance handled?
I've heard that Ampere's MIG supports slicing NVIDIA CUDA cores, but I have not heard of that feature on Volta instances.
So, if someone is running a task that requires GPU resources, do the others need to wait until the previous task has finished?
If the task is not inference (for example, training a lightweight model), how can I correctly specify that one of the containers may use n CUDA cores and a given amount of memory?