Manage clusters
Default resource specs
Under Resource Specs, you can set custom resource presets that users can only select and use to launch ML workloads. You can also specify the priority of the defined options. For example, when you set the resource specs as above users will only be able to select the four options below.
These default options can help admins optimize resource usage by (1) preventing someone from occupying an excessive number of GPUs and (2) preventing unbalanced resource requests which cause skewed resource usage. As for average users, they can simply get going without thinking and configuring the exact number of CPU cores and memories they need to request.
Step-by-step Guide
Click New resource spec and define the following parameters.
- Name — Set a name for the preset. Use names that well represent the preset like
a100-2.mem-16.cpu-6
. - Processor type — Define the preset by the processor type, either by CPU or GPU.
- CPU limit — Enter the number of CPUs. For
a100-2.mem-16.cpu-6
, enter6
. - Memory limit — Enter the amount of memory in GB. For
a100-2.mem-16.cpu-6
, the number would be 16. - GPU type — Specify which GPU you are using. You can get this information by using the
nvidia-smi
command on your server. In the following example, the value isa100-sxm-80gb
.
- GPU limit — Enter the number of GPUs. For
gpu2.mem16.cpu6
, enter2
. You can also place decimal values if you are using Multi-Instance GPUs (MIG). - Priority — Using different values for priority disables FIFO scheduler and assigns workloads according to priority, with lower priority being first. The example preset below always puts workloads running on
gpu-1
ahead of any other workloads.
- Available workloads — Select the type of workloads that can use the preset. With this, you can guide users to use 🗂️ Experiments by preventing them from running ️ Workspaces with 4 or 8 GPUs.