Skip to content

Commit dac477e

Browse files
committed
Update microk8s install command and custom containerd options
Signed-off-by: Christopher Desiniotis <[email protected]>
1 parent 7736a55 commit dac477e

File tree

2 files changed

+33
-26
lines changed

2 files changed

+33
-26
lines changed

gpu-operator/getting-started.rst

Lines changed: 29 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -479,11 +479,11 @@ options are used with the container-toolkit deployed with GPU Operator:
479479
toolkit:
480480
env:
481481
- name: CONTAINERD_CONFIG
482-
value: /var/snap/microk8s/current/args/containerd-template.toml
482+
value: /etc/containerd/containerd.toml
483483
- name: CONTAINERD_SOCKET
484-
value: /var/snap/microk8s/common/run/containerd.sock
484+
value: /run/containerd/containerd.sock
485485
- name: RUNTIME_CONFIG_SOURCE
486-
value: file=/var/snap/microk8s/current/args/containerd.toml
486+
value: "command, file"
487487
488488
489489
If you need to specify custom values, refer to the following sample command for the syntax:
@@ -495,35 +495,41 @@ If you need to specify custom values, refer to the following sample command for
495495
nvidia/gpu-operator $HELM_OPTIONS \
496496
--version=${version} \
497497
--set toolkit.env[0].name=CONTAINERD_CONFIG \
498-
--set toolkit.env[0].value=/var/snap/microk8s/current/args/containerd-template.toml \
498+
--set toolkit.env[0].value=/etc/containerd/containerd.toml \
499499
--set toolkit.env[1].name=CONTAINERD_SOCKET \
500-
--set toolkit.env[1].value=/var/snap/microk8s/common/run/containerd.sock \
500+
--set toolkit.env[1].value=/run/containerd/containerd.sock \
501501
--set toolkit.env[2].name=RUNTIME_CONFIG_SOURCE \
502-
--set toolkit.env[2].value=file=/var/snap/microk8s/current/args/containerd.toml
502+
--set toolkit.env[2].value="command, file"
503503
504504
These options are defined as follows:
505505

506506
CONTAINERD_CONFIG
507-
The path on the host to the ``containerd`` config
508-
you would like to have updated with support for the ``nvidia-container-runtime``.
509-
By default this will point to ``/var/snap/microk8s/current/args/containerd-template.toml``
507+
The path on the host to the top-level ``containerd`` config file.
508+
By default this will point to ``/etc/containerd/containerd.toml``
510509
(the default location for ``containerd``). It should be customized if your ``containerd``
511510
installation is not in the default location.
512511

513512
CONTAINERD_SOCKET
514513
The path on the host to the socket file used to
515514
communicate with ``containerd``. The operator will use this to send a
516515
``SIGHUP`` signal to the ``containerd`` daemon to reload its config. By
517-
default this will point to ``/var/snap/microk8s/common/run/containerd.sock``
516+
default this will point to ``/etc/containerd/containerd.sock``
518517
(the default location for ``containerd``). It should be customized if
519518
your ``containerd`` installation is not in the default location.
520519

521520
RUNTIME_CONFIG_SOURCE
522-
The path on the host to the runtime config source file used to
523-
load the nvidia-container-runtime into the containerd runtime.
524-
By default this will point to ``/var/snap/microk8s/current/args/containerd.toml``
525-
(the default location for ``containerd``). It should be customized if
526-
your ``containerd`` installation is not in the default location.
521+
The config source(s) that the container-toolkit uses when fetching
522+
the current containerd configuration. A valid value for this setting is any
523+
combination of [command | file]. By default this will be configured as
524+
"command, file" which means the container-toolkit will attempt to fetch
525+
the configuration via the containerd CLI before falling back to reading the
526+
config from the top-level ``containerd`` config file (configured via
527+
CONTIANERD_CONFIG). When ``file`` is specified, the absolute path to the file
528+
to be used as a config source can be specified as ``file=/path/to/source/config.toml``
529+
530+
RUNTIME_DROP_IN_CONFIG
531+
The path on the host where the NVIDIA-specific drop-in config file
532+
will be created. By default this will point to ``/etc/containerd/conf.d/99-nvidia.toml``.
527533

528534

529535
Rancher Kubernetes Engine 2
@@ -535,6 +541,8 @@ in the RKE2 documentation.
535541

536542
Refer to the :ref:`v24.9.0-known-limitations`.
537543

544+
.. _microk8s-install-procedure:
545+
538546
MicroK8s
539547
========
540548

@@ -545,13 +553,11 @@ For MicroK8s, set the following in the ``ClusterPolicy``.
545553
toolkit:
546554
env:
547555
- name: CONTAINERD_CONFIG
548-
value: /var/snap/microk8s/current/args/containerd.toml
556+
value: /var/snap/microk8s/current/args/containerd-template.toml
549557
- name: CONTAINERD_SOCKET
550558
value: /var/snap/microk8s/common/run/containerd.sock
551-
- name: CONTAINERD_RUNTIME_CLASS
552-
value: nvidia
553-
- name: CONTAINERD_SET_AS_DEFAULT
554-
value: "true"
559+
- name: RUNTIME_CONFIG_SOURCE
560+
value: "file=/var/snap/microk8s/current/args/containerd.toml"
555561
556562
These options can be passed to GPU Operator during install time as below.
557563

@@ -561,13 +567,11 @@ These options can be passed to GPU Operator during install time as below.
561567
nvidia/gpu-operator $HELM_OPTIONS \
562568
--version=${version} \
563569
--set toolkit.env[0].name=CONTAINERD_CONFIG \
564-
--set toolkit.env[0].value=/var/snap/microk8s/current/args/containerd.toml \
570+
--set toolkit.env[0].value=/var/snap/microk8s/current/args/containerd-template.toml \
565571
--set toolkit.env[1].name=CONTAINERD_SOCKET \
566572
--set toolkit.env[1].value=/var/snap/microk8s/common/run/containerd.sock \
567-
--set toolkit.env[2].name=CONTAINERD_RUNTIME_CLASS \
568-
--set toolkit.env[2].value=nvidia \
569-
--set toolkit.env[3].name=CONTAINERD_SET_AS_DEFAULT \
570-
--set-string toolkit.env[3].value=true
573+
--set toolkit.env[2].name=RUNTIME_CONFIG_SOURCE \
574+
--set-string toolkit.env[2].value=file=/var/snap/microk8s/current/args/containerd.toml
571575
572576
.. _running sample gpu applications:
573577
.. _verify gpu operator install:

gpu-operator/release-notes.rst

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -118,7 +118,10 @@ Improvements
118118
------------
119119

120120
* The GPU Operator now configures containerd and cri-o using drop-in files by default.
121-
When installing on microk8s, you need to set the value of the RUNTIME_CONFIG_SOURCE parameter in the CLusterPolicy to ``file=/var/snap/microk8s/current/args/containerd.toml``.
121+
As a consequence of this change, some of the install procedures for Kubernetes distributions
122+
that use custom containerd installations have changed.
123+
124+
- The install procedure for microk8s has changed. Refer to the latest :ref:`MicroK8s` install procedure.
122125

123126
* Hardened the GPU Operator container image by using a distroless as a base image.
124127

0 commit comments

Comments
 (0)