doc-exports/docs/cce/umn/cce_10_0511.html
qiujiandong1 71d5c814e7 CCE UMN 20250311 version
Reviewed-by: Eotvos, Oliver <oliver.eotvos@t-systems.com>
Co-authored-by: qiujiandong1 <qiujiandong1@huawei.com>
Co-committed-by: qiujiandong1 <qiujiandong1@huawei.com>
2025-06-16 14:58:53 +00:00

Key CCE AI Suite (NVIDIA GPU) Parameters

Check Items

Check whether the configuration of the CCE AI Suite (NVIDIA GPU) add-on in the cluster has been intrusively modified. If it has, the cluster upgrade may fail.

Solution

  1. Use kubectl to access the cluster.
  2. Run the following command to obtain the add-on instance details:

    kubectl get ds nvidia-driver-installer -n kube-system -o yaml

  3. Check whether the update strategy (the spec.updateStrategy.type field in the output) has been changed to OnDelete. If so, change it back to RollingUpdate.
  4. Check whether the NVIDIA_DRIVER_DOWNLOAD_URL value matches the GPU driver version shown on the add-on page. If not, correct the version on the add-on page.
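
Steps 2 to 4 can be sketched as a few shell helpers. This is a minimal sketch, not an official procedure. It assumes the DaemonSet name and namespace from the command in step 2, and that NVIDIA_DRIVER_DOWNLOAD_URL appears as a named entry in the DaemonSet YAML with its value on the following line. The function names are hypothetical.

```shell
#!/bin/sh
# Sketch only: DaemonSet name and namespace are taken from the command in step 2.
DS=nvidia-driver-installer
NS=kube-system

# Print the DaemonSet update strategy type (RollingUpdate or OnDelete).
get_update_strategy() {
    kubectl get ds "$DS" -n "$NS" -o jsonpath='{.spec.updateStrategy.type}'
}

# Change the update strategy back to RollingUpdate, but only if it is OnDelete.
restore_rolling_update() {
    if [ "$(get_update_strategy)" = "OnDelete" ]; then
        kubectl patch ds "$DS" -n "$NS" --type merge \
            -p '{"spec":{"updateStrategy":{"type":"RollingUpdate"}}}'
    fi
}

# Print the lines that mention the driver download URL so they can be compared
# manually with the driver version on the add-on page.
# Assumption: the value sits on the line right after the variable name.
show_driver_url() {
    kubectl get ds "$DS" -n "$NS" -o yaml | grep -A 1 'NVIDIA_DRIVER_DOWNLOAD_URL'
}
```

After sourcing the file, run `get_update_strategy` and `show_driver_url` to inspect the current values, and `restore_rolling_update` only if the strategy needs to be reset. The driver version itself is still corrected on the add-on page, as described in step 4.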