Reviewed-by: Eotvos, Oliver <oliver.eotvos@t-systems.com> Co-authored-by: qiujiandong1 <qiujiandong1@huawei.com> Co-committed-by: qiujiandong1 <qiujiandong1@huawei.com>
Key CCE AI Suite (NVIDIA GPU) Parameters
Check Items
Check whether the configuration of CCE AI Suite (NVIDIA GPU) in the cluster has been intrusively modified. If it has, the cluster upgrade may fail.
Solution
- Use kubectl to access the cluster.
- Run the following command to obtain the add-on instance details:
kubectl get ds nvidia-driver-installer -n kube-system -o yaml
- Check whether the UpdateStrategy value (spec.updateStrategy.type in the command output) has been changed to OnDelete. If so, change it back to RollingUpdate.
- Check whether the driver version in the NVIDIA_DRIVER_DOWNLOAD_URL value matches the GPU driver version shown on the add-on page. If not, correct the version on the add-on page.
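The two checks above can also be scripted. The following is a minimal sketch, not an official tool: it assumes kubectl is already configured for the cluster, that the DaemonSet is read with -o json instead of -o yaml so that only the Python standard library is needed, and that NVIDIA_DRIVER_DOWNLOAD_URL is set as an env entry on one of the DaemonSet's containers or init containers. The function names fetch_daemonset, find_env, and check_daemonset are hypothetical helpers introduced for illustration.

```python
import json
import subprocess


def fetch_daemonset():
    """Read the add-on DaemonSet as JSON (assumes kubectl can reach the cluster)."""
    result = subprocess.run(
        ["kubectl", "get", "ds", "nvidia-driver-installer",
         "-n", "kube-system", "-o", "json"],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)


def find_env(daemonset, name):
    """Return the value of env var `name` from the first container
    (init containers are searched first) that defines it, else None."""
    pod_spec = daemonset.get("spec", {}).get("template", {}).get("spec", {})
    containers = (pod_spec.get("initContainers") or []) + (pod_spec.get("containers") or [])
    for container in containers:
        for env in container.get("env") or []:
            if env.get("name") == name:
                return env.get("value")
    return None


def check_daemonset(daemonset):
    """Summarize the two items this topic asks you to inspect."""
    strategy = daemonset.get("spec", {}).get("updateStrategy", {}).get("type")
    return {
        "update_strategy": strategy,
        # The topic expects RollingUpdate; OnDelete indicates a modification.
        "strategy_ok": strategy == "RollingUpdate",
        # Compare this URL manually with the driver version on the add-on page.
        "driver_url": find_env(daemonset, "NVIDIA_DRIVER_DOWNLOAD_URL"),
    }
```

Calling print(check_daemonset(fetch_daemonset())) prints the current update strategy and the driver URL. The script only reports values; comparing the URL against the add-on page and changing any setting remain manual steps.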
Parent topic: Troubleshooting for Pre-upgrade Check Exceptions