What Should I Do If a Node Pool Scale-Out Fails?

Fault Locating

Locate the fault based on the events of the failure to scale out a node pool, as shown in Table 1.

Table 1 Node pool scale-out failure

Event

Possible Cause

Reference

...call fsp to query keypair fail, error code : Ecs.0314, reason is : the keypair *** does not match the user_id ***...

The possible causes are as follows:

  • The key pair selected for logging in to the node pool has been deleted.
  • The key pair selected for logging in to the node pool is a private one which cannot be used by the current user to log in to the node pool and create nodes in the node pool.

Failed to Obtain the Key Pair Used for Logging In to a Node Pool

{"error":{"message":"encrypted key id [***] is invalid.","code":"Ecs.0912"}}

The possible causes are as follows:

  • The KMS key ID entered during node pool creation does not exist.
  • The KMS key ID entered during node pool creation is the key of another user, but the user has not authorized you.

Invalid KMS Key ID

Security group [*****] not found

This issue can arise in the following scenarios:

  • A custom security group is set up for the node pool but gets deleted, so the node pool scale-out fails.
  • No custom security group is configured for the node pool and the default security group is deleted, so the node pool scale-out fails.

The Security Group Specified by the Node Pool Deleted

Failed to Obtain the Key Pair Used for Logging In to a Node Pool

If a node pool scale-out fails, the event contains Ecs.0314. This error code indicates that the key pair used for logging in to the node pool cannot be obtained, which results in the creation failure of a new ECS.

...call fsp to query keypair fail, error code : Ecs.0314, reason is : the keypair *** does not match the user_id ***...

The possible causes are as follows:

Solution

Invalid KMS Key ID

When a node pool fails to be expanded, the reported event contains Ecs.0912.

{"error":{"message":"encrypted key id [***] is invalid.","code":"Ecs.0912"}}

The possible causes are as follows:

Solution

The Security Group Specified by the Node Pool Deleted

When a node pool fails to be expanded, the event contains the following information:

Security group [*****] not found

This issue can arise in the following scenarios:

Solution