The system checks the number of failed Yarn tasks every 10 minutes. This alarm is generated when the number of failed Yarn tasks in the last 10 minutes is greater than the threshold. This alarm is automatically cleared when the number of failed Yarn tasks is less than the threshold in the next 10 minutes.
Alarm ID |
Alarm Severity |
Auto Clear |
---|---|---|
18013 |
Major |
Yes |
Parameter |
Description |
---|---|
ServiceName |
Service for which the alarm is generated. |
RoleName |
Role for which the alarm is generated. |
HostName |
Host for which the alarm is generated. |
None
The submitted Yarn job program is incorrect. For example, the parameter for Spark to submit a job is incorrect.
Check the log of the failed job, locate the failure cause, modify the job, and submit the job again.
None