Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
ALM-43007 Non-Heap Memory Usage of the JobHistory Process Exceeds the Threshold
Description
The system checks the JobHistory process status every 30 seconds. The alarm is generated when the non-heap memory usage of the JobHistory process exceeds the threshold (90% of the maximum memory).
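The threshold comparison described above can be illustrated with a minimal sketch. It is not the monitoring code used by the system; the function names and the sample byte values are hypothetical, and only the 90% ratio comes from the description.

```python
from typing import Optional

# From the alarm description: 90% of the maximum non-heap memory.
THRESHOLD_RATIO = 0.90


def non_heap_usage_ratio(used_bytes: int, max_bytes: int) -> Optional[float]:
    """Return used/max, or None when no usable maximum is known."""
    if max_bytes <= 0:
        return None
    return used_bytes / max_bytes


def exceeds_threshold(used_bytes: int, max_bytes: int,
                      threshold: float = THRESHOLD_RATIO) -> bool:
    """True only when usage is strictly above the threshold ratio."""
    ratio = non_heap_usage_ratio(used_bytes, max_bytes)
    return ratio is not None and ratio > threshold


if __name__ == "__main__":
    mib = 1024 * 1024
    # Hypothetical samples: 480 MiB and 400 MiB used out of a 512 MiB maximum.
    print(exceeds_threshold(480 * mib, 512 * mib))
    print(exceeds_threshold(400 * mib, 512 * mib))
```

In this sketch, a value exactly at 90% does not count as exceeding the threshold. That reading follows the word "exceeds" in the description and is an assumption about the boundary case.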
Attribute
Alarm ID | Alarm Severity | Automatically Cleared |
---|---|---|
43007 | Major | Yes |
Parameters
Parameter | Description |
---|---|
ServiceName | Specifies the service for which the alarm is generated. |
RoleName | Specifies the role for which the alarm is generated. |
HostName | Specifies the host for which the alarm is generated. |
Impact on the System
If the JobHistory process does not have enough available non-heap memory, a memory overflow occurs and the service breaks down.
Possible Causes
The non-heap memory of the JobHistory process is overused or the non-heap memory is inappropriately allocated.
Procedure
- Check non-heap memory usage.
  - Go to the cluster details page and choose Alarms.
  - Select the alarm whose Alarm ID is 43007 and view the IP address and role name of the instance in Location.
  - Choose Components > Spark > Instance > JobHistory (IP address of the instance for which the alarm is generated) > Customize > Non-Heap Memory Statistics of the JobHistory Process. Click OK to view the non-heap memory usage.
  - Check whether the non-heap memory usage of JobHistory has reached the threshold (90% of the maximum memory).
  - Choose Components > Spark > Service Configuration. Set Type to All and choose JobHistory > Default. Increase the value of -XX:MaxMetaspaceSize in SPARK_DAEMON_JAVA_OPTS as required.
  - Check whether the alarm is cleared.
    - If yes, no further action is required.
    - If no, go to "Collect fault information."
- Collect fault information.
  - On MRS Manager, choose .
  - Contact technical support engineers for help. For details, see technical support.
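The step that increases -XX:MaxMetaspaceSize in SPARK_DAEMON_JAVA_OPTS amounts to replacing one option inside a JVM options string. The following is a minimal sketch of that edit, not a tool provided by MRS. The helper name and the sample option strings are hypothetical, and the actual change is made on the Service Configuration page.

```python
import re

# Matches an existing -XX:MaxMetaspaceSize=<value> option.
_MAX_METASPACE_OPT = re.compile(r"-XX:MaxMetaspaceSize=\S+")


def set_max_metaspace_size(java_opts: str, new_size: str) -> str:
    """Replace -XX:MaxMetaspaceSize in a JVM options string, or append it."""
    # JVM sizes are digits with an optional k, m, or g suffix (either case).
    if not re.fullmatch(r"\d+[kKmMgG]?", new_size):
        raise ValueError(f"unexpected size format: {new_size!r}")
    new_opt = f"-XX:MaxMetaspaceSize={new_size}"
    if _MAX_METASPACE_OPT.search(java_opts):
        return _MAX_METASPACE_OPT.sub(new_opt, java_opts)
    return f"{java_opts} {new_opt}".strip()


if __name__ == "__main__":
    # Hypothetical current value of SPARK_DAEMON_JAVA_OPTS.
    current = "-Xmx4G -XX:MaxMetaspaceSize=512M -XX:+UseG1GC"
    print(set_max_metaspace_size(current, "1G"))
```

The sketch leaves all other options untouched, so the new value can be reviewed in isolation before it is entered in the configuration. How far to raise the limit depends on the observed non-heap usage; the 512M and 1G figures here are placeholders only.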
Reference
None