Alarm Description
The system checks the usage of BE data disks every 30 seconds. This alarm is generated when the disk usage exceeds the threshold (95% by default).
This alarm is cleared when the system detects that the disk usage is less than the threshold.
Alarm Attributes
Alarm ID
|
Alarm Severity
|
Auto Cleared
|
50221
|
Critical
|
Yes
|
Alarm Parameters
Parameter
|
Description
|
Source
|
Specifies the cluster or system for which the alarm is generated.
|
ServiceName
|
Specifies the service for which the alarm is generated.
|
RoleName
|
Specifies the role for which the alarm is generated.
|
HostName
|
Specifies the host for which the alarm is generated.
|
Trigger Condition
|
Specifies the threshold for triggering the alarm.
|
Impact on the System
New data fails to be written, and the task is interrupted.
Possible Causes
- The disk space of the cluster is full.
- Data skew occurs among BE nodes.
Handling Procedure
- Log in to FusionInsight Manager, choose O&M > Alarm > Alarms, and view the role name and the IP address for the hostname in Location.
- Expand the disk capacity of the node for which the alarm is generated.
- Go to 4 if the expansion fails or the alarm persists after the expansion.
Collect fault information.
- On FusionInsight Manager, choose O&M. In the navigation pane on the left, choose Log > Download.
- Expand the Service drop-down list, and select Doris for the target cluster.
- Expand the Hosts drop-down list. In the Select Host dialog box that is displayed, select the abnormal host, and click OK.
- Click the edit icon in the upper right corner, and set Start Date and End Date for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click Download.
- Contact O&M personnel and provide the collected logs.
Alarm Clearance
This alarm is automatically cleared after the fault is rectified.