doc-exports/docs/mrs/umn/ALM-50210.html
Yang, Tong 5914b67d13 MRS UMN Doc 20240802 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-09-28 19:04:58 +00:00


<a name="ALM-50210"></a><a name="ALM-50210"></a>
<h1 class="topictitle1">ALM-50210 Maximum Compaction Score of All BE Nodes Exceeds the Threshold</h1>
<div id="body0000001232069955"><div class="section" id="ALM-50210__section60313499"><h4 class="sectiontitle"><span id="ALM-50210__text1558625720546">Alarm Description</span></h4><p id="ALM-50210__p1793193611551">The system checks the maximum compaction score of all BE nodes every 30 seconds. This alarm is generated when the maximum compaction score exceeds the threshold (10 by default).</p>
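<p>A minimal sketch of the check described above. It assumes the alarm is raised only after the threshold is exceeded in a configurable number of consecutive 30-second checks (the trigger count set on the Thresholds page). The function name, the sample format, and the consecutive-check semantics are illustrative assumptions, not the actual FusionInsight Manager implementation.</p>

```python
from typing import Dict, List

DEFAULT_THRESHOLD = 10       # default threshold stated in the alarm description
CHECK_INTERVAL_SECONDS = 30  # check period stated in the alarm description


def should_raise_alarm(samples: List[Dict[str, float]],
                       threshold: float = DEFAULT_THRESHOLD,
                       trigger_count: int = 1) -> bool:
    """Return True if the most recent checks all exceeded the threshold.

    Each sample maps a BE host name to its compaction score at one check.
    Assumption: "trigger count" means consecutive checks over the threshold.
    """
    needed = max(trigger_count, 1)
    recent = samples[-needed:]
    if len(recent) != needed:
        return False  # not enough checks collected yet
    # The alarm looks at the maximum score across all BE nodes per check.
    return all(max(sample.values(), default=0.0) > threshold for sample in recent)
```

<p>With the default threshold, a score of exactly 10 does not raise the alarm in this sketch, because the description says the score must exceed the threshold.</p>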
</div>
<div class="section" id="ALM-50210__section5950580"><h4 class="sectiontitle"><span id="ALM-50210__text38748475555">Alarm Attributes</span></h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-50210__table15548096" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-50210__row49989141"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-50210__p57710042"><span id="ALM-50210__text17980150175619">Alarm ID</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-50210__p44001849"><span id="ALM-50210__text199471335614">Alarm Severity</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-50210__p7380012"><span id="ALM-50210__text152400388563">Auto Cleared</span></p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-50210__row30415758"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-50210__p47757325">50210</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-50210__p43138141">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-50210__p4528550">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-50210__section53555227"><h4 class="sectiontitle"><span id="ALM-50210__text155061195577">Alarm Parameters</span></h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-50210__table31268239" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-50210__row59179380"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-50210__p21975462"><span id="ALM-50210__text776142495720">Parameter</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-50210__p35182007"><span id="ALM-50210__text632018391572">Description</span></p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-50210__row12465939134110"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-50210__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-50210__p187931338134115">Specifies the cluster or system for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-50210__row48724307"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-50210__p54354790">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-50210__p40661878">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-50210__row30412584"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-50210__p47500221">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-50210__p22312707">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-50210__row66596640"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-50210__p25618737">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-50210__p61851848">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-50210__row19795720"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-50210__p59949472">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-50210__p24069040">Specifies the threshold for triggering the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-50210__section12235000"><h4 class="sectiontitle"><span id="ALM-50210__text158162411552">Impact on the System</span></h4><p id="ALM-50210__p1615870205116">Queries or writes may be delayed.</p>
</div>
<div class="section" id="ALM-50210__section43006140"><h4 class="sectiontitle"><span id="ALM-50210__text1078914441051">Possible Causes</span></h4><p id="ALM-50210__p1816812695119">The cluster is handling a large number of concurrent service requests, or the compaction queue is too small.</p>
</div>
<div class="section" id="ALM-50210__section333352655112"><h4 class="sectiontitle"><span id="ALM-50210__text1749718472517">Handling Procedure</span></h4><p id="ALM-50210__p1214212184619"><strong id="ALM-50210__b060183916612">Check whether the alarm threshold or alarm trigger count is properly configured.</strong></p>
<ol id="ALM-50210__ol65061854183616"><li id="ALM-50210__li1505135403619"><span>Log in to FusionInsight Manager, choose <strong id="ALM-50210__b14459077714">O&amp;M</strong> &gt; <strong id="ALM-50210__b157037108717">Alarm</strong> &gt; <strong id="ALM-50210__b9575159718">Thresholds</strong>, click the name of the desired cluster, and choose <strong id="ALM-50210__b649125120713">Doris</strong> &gt; <strong id="ALM-50210__b5136205022520">Performance</strong> &gt; <strong id="ALM-50210__b12645145510197">Maximum compaction score of all BE nodes (BE)</strong>.</span></li><li id="ALM-50210__li450525473617"><span>Click the edit button next to <strong id="ALM-50210__b12417358399">Trigger Count</strong>, change the number based on site requirements, and click <strong id="ALM-50210__b24171658796">OK</strong>.</span></li><li class="litext" id="ALM-50210__li11505195493616"><span>Click <strong id="ALM-50210__b348016116101">Modify</strong> in the <strong id="ALM-50210__b9480101141018">Operation</strong> column, change the alarm threshold based on site requirements, and click <strong id="ALM-50210__b104804113101">OK</strong>.</span></li><li id="ALM-50210__li85061254183616"><span>Wait 2 minutes and check whether the alarm is cleared in the alarm list.</span><p><ul class="subitemlist" id="ALM-50210__ul05055540369"><li id="ALM-50210__li15505165410369">If yes, no further action is required.</li><li id="ALM-50210__li1650518546360">If no, go to <a href="#ALM-50210__li11386141118209">5</a>.</li></ul>
</p></li><li class="subitemlist" id="ALM-50210__li11386141118209"><a name="ALM-50210__li11386141118209"></a><a name="li11386141118209"></a><span>Choose <strong id="ALM-50210__b1386655116">Cluster</strong> &gt; <strong id="ALM-50210__b382868151210">Services</strong> &gt; <strong id="ALM-50210__b075820118121">Doris</strong> &gt; <strong id="ALM-50210__b620416143124">Configurations</strong> &gt; <strong id="ALM-50210__b12738249171216">All Configurations</strong> &gt; <strong id="ALM-50210__b15335185651216">BE(Role)</strong> &gt; <strong id="ALM-50210__b8499104171311">Customization</strong>, add the <strong id="ALM-50210__b1764615118131">max_base_compaction_threads</strong> parameter to <strong id="ALM-50210__b732721591319">be.conf</strong> with a value of <strong id="ALM-50210__b179471033181320">10</strong>, and add the <strong id="ALM-50210__b2801040181318">max_cumu_compaction_threads</strong> parameter with a value of <strong id="ALM-50210__b2782165991315">20</strong>.</span></li><li class="subitemlist" id="ALM-50210__li18506254163613"><span>Click <strong id="ALM-50210__b49435675972943">Save</strong>. Click <strong id="ALM-50210__b182313812115">Instances</strong>, select the BE instances whose configuration has expired, click <strong id="ALM-50210__b15814192316219">More</strong>, and select <strong id="ALM-50210__b29751536152118">Restart Instance</strong> to restart the Doris BE instances.</span><p><div class="notice" id="ALM-50210__note1777517216301"><span class="noticetitle"><img src="public_sys-resources/notice_3.0-en-us.png"> </span><div class="noticebody"><p id="ALM-50210__p67761421153018">While a BE instance is restarting, tasks running on that BE node will fail. Tasks on BE nodes that are not restarted are not affected.</p>
</div></div>
</p></li><li id="ALM-50210__li750665417369"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-50210__ul750616548363"><li id="ALM-50210__li85061854103613">If yes, no further action is required.</li><li id="ALM-50210__li2506754153619">If no, go to <a href="#ALM-50210__li1550365453611">8</a>.</li></ul>
</p></li></ol>
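<p>For reference, the two custom parameters from the procedure above end up as plain entries in <strong>be.conf</strong>. The sketch below only illustrates that outcome on a piece of text. It assumes <strong>be.conf</strong> uses <strong>key = value</strong> lines, and the helper name <strong>apply_overrides</strong> is hypothetical. On an MRS cluster, add the parameters through FusionInsight Manager as described, not by editing the file by hand.</p>

```python
from typing import Dict

# Values taken from the handling procedure above.
COMPACTION_OVERRIDES = {
    "max_base_compaction_threads": "10",
    "max_cumu_compaction_threads": "20",
}


def apply_overrides(conf_text: str, overrides: Dict[str, str]) -> str:
    """Set each key in be.conf-style text: replace an existing line or append.

    Hypothetical helper for illustration; assumes "key = value" lines and
    "#" comments.
    """
    remaining = dict(overrides)
    out = []
    for line in conf_text.splitlines():
        stripped = line.strip()
        key = stripped.split("=", 1)[0].strip()
        if "=" in stripped and not stripped.startswith("#") and key in remaining:
            # Replace the first active occurrence of the key.
            out.append(f"{key} = {remaining.pop(key)}")
        else:
            out.append(line)
    # Append any keys that were not already present.
    out.extend(f"{key} = {value}" for key, value in remaining.items())
    return "\n".join(out) + "\n"
```

<p>The new thread counts take effect only after the BE instances are restarted, as the procedure states.</p>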
<p id="ALM-50210__p13380337455"><strong id="ALM-50210__b1438014324515">Collect fault information.</strong></p>
<ol start="8" id="ALM-50210__ol1504145483617"><li id="ALM-50210__li1550365453611"><a name="ALM-50210__li1550365453611"></a><a name="li1550365453611"></a><span>On FusionInsight Manager, choose <strong id="ALM-50210__b85233498238">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-50210__b3523349112319">Log</strong> &gt; <strong id="ALM-50210__b3524124982319">Download</strong>.</span></li><li id="ALM-50210__li1450335418368"><span>Expand the <strong id="ALM-50210__b18220105372319">Service</strong> drop-down list, select <strong id="ALM-50210__b822035352320">Doris</strong> for the target cluster, and click <strong id="ALM-50210__b17220125362312">OK</strong>.</span></li><li id="ALM-50210__li2504135411368"><span>Click the edit icon in the upper right corner, and set <strong id="ALM-50210__b9348957202311">Start Date</strong> and <strong id="ALM-50210__b9348115712315">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-50210__b15348145792318">Download</strong>.</span></li><li id="ALM-50210__li750416543363"><span>Contact <span id="ALM-50210__text1576618598236">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
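<p>The log collection window can be worked out from the alarm generation time as sketched below. The function name is hypothetical and the timestamp in the usage note is an example value, not one taken from a real alarm.</p>

```python
from datetime import datetime, timedelta
from typing import Tuple

WINDOW = timedelta(minutes=10)  # margin stated in the log collection step


def log_collection_window(alarm_time: datetime) -> Tuple[datetime, datetime]:
    """Return (start, end): 10 minutes before and after the alarm time."""
    return alarm_time - WINDOW, alarm_time + WINDOW
```

<p>For example, an alarm generated at 12:05 gives a Start Date of 11:55 and an End Date of 12:15 on the same day.</p>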
</div>
<div class="section" id="ALM-50210__section169311343318"><h4 class="sectiontitle"><span id="ALM-50210__text116512504511">Alarm Clearance</span></h4><p id="ALM-50210__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-50210__section60945317"><h4 class="sectiontitle"><span id="ALM-50210__text21670669">Related Information</span></h4><p id="ALM-50210__p10326323"><span id="ALM-50210__text19275105817121">None.</span></p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>