doc-exports/docs/mrs/umn/ALM-13010.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00


<a name="ALM-13010"></a>
<h1 class="topictitle1">ALM-13010 Znode Usage of a Directory with Quota Configured Exceeds the Threshold</h1>
<div id="body1559547426810"><div class="section" id="ALM-13010__section18794533"><h4 class="sectiontitle">Description</h4><p id="ALM-13010__p65268561">Every hour, the system checks the Znode usage of all service directories that have a quota configured. This alarm is generated when the system detects that the usage of a second-level Znode exceeds the threshold.</p>
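<p>The check described above can be sketched as follows. This is a minimal illustration, not the system's actual implementation: it assumes that usage is the number of Znodes under a directory divided by its configured quota, and that the alarm condition is a strict "greater than" comparison. The function names <code>znode_usage_percent</code> and <code>exceeds_threshold</code> are hypothetical.</p>

```python
def znode_usage_percent(znode_count: int, quota: int) -> float:
    """Return Znode usage as a percentage of the configured quota.

    Assumption: usage = Znode count / quota. The document does not
    state the exact formula used by the system.
    """
    if quota <= 0:
        raise ValueError("quota must be a positive Znode count")
    return 100.0 * znode_count / quota


def exceeds_threshold(znode_count: int, quota: int, threshold_percent: float) -> bool:
    """Return True when usage is strictly above the threshold (assumed comparison)."""
    return znode_usage_percent(znode_count, quota) > threshold_percent


if __name__ == "__main__":
    # Hypothetical numbers: 9000 Znodes against a quota of 10000, threshold 80%.
    print(znode_usage_percent(9000, 10000))
    print(exceeds_threshold(9000, 10000, 80.0))
```

<p>Under these assumptions, a directory holding 9000 Znodes against a quota of 10000 is at 90% usage, which would exceed an 80% threshold.</p>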
</div>
<div class="section" id="ALM-13010__section34933073"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13010__table52262125" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13010__row24697033"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-13010__en-us_topic_0070543636_p44032603">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-13010__en-us_topic_0070543636_p9871120">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-13010__en-us_topic_0070543636_p61363278">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-13010__row37919625"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-13010__p1163219417345">13010</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-13010__en-us_topic_0070543638_p9804735">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-13010__en-us_topic_0070543638_p55986102">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-13010__section45962205"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-13010__table51772816" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-13010__row55869420"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-13010__en-us_topic_0070543636_p28093062">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-13010__en-us_topic_0070543636_p60945575">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-13010__row13205923710"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13010__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13010__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13010__row57640736"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13010__p38388029">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13010__en-us_topic_0070543636_p25480534">Specifies the service name for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13010__row477048"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13010__p38640893">ServiceDirectory</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13010__p17361818514">Specifies the directory for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13010__row111316194717"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13010__p39186745">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13010__en-us_topic_0070543636_p60763443">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-13010__row50597141"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-13010__p4727789">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-13010__p2543134315394">Specifies the cause of the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-13010__section35804574"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-13010__p14730421">A large amount of data is written to the ZooKeeper data directory. As a result, ZooKeeper cannot provide services properly.</p>
</div>
<div class="section" id="ALM-13010__section53805712"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-13010__ul52313462"><li id="ALM-13010__li107411843184019">A large amount of data is written to the ZooKeeper data directory.</li><li id="ALM-13010__li1074119432403">The user-defined threshold is inappropriate.</li></ul>
</div>
<div class="section" id="ALM-13010__section340145113316"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-13010__p33897081"><strong id="ALM-13010__b1311515354016">Check whether a large amount of data is written into the directory for which the alarm is generated.</strong></p>
<ol id="ALM-13010__ol49811923133518"><li id="ALM-13010__li1698018237353"><span>On FusionInsight Manager, choose <strong id="ALM-13010__b1045903114119">O&amp;M</strong> &gt; <strong id="ALM-13010__b154598320415">Alarm &gt; Alarms</strong>. In the <strong id="ALM-13010__b245983124114">Location</strong> information of this alarm, identify the Znode for which the alarm is generated.</span></li><li id="ALM-13010__li129813238354"><span>Choose <strong id="ALM-13010__b12418122917460">Cluster</strong> &gt; <em id="ALM-13010__i5418929114619">Name of the desired cluster</em> &gt; <strong id="ALM-13010__b241832910463">Services</strong> &gt; <strong id="ALM-13010__b541862944619">ZooKeeper</strong> and click <strong id="ALM-13010__b1641817296466">Resource</strong>. In <strong id="ALM-13010__b1441822912469">Used Resources (By Second-Level Znode)</strong>, check whether a large amount of data has been written to the top Znode.</span><p><ul class="subitemlist" id="ALM-13010__ul16981162353516"><li id="ALM-13010__li6981142312353">If yes, go to <a href="#ALM-13010__li1298122393514">4</a>.</li><li id="ALM-13010__li3981162313518">If no, go to <a href="#ALM-13010__li598192363510">5</a>.</li></ul>
</p></li><li id="ALM-13010__li19446131612914"><span>Log in to FusionInsight Manager, choose <strong id="ALM-13010__b1438682310914">O&amp;M &gt; Alarm &gt; Alarms</strong>, select Location from the drop-down list box next to <strong id="ALM-13010__b15084412910">ALM-13010 Znode Usage of a Directory with Quota Configured Exceeds the Threshold</strong>, and obtain the Znode path from ServiceDirectory.</span></li><li id="ALM-13010__li1298122393514"><a name="ALM-13010__li1298122393514"></a><a name="li1298122393514"></a><span>Log in to the ZooKeeper client as a cluster user and delete unwanted data from the Znode for which the alarm is generated.</span></li><li id="ALM-13010__li598192363510"><a name="ALM-13010__li598192363510"></a><a name="li598192363510"></a><span>Log in to FusionInsight Manager, and choose <strong id="ALM-13010__b5604201624714">Cluster</strong> &gt; <em id="ALM-13010__i360411164478">Name of the desired cluster</em> &gt; <strong id="ALM-13010__b9604116104714">Services</strong> &gt; <em id="ALM-13010__i0604101644717">Component of the top Znode for which the alarm is generated</em>. Choose <strong id="ALM-13010__b1260411160472">Configuration</strong><strong id="ALM-13010__b1263217711447">s</strong> &gt; <strong id="ALM-13010__b6604116164713">All Configurations</strong>, search for <strong id="ALM-13010__b16041161476">zk.quota.number</strong>, increase its value, and click <strong id="ALM-13010__b7604316154712">Save</strong>.</span><p><div class="notice" id="ALM-13010__note631713358407"><span class="noticetitle"><img src="public_sys-resources/notice_3.0-en-us.png"> </span><div class="noticebody"><p id="ALM-13010__p14317113524012">If the component of the top Znode for which the alarm is generated is ClickHouse, change the value of <strong id="ALM-13010__b57891022154110">clickhouse.zookeeper.quota.node.count</strong>.</p>
</div></div>
</p></li><li id="ALM-13010__li498182343517"><span>Check whether the alarm is cleared.</span><p><ul id="ALM-13010__ul17981172319354"><li id="ALM-13010__li746133494718">If yes, no further action is required.</li><li id="ALM-13010__li946233415476">If no, go to <a href="#ALM-13010__li13978523123518">7</a>.</li></ul>
</p></li></ol>
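<p>When increasing <strong>zk.quota.number</strong>, it can help to estimate how large the new value needs to be. The sketch below assumes that usage is the Znode count divided by the quota and that the alarm clears once usage is at or below the threshold. The function name <code>min_quota_to_clear</code> and the <code>headroom_percent</code> parameter are hypothetical and do not correspond to any product setting.</p>

```python
def min_quota_to_clear(znode_count: int, threshold_percent: int,
                       headroom_percent: int = 20) -> int:
    """Smallest quota that keeps usage at or below the threshold.

    Assumptions (not stated in the document): usage = count / quota,
    and the alarm clears when usage <= threshold. headroom_percent
    reserves room for growth in the number of Znodes.
    """
    if not 0 < threshold_percent <= 100:
        raise ValueError("threshold_percent must be in the range 1..100")
    if znode_count < 0 or headroom_percent < 0:
        raise ValueError("znode_count and headroom_percent must be non-negative")
    # Integer ceiling division avoids floating-point rounding surprises.
    projected = znode_count * (100 + headroom_percent)
    return -(-projected // threshold_percent)


if __name__ == "__main__":
    # Hypothetical numbers: 9000 Znodes, 80% threshold, 20% growth headroom.
    print(min_quota_to_clear(9000, 80))
```

<p>For example, with 9000 Znodes, an 80% threshold, and 20% headroom, this estimate suggests a quota of at least 13500. Treat the result as a starting point and confirm it against the actual growth of the directory.</p>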
<p class="tableheading" id="ALM-13010__p4863846161727"><strong id="ALM-13010__b60263252161739">Collect fault information.</strong></p>
<ol start="7" id="ALM-13010__ol2978112319358"><li id="ALM-13010__li13978523123518"><a name="ALM-13010__li13978523123518"></a><a name="li13978523123518"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-13010__b199782236356">O&amp;M</strong> &gt; <strong id="ALM-13010__b9978142313355">Log &gt; Download</strong>.</span></li><li id="ALM-13010__li20978323143519"><span>Under <strong id="ALM-13010__b197814238358">Service</strong>, select <strong id="ALM-13010__b1297816232358">ZooKeeper</strong> for the required cluster.</span></li><li id="ALM-13010__li13978152383510"><span>Click <span><img id="ALM-13010__image1997852333516" src="en-us_image_0269383956.png"></span> in the upper right corner, and set <strong id="ALM-13010__b1978132311354">Start Date</strong> and <strong id="ALM-13010__b129788231358">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-13010__b13978172333519">Download</strong>.</span></li><li id="ALM-13010__li10978202313518"><span>Contact <span id="ALM-13010__text4614151421417">O&amp;M personnel</span> and send them the collected logs.</span></li></ol>
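<p>The log collection window from the steps above can be computed as shown in this small sketch. The function name <code>log_collection_window</code> is hypothetical, and the alarm time used in the example is made up for illustration.</p>

```python
from datetime import datetime, timedelta


def log_collection_window(alarm_time: datetime, margin_minutes: int = 10):
    """Return (start, end) covering margin_minutes before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


if __name__ == "__main__":
    # Hypothetical alarm generation time.
    start, end = log_collection_window(datetime(2024, 1, 1, 12, 0, 0))
    print("Start Date:", start)
    print("End Date:  ", end)
```

<p>The two returned values correspond to the <strong>Start Date</strong> and <strong>End Date</strong> fields in the log download dialog.</p>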
</div>
<div class="section" id="ALM-13010__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-13010__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-13010__sb2eb8883fb1940d0b05b690215576d2e"><h4 class="sectiontitle">Related Information</h4><p id="ALM-13010__en-us_topic_0070543636_p64481034">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>