doc-exports/docs/mrs/umn/ALM-26052.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

89 lines
11 KiB
HTML

<a name="ALM-26052"></a><a name="ALM-26052"></a>
<h1 class="topictitle1">ALM-26052 Number of Available Supervisors of the Storm Service Is Less Than the Threshold</h1>
<div id="body62862015"><div class="section" id="ALM-26052__s07e020dcc69b403f856ec6a3bd7d7c3b"><h4 class="sectiontitle">Description</h4><p id="ALM-26052__en-us_topic_0070543551_p39980785">The system periodically checks the number of available Supervisors every 60 seconds and compares the number of available Supervisors with the threshold. This alarm is generated when the number of available Supervisors is less than the threshold.</p>
<p id="ALM-26052__en-us_topic_0070543551_p24282745">You can change the threshold in <strong id="ALM-26052__b10802443175011">O&amp;M</strong> &gt; <strong id="ALM-26052__b1387783920502">Alarm </strong>&gt;<strong id="ALM-26052__b19880113913508"> Thresholds</strong><strong id="ALM-26052__b960052173719"> </strong>&gt; <em id="ALM-26052__i196009263716">Name of the desired cluster</em>.</p>
<p id="ALM-26052__en-us_topic_0070543551_p52490193">This alarm is cleared when the number of available Supervisors is greater than or equal to the threshold.</p>
</div>
<div class="section" id="ALM-26052__sb5a66ba4e70c4e2885be46d44d0e7eb3"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-26052__en-us_topic_0070543551_table23847230" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-26052__en-us_topic_0070543551_row21151959"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-26052__en-us_topic_0070543551_p35587158">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-26052__en-us_topic_0070543551_p63987582">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-26052__en-us_topic_0070543551_p15611638">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-26052__en-us_topic_0070543551_row56583142"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-26052__en-us_topic_0070543551_p19831799">26052</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-26052__en-us_topic_0070543551_p62871888">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-26052__en-us_topic_0070543551_p59458202">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-26052__sa7c6534f55ff4818af632d10c7280545"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-26052__en-us_topic_0070543551_table51385085" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-26052__en-us_topic_0070543551_row44830087"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-26052__en-us_topic_0070543551_p7358467">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-26052__en-us_topic_0070543551_p59164922">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-26052__row117430521135"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26052__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26052__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26052__en-us_topic_0070543551_row27629398"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26052__en-us_topic_0070543551_p23388799">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26052__en-us_topic_0070543551_p15444572">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26052__en-us_topic_0070543551_row4783426"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26052__en-us_topic_0070543551_p51913225">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26052__en-us_topic_0070543551_p44221688">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26052__en-us_topic_0070543551_row62450879"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26052__en-us_topic_0070543551_p25356414">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26052__en-us_topic_0070543551_p40603642">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-26052__en-us_topic_0070543551_row29888458"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-26052__en-us_topic_0070543551_p5045995">Trigger condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-26052__en-us_topic_0070543551_p6072445">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-26052__s9fb881b9ace34a08b012c1b45892a355"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-26052__en-us_topic_0070543551_p22106023">Existing tasks in the cluster cannot be performed. The cluster can receive new Storm tasks, but cannot perform these tasks.</p>
</div>
<div class="section" id="ALM-26052__s378118b76e674dc39843ed0f0dda75aa"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-26052__en-us_topic_0070543551_p45757458">The status of some Supervisors in the cluster is abnormal.</p>
</div>
<div class="section" id="ALM-26052__s00b78d84d8ad488ab9a2282ded5d0cc0"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-26052__en-us_topic_0070543551_p15366630"><strong id="ALM-26052__b1900893201042">Check the Supervisor status.</strong></p>
<ol id="ALM-26052__ol65264691201056"><li id="ALM-26052__li30858469201046"><span>Choose <strong id="ALM-26052__b141431002515">Cluster </strong>&gt; <em id="ALM-26052__i1568124763819">Name of the desired cluster</em> &gt;<strong id="ALM-26052__b1866114717389"> Services</strong> &gt; <strong id="ALM-26052__b20265076201046">Storm</strong> &gt; <strong id="ALM-26052__b48167961201046">Supervisor</strong> to go to the Storm service management page.</span></li><li id="ALM-26052__li15295743201046"><span>In <strong id="ALM-26052__b9290769201046">Roles</strong>, check whether any instance whose status is <strong id="ALM-26052__b143721433753">Faulty</strong> or <strong id="ALM-26052__b770144991019">Restoring</strong> exists.</span><p><ul class="subitemlist" id="ALM-26052__ul53895310201046"><li id="ALM-26052__li21891215201046">If yes, go to <a href="#ALM-26052__li14723901201046">3</a>.</li><li id="ALM-26052__li28357989201046">If no, go to <a href="#ALM-26052__li59911910201046">5</a>.</li></ul>
</p></li><li id="ALM-26052__li14723901201046"><a name="ALM-26052__li14723901201046"></a><a name="li14723901201046"></a><span>Select Supervisor role instances whose status is <strong id="ALM-26052__b208021340555">Faulty</strong> or <strong id="ALM-26052__b5249256172516">Restoring</strong>, choose <strong id="ALM-26052__b10525429201046">More</strong> &gt; <strong id="ALM-26052__b27620001201046">Restart Instance</strong>, and check whether the instances restart successfully.</span><p><ul class="subitemlist" id="ALM-26052__ul53831772201046"><li id="ALM-26052__li22627576201046">If yes, go to <a href="#ALM-26052__li58537778201046">4</a>.</li><li id="ALM-26052__li20894388201046">If no, go to <a href="#ALM-26052__li59911910201046">5</a>.</li></ul>
</p></li><li id="ALM-26052__li58537778201046"><a name="ALM-26052__li58537778201046"></a><a name="li58537778201046"></a><span>Wait for 30 seconds, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-26052__ul33862868201046"><li id="ALM-26052__li65406250201046">If yes, no further action is required.</li><li id="ALM-26052__li63414864201046">If no, go to <a href="#ALM-26052__li59911910201046">5</a>.<div class="note" id="ALM-26052__note19132031191414"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-26052__p10852103118145">Services are interrupted when the Supervisor is being restarted. Then, services are restored after the restarting.</p>
</div></div>
</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-26052__p36330359201046"><strong id="ALM-26052__b54615003201050">Collect fault information.</strong></p>
<ol start="5" id="ALM-26052__ol3255669820111"><li id="ALM-26052__li59911910201046"><a name="ALM-26052__li59911910201046"></a><a name="li59911910201046"></a><span>On the FusionInsight Manager portal, choose <strong id="ALM-26052__b846711495419">O&amp;M</strong> &gt; <strong id="ALM-26052__b56270121543">Log </strong>&gt;<strong id="ALM-26052__b562714122540"> Download</strong>.</span></li><li id="ALM-26052__li27540600201046"><span>Select <strong id="ALM-26052__b2336284201046">Storm</strong> and <strong id="ALM-26052__b21026559201046">ZooKeeper</strong> in the required cluster from the <strong id="ALM-26052__b55021304201046">Service</strong> drop-down list box.</span></li><li id="ALM-26052__li1145664103113"><span>Click <span><img id="ALM-26052__image1945644173117" src="en-us_image_0269417461.png"></span> in the upper right corner, and set <strong id="ALM-26052__b6456941173117">Start Date</strong> and <strong id="ALM-26052__b11456154113318">End Date</strong> for log collection to 1 hour ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-26052__b13456164113319">Download</strong>.</span></li><li id="ALM-26052__li29285731201046"><span>Contact the <span id="ALM-26052__text4614151421417">O&amp;M personnel</span> and send the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-26052__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-26052__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-26052__s20acd72ed0ee40d1bac5a5a8ba24e0cf"><h4 class="sectiontitle">Related Information</h4><p id="ALM-26052__en-us_topic_0070543551_p15218414">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>