doc-exports/docs/mrs/umn/ALM-29016.html
Yang, Tong 5914b67d13 MRS UMN Doc 20240802 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-09-28 19:04:58 +00:00


<a name="ALM-29016"></a>
<h1 class="topictitle1">ALM-29016 Impalad Instance in the Sub-healthy State</h1>
<div id="body37584996"><div class="section" id="ALM-29016__section3980122974317"><h4 class="sectiontitle">Alarm Description</h4><p id="ALM-29016__p1598018293434">In MRS 3.1.5, the system checks every 60 seconds whether the Hive Server2 HTTP port (28000) of Impalad responds to cURL requests. This alarm is generated when the port fails to return a correct response within 20 seconds in two consecutive checks. This alarm is cleared when the port returns a correct response within 20 seconds.</p>
<p id="ALM-29016__p15052028153413">In other MRS versions, the system checks every 60 seconds whether Impalad can execute <strong id="ALM-29016__b345731318319">select 1</strong>. This alarm is generated when the statement fails to return a correct result within 20 seconds in two consecutive checks. This alarm is cleared when the SQL statement is correctly executed within 20 seconds.</p>
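<p>The raise and clear rule described above can be modeled as a small state tracker. The following is a minimal sketch only, not the actual MRS implementation: the class name <strong>AlarmTracker</strong> and its fields are hypothetical, and the 60-second scheduling and 20-second timeout are assumed to be handled by whatever produces each check result.</p>

```python
from dataclasses import dataclass

FAILURES_TO_ALARM = 2  # from the text: two consecutive failed checks


@dataclass
class AlarmTracker:
    """Hypothetical model of the raise/clear rule; not the actual MRS code."""
    consecutive_failures: int = 0
    alarm_active: bool = False

    def record(self, check_ok: bool) -> bool:
        """Feed one check result and return whether the alarm is active.

        check_ok is True when the check returned a correct result
        within the 20-second limit, and False otherwise.
        """
        if check_ok:
            # One correct response within the limit clears the alarm.
            self.consecutive_failures = 0
            self.alarm_active = False
        else:
            self.consecutive_failures += 1
            if self.consecutive_failures >= FAILURES_TO_ALARM:
                self.alarm_active = True
        return self.alarm_active


# Example run: one isolated failure does not raise the alarm,
# two failures in a row do, and the next success clears it.
tracker = AlarmTracker()
history = [tracker.record(ok) for ok in (True, False, True, False, False, True)]
print(history)
```

<p>In this sketch a single failed check never raises the alarm on its own, which matches the requirement for two consecutive failures.</p>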
</div>
<div class="section" id="ALM-29016__section19801296431"><h4 class="sectiontitle">Alarm Attributes</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-29016__table8980102912435" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-29016__row159801129114318"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-29016__p1498062964312">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-29016__p89801029144317">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-29016__p09807291430">Auto Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-29016__row598020294439"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-29016__p398012911433">29016</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-29016__p1798032914435">Minor</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-29016__p1098052917438">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-29016__section1498172914436"><h4 class="sectiontitle">Alarm Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-29016__table1498152994320" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-29016__row1098142911430"><th align="left" class="cellrowborder" valign="top" width="20%" id="mcps1.3.3.2.1.4.1.1"><p id="ALM-29016__p13622174219387">Type</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="20%" id="mcps1.3.3.2.1.4.1.2"><p id="ALM-29016__p9981112918439">Parameter</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="60%" id="mcps1.3.3.2.1.4.1.3"><p id="ALM-29016__p1498117299431">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-29016__row16981132912430"><td class="cellrowborder" rowspan="4" valign="top" width="20%" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-29016__p8242195393817">Location Information</p>
</td>
<td class="cellrowborder" valign="top" width="20%" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-29016__p398172954315">Source</p>
</td>
<td class="cellrowborder" valign="top" width="60%" headers="mcps1.3.3.2.1.4.1.3 "><p id="ALM-29016__p1498152910435">Specifies the cluster for which the alarm was generated.</p>
</td>
</tr>
<tr id="ALM-29016__row298132919438"><td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-29016__p1298114291434">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-29016__p7981162911438">Specifies the service for which the alarm was generated.</p>
</td>
</tr>
<tr id="ALM-29016__row10981172954311"><td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-29016__p598111297437">RoleName</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-29016__p79811629144314">Specifies the role for which the alarm was generated.</p>
</td>
</tr>
<tr id="ALM-29016__row1898182934315"><td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-29016__p098115293435">HostName</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-29016__p1298172934317">Specifies the host for which the alarm was generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-29016__section1798112919439"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-29016__p898142994317">Impalad cannot execute SQL statements, or SQL statement execution times out, which affects data reads and writes.</p>
</div>
<div class="section" id="ALM-29016__section29811829134319"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-29016__p39811229174312">The Impalad instance is handling too many queries.</p>
</div>
<div class="section" id="ALM-29016__section6981172914311"><h4 class="sectiontitle">Handling Procedure</h4><ol id="ALM-29016__ol109821829164313"><li id="ALM-29016__li10982129144318"><span>Log in to FusionInsight Manager and choose <strong id="ALM-29016__b162912408151">Cluster</strong> &gt; <strong id="ALM-29016__b136931429157">Services</strong> &gt; <strong id="ALM-29016__b5510124691519">Impala</strong> &gt; <strong id="ALM-29016__b48111548171515">Impalad Web UI</strong>. On the displayed page, click any node to go to the web UI.</span></li><li id="ALM-29016__li19622101341418"><span>On the web UI, click <strong id="ALM-29016__b124811016171614">/backends</strong> to view the Impala instance list. Locate the instance for which the alarm is generated and click <strong id="ALM-29016__b169183211619">Web UI</strong>. After the web UI of the sub-healthy node is displayed, click <strong id="ALM-29016__b11697151542219">/queries</strong> to check the task execution status and check whether any task is running slowly.</span><p><ul id="ALM-29016__ul1279633192116"><li id="ALM-29016__li5796435213">If yes, go to <a href="#ALM-29016__li918651451111">3</a>.</li><li id="ALM-29016__li157961332213">If no, go to <a href="#ALM-29016__li668151171315">4</a>.</li></ul>
</p></li><li id="ALM-29016__li918651451111"><a name="ALM-29016__li918651451111"></a><a name="li918651451111"></a><span>After the task is complete, check whether the alarm is cleared.</span><p><ul id="ALM-29016__ul122729421179"><li id="ALM-29016__li13272242121710">If yes, no further action is required.</li><li id="ALM-29016__li17888154471713">If no, go to <a href="#ALM-29016__li668151171315">4</a>.</li></ul>
</p></li><li id="ALM-29016__li668151171315"><a name="ALM-29016__li668151171315"></a><a name="li668151171315"></a><span>On FusionInsight Manager, choose <strong id="ALM-29016__b75927321254">Cluster</strong> &gt; <strong id="ALM-29016__b321002334112">Services</strong> &gt; <strong id="ALM-29016__b125597344517">Impala</strong> &gt; <strong id="ALM-29016__b82444401956">Instances</strong>, select the Impala instance for which the alarm is generated, click <strong id="ALM-29016__b47915574404">More</strong>, and select <strong id="ALM-29016__b1219619624110">Restart Instance</strong>. Then, check whether the alarm is cleared.</span><p><ul id="ALM-29016__ul20421612152818"><li id="ALM-29016__li1942141232810">If yes, no further action is required.</li><li id="ALM-29016__li542131217283">If no, go to <a href="#ALM-29016__li1698242954313">5</a>.<div class="note" id="ALM-29016__note220918329390"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-29016__en-us_topic_0294848303_p0481144124012">The service will become unavailable when all instances are restarted. If a single instance is restarted, the tasks that are being executed on that instance will fail, but the service will remain available.</p>
</div></div>
</li></ul>
</p></li></ol>
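<p>The branching in steps 2 to 4 can be summarized as a short decision function. This is an illustrative sketch only: the function name <strong>handling_path</strong> and its three boolean inputs are hypothetical and stand for what the operator observes on the web UI and on FusionInsight Manager.</p>

```python
def handling_path(slow_task_found, cleared_after_task, cleared_after_restart):
    """Return the procedure step numbers visited, in order.

    All three arguments are operator observations (hypothetical inputs):
      slow_task_found       - step 2 found a slow task on the /queries page
      cleared_after_task    - the alarm cleared once that task completed
      cleared_after_restart - the alarm cleared after restarting the instance
    """
    path = [1, 2]
    if slow_task_found:
        path.append(3)  # wait for the task to complete, then recheck the alarm
        if cleared_after_task:
            return path
    path.append(4)  # restart the affected Impalad instance
    if cleared_after_restart:
        return path
    path.append(5)  # move on to collecting fault information
    return path


print(handling_path(True, False, True))
print(handling_path(False, False, False))
```

<p>A path that ends at step 5 means the alarm persisted after the restart and the logs need to be collected.</p>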
<p class="tableheading" id="ALM-29016__p39821129144316"><strong id="ALM-29016__b383702116612">Collect fault information.</strong></p>
<ol start="5" id="ALM-29016__ol189821329134317"><li id="ALM-29016__li1698242954313"><a name="ALM-29016__li1698242954313"></a><a name="li1698242954313"></a><span>On FusionInsight Manager of the active or standby cluster, choose <strong id="ALM-29016__b4251224665">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-29016__b112513241267">Log</strong> &gt; <strong id="ALM-29016__b6261241861">Download</strong>.</span></li><li id="ALM-29016__li27049781154249"><span>Expand the <strong id="ALM-29016__b638117322617">Service</strong> drop-down list, and select <strong id="ALM-29016__b1738115324611">Impala</strong> for the target cluster.</span></li><li id="ALM-29016__li1498212919436"><span>Click <span><img id="ALM-29016__image2098272984311" src="en-us_image_0000002007530509.png"></span> in the upper right corner, and set <strong id="ALM-29016__b07561599719">Start Date</strong> and <strong id="ALM-29016__b87561697718">End Date</strong> for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-29016__b19756498719">Download</strong>.</span></li><li id="ALM-29016__li56393916154249"><span>Contact <span id="ALM-29016__text3528218674">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
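<p>The log collection window in the steps above is simple date arithmetic around the alarm generation time. The sketch below shows one way to compute it. The function name <strong>log_window</strong> and the sample alarm time are hypothetical and used only for illustration.</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time, margin_minutes=10):
    """Return (start, end) for log collection around the alarm time.

    The 10-minute margin on each side comes from the procedure text.
    """
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, for illustration only.
alarm_time = datetime(2024, 8, 2, 14, 5)
start, end = log_window(alarm_time)
print("Start Date:", start)
print("End Date:  ", end)
```

<p>The two returned values correspond to the <strong>Start Date</strong> and <strong>End Date</strong> fields on the log download page.</p>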
</div>
<div class="section" id="ALM-29016__section19982122910436"><h4 class="sectiontitle">Alarm Clearance</h4><p id="ALM-29016__p149821529154312">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-29016__section1298211296431"><h4 class="sectiontitle">Related Information</h4><p id="ALM-29016__p12982152913438">None</p>
</div>
<p id="ALM-29016__p8060118"></p>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>