Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="alm_16004"></a><a name="alm_16004"></a>
<h1 class="topictitle1">ALM-16004 Hive Service Unavailable</h1>
<div id="body8662426"><div class="section" id="alm_16004__en-us_topic_0191813910_section28799665"><h4 class="sectiontitle">Description</h4><p id="alm_16004__en-us_topic_0191813910_p30494806">The system checks the Hive service status every 30 seconds. This alarm is generated when the Hive service is unavailable.</p>
<p id="alm_16004__en-us_topic_0191813910_p6017800">This alarm is cleared when the Hive service recovers.</p>
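<p>The generate-and-clear behavior described above can be sketched as a small state machine. This is only an illustration, not the MRS implementation: the function name <strong>alarm_events</strong> and the idea of feeding it a list of availability samples (one per status check) are assumptions made for the example.</p>
<pre class="screen">
```python
from typing import Iterable, List


def alarm_events(samples: Iterable[bool]) -> List[str]:
    """Turn periodic availability samples into alarm events.

    Each sample stands for one status check (the text above states one
    check every 30 seconds). True means the Hive service was available.
    The function name and input format are hypothetical.
    """
    events = []
    active = False  # whether the alarm is currently raised
    for available in samples:
        if not available and not active:
            # Service became unavailable: the alarm is generated once.
            events.append("generated")
            active = True
        elif available and active:
            # Service recovered: the alarm is cleared automatically.
            events.append("cleared")
            active = False
    return events


print(alarm_events([True, False, False, True, True, False]))
```
</pre>
<p>Consecutive failed checks do not produce repeated alarms in this sketch; a new event appears only when the availability state changes.</p>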
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section57870399"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_16004__en-us_topic_0191813910_table22774600" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_16004__en-us_topic_0191813910_row4640007"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="alm_16004__en-us_topic_0191813910_p40296320">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="alm_16004__en-us_topic_0191813910_p42776493">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="alm_16004__en-us_topic_0191813910_p42343927">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="alm_16004__en-us_topic_0191813910_row7306053"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="alm_16004__en-us_topic_0191813910_p54919424">16004</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="alm_16004__en-us_topic_0191813910_p19288335">Critical</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="alm_16004__en-us_topic_0191813910_p18851291">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section51071544"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_16004__en-us_topic_0191813910_table50559563" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_16004__en-us_topic_0191813910_row19612160"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="alm_16004__en-us_topic_0191813910_p45081147">Parameter</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="alm_16004__en-us_topic_0191813910_p27694303">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="alm_16004__en-us_topic_0191813910_row28646034"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_16004__en-us_topic_0191813910_p38627455">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_16004__en-us_topic_0191813910_p41816140">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_16004__en-us_topic_0191813910_row40800942"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_16004__en-us_topic_0191813910_p16541999">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_16004__en-us_topic_0191813910_p64833508">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_16004__en-us_topic_0191813910_row46630661"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_16004__en-us_topic_0191813910_p18987236">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_16004__en-us_topic_0191813910_p61571180">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section56990719"><h4 class="sectiontitle">Impact on the System</h4><p id="alm_16004__en-us_topic_0191813910_p21209668">The system cannot provide data loading, query, and extraction services.</p>
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section43154428"><h4 class="sectiontitle">Possible Causes</h4><ul id="alm_16004__en-us_topic_0191813910_ul40261524"><li id="alm_16004__en-us_topic_0191813910_li26809401">A basic service (ZooKeeper, HDFS, Yarn, or DBService) is abnormal, or the Hive process is faulty.<ul id="alm_16004__en-us_topic_0191813910_ul39958021"><li id="alm_16004__en-us_topic_0191813910_li24077873">ZooKeeper is abnormal.</li><li id="alm_16004__en-us_topic_0191813910_li15374270">HDFS is abnormal.</li><li id="alm_16004__en-us_topic_0191813910_li4150710">Yarn is abnormal.</li><li id="alm_16004__en-us_topic_0191813910_li37356392">DBService is abnormal.</li><li id="alm_16004__en-us_topic_0191813910_li663215">The Hive service process is faulty. If the alarm is caused by a Hive process fault, the alarm is reported with a delay of about 5 minutes.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li5968939">The network communication between the Hive service and basic services is interrupted.</li></ul>
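<p>The Procedure section of this topic looks for a specific alarm for each basic service listed above. As a rough aid, that mapping can be sketched as follows. The function name <strong>suspect_services</strong> and the input format (alarm ID and ServiceName pairs copied by hand from the alarm list) are assumptions made for this example and are not an MRS API.</p>
<pre class="screen">
```python
from typing import Dict, List, Tuple

# Alarm IDs taken from the Procedure section of this topic.
DEPENDENCY_ALARMS: Dict[str, str] = {
    "ALM-14000": "HDFS",
    "ALM-18000": "Yarn",
    "ALM-27001": "DBService",
}


def suspect_services(active_alarms: List[Tuple[str, str]]) -> List[str]:
    """Return the basic services to investigate, in procedure order.

    active_alarms is a hypothetical list of (alarm_id, service_name)
    pairs copied by hand from the alarm list.
    """
    found = set()
    for alarm_id, service_name in active_alarms:
        # ALM-12007 Process Fault points at ZooKeeper only when its
        # ServiceName parameter is ZooKeeper.
        if alarm_id == "ALM-12007" and service_name == "ZooKeeper":
            found.add("ZooKeeper")
        elif alarm_id in DEPENDENCY_ALARMS:
            found.add(DEPENDENCY_ALARMS[alarm_id])
    order = ["ZooKeeper", "HDFS", "Yarn", "DBService"]
    return [name for name in order if name in found]


print(suspect_services([("ALM-18000", "Yarn"), ("ALM-12007", "ZooKeeper")]))
```
</pre>
<p>An empty result suggests that none of these dependency alarms is present, in which case the Hive process itself or the network connection remains to be checked.</p>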
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section52845536"><h4 class="sectiontitle">Procedure</h4><ol id="alm_16004__en-us_topic_0191813910_ol53142387153452"><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li14469044153452"><span>Check the HiveServer/MetaStore process status.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol28274331153452"><li id="alm_16004__en-us_topic_0191813910_li51692872">Go to the MRS cluster details page and click <strong id="alm_16004__b1686714454550">Components</strong>.<div class="note" id="alm_16004__en-us_topic_0191813910_note161357103467"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="alm_16004__en-us_topic_0191813910_p01361310104613">For MRS 1.7.2 or earlier, log in to MRS Manager and click <strong id="alm_16004__b4154536919">Services</strong>.</p>
</div></div>
</li><li id="alm_16004__en-us_topic_0191813910_li53944325153452">Choose <strong id="alm_16004__b15771252135514">Hive</strong> > <strong id="alm_16004__b125817524559">Instances</strong>. In the Hive instance list, check whether the status of all HiveServer/MetaStore instances is <strong id="alm_16004__b105816525551">Unknown</strong>.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul58189597153452"><li id="alm_16004__en-us_topic_0191813910_li62401770153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li15736882153452">1.c</a>.</li><li id="alm_16004__en-us_topic_0191813910_li21378591153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li63276134153458">2</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li15736882153452"><a name="alm_16004__en-us_topic_0191813910_li15736882153452"></a><a name="en-us_topic_0191813910_li15736882153452"></a>Above the Hive instance list, choose <strong id="alm_16004__b124536317564">More</strong> > <strong id="alm_16004__b19453163165618">Restart Instance</strong> to restart the HiveServer/MetaStore process.</li><li id="alm_16004__en-us_topic_0191813910_li50683708153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul36251418153452"><li id="alm_16004__en-us_topic_0191813910_li7414211153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li63680258153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li63276134153458">2</a>.</li></ul>
</li></ol>
</p></li><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li63276134153458"><a name="alm_16004__en-us_topic_0191813910_li63276134153458"></a><a name="en-us_topic_0191813910_li63276134153458"></a><span>Check the ZooKeeper status.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol4775140417310"><li id="alm_16004__en-us_topic_0191813910_li1487713813414">Go to the cluster details page and choose <strong id="alm_16004__b31961311020">Alarms</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li50303679153452">On MRS Manager, check whether the ALM-12007 Process Fault alarm is reported.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul35415459153452"><li id="alm_16004__en-us_topic_0191813910_li11739699153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li17867059153452">2.c</a>.</li><li id="alm_16004__en-us_topic_0191813910_li11391591153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li315441715352">3</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li17867059153452"><a name="alm_16004__en-us_topic_0191813910_li17867059153452"></a><a name="en-us_topic_0191813910_li17867059153452"></a>In the <strong id="alm_16004__b1251812362024">Alarm Details</strong> area of ALM-12007 Process Fault, check whether <strong id="alm_16004__b18518536321">ServiceName</strong> is <strong id="alm_16004__b12518103617213">ZooKeeper</strong>.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul9441769153452"><li id="alm_16004__en-us_topic_0191813910_li48066212153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li26585804153452">2.d</a>.</li><li id="alm_16004__en-us_topic_0191813910_li1049085153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li315441715352">3</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li26585804153452"><a name="alm_16004__en-us_topic_0191813910_li26585804153452"></a><a name="en-us_topic_0191813910_li26585804153452"></a>Rectify the fault by following the steps provided in ALM-12007 Process Fault.</li><li id="alm_16004__en-us_topic_0191813910_li21657095153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul13523443153452"><li id="alm_16004__en-us_topic_0191813910_li37945645153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li53698387153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li315441715352">3</a>.</li></ul>
</li></ol>
</p></li><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li315441715352"><a name="alm_16004__en-us_topic_0191813910_li315441715352"></a><a name="en-us_topic_0191813910_li315441715352"></a><span>Check the HDFS status.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol3323556817310"><li id="alm_16004__en-us_topic_0191813910_li142374209515">Go to the cluster details page and choose <strong id="alm_16004__b21455165618">Alarms</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li30070184153452">In the alarm list, check whether the alarm ALM-14000 HDFS Service Unavailable exists.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul3341131153452"><li id="alm_16004__en-us_topic_0191813910_li9394300153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li2196200153452">3.c</a>.</li><li id="alm_16004__en-us_topic_0191813910_li22740858153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li3789476315357">4</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li2196200153452"><a name="alm_16004__en-us_topic_0191813910_li2196200153452"></a><a name="en-us_topic_0191813910_li2196200153452"></a>Rectify the fault by following the steps provided in ALM-14000 HDFS Service Unavailable.</li><li id="alm_16004__en-us_topic_0191813910_li60853387153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul47976032153452"><li id="alm_16004__en-us_topic_0191813910_li19765806153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li57526453153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li3789476315357">4</a>.</li></ul>
</li></ol>
</p></li><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li3789476315357"><a name="alm_16004__en-us_topic_0191813910_li3789476315357"></a><a name="en-us_topic_0191813910_li3789476315357"></a><span>Check the Yarn status.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol375865117310"><li id="alm_16004__en-us_topic_0191813910_li39183275519">Go to the cluster details page and choose <strong id="alm_16004__b526412551761">Alarms</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li22053158153452">In the alarm list on MRS Manager, check whether the alarm ALM-18000 Yarn Service Unavailable is generated.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul54646134153452"><li id="alm_16004__en-us_topic_0191813910_li30177290153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li64260695153452">4.c</a>.</li><li id="alm_16004__en-us_topic_0191813910_li28441414153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li28516824153512">5</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li64260695153452"><a name="alm_16004__en-us_topic_0191813910_li64260695153452"></a><a name="en-us_topic_0191813910_li64260695153452"></a>Rectify the fault by following the steps provided in ALM-18000 Yarn Service Unavailable.</li><li id="alm_16004__en-us_topic_0191813910_li6871230153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul36539027153452"><li id="alm_16004__en-us_topic_0191813910_li41475346153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li4059891153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li28516824153512">5</a>.</li></ul>
</li></ol>
</p></li><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li28516824153512"><span>Check the DBService status.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol1262218617310"><li id="alm_16004__en-us_topic_0191813910_li1593312351458">Go to the cluster details page and choose <strong id="alm_16004__b747916147817">Alarms</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li59067090153452">In the alarm list on MRS Manager, check whether ALM-27001 DBService Is Unavailable is generated.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul66215333153452"><li id="alm_16004__en-us_topic_0191813910_li19698770153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li19704975153452">5.c</a>.</li><li id="alm_16004__en-us_topic_0191813910_li52096501153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li23165657153517">6</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li19704975153452"><a name="alm_16004__en-us_topic_0191813910_li19704975153452"></a><a name="en-us_topic_0191813910_li19704975153452"></a>Rectify the fault by following the handling procedure in <a href="alm_27001.html">ALM-27001 DBService Is Unavailable</a>.</li><li id="alm_16004__en-us_topic_0191813910_li29078296153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul32670666153452"><li id="alm_16004__en-us_topic_0191813910_li43127049153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li3630074153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li23165657153517">6</a>.</li></ul>
</li></ol>
</p></li><li class="tableheading" id="alm_16004__en-us_topic_0191813910_li23165657153517"><a name="alm_16004__en-us_topic_0191813910_li23165657153517"></a><a name="en-us_topic_0191813910_li23165657153517"></a><span>Check the network connection between Hive and ZooKeeper, HDFS, Yarn, and DBService.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_ol950707617310"><li id="alm_16004__en-us_topic_0191813910_li154943451759">Go to the MRS cluster details page and click <strong id="alm_16004__b158432409918">Components</strong>.<div class="note" id="alm_16004__en-us_topic_0191813910_note104951457520"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="alm_16004__en-us_topic_0191813910_p18495445658">For MRS 1.7.2 or earlier, log in to MRS Manager and click <strong id="alm_16004__b164871361090">Services</strong>.</p>
</div></div>
</li><li id="alm_16004__en-us_topic_0191813910_li60378080153452">Click <strong id="alm_16004__b13138124410915">Hive</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li58786343153452">Click <strong id="alm_16004__b85041745590">Instances</strong>.<p class="litext" id="alm_16004__en-us_topic_0191813910_p6531815153452">The HiveServer instance list is displayed.</p>
</li><li id="alm_16004__en-us_topic_0191813910_li64073305153452">Click <strong id="alm_16004__b16353162061111">Host Name</strong> in the row of <strong id="alm_16004__b9354162012118">HiveServer</strong>.<p class="litext" id="alm_16004__en-us_topic_0191813910_p59315039153452">The HiveServer host status page is displayed.</p>
</li><li id="alm_16004__en-us_topic_0191813910_li39788839153452"><a name="alm_16004__en-us_topic_0191813910_li39788839153452"></a><a name="en-us_topic_0191813910_li39788839153452"></a>Record the IP address under <strong id="alm_16004__b111893245112">Summary</strong>.</li><li id="alm_16004__en-us_topic_0191813910_li15034532153452">Use the IP address obtained in <a href="#alm_16004__en-us_topic_0191813910_li39788839153452">6.e</a> to log in to the host where HiveServer is located.</li><li id="alm_16004__en-us_topic_0191813910_li4973502153452">Run the <strong id="alm_16004__b1939216349110">ping</strong> command to check whether the network connection between the host that runs HiveServer and the hosts that run the ZooKeeper, HDFS, Yarn, and DBService services is normal. Obtain the IP addresses of the hosts that run ZooKeeper, HDFS, Yarn, and DBService in the same way as the HiveServer IP address.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul52748394153452"><li id="alm_16004__en-us_topic_0191813910_li21429361153452">If yes, go to <a href="#alm_16004__en-us_topic_0191813910_li572522141314">7</a>.</li><li id="alm_16004__en-us_topic_0191813910_li58056715153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li44761520153452">6.h</a>.</li></ul>
</li><li id="alm_16004__en-us_topic_0191813910_li44761520153452"><a name="alm_16004__en-us_topic_0191813910_li44761520153452"></a><a name="en-us_topic_0191813910_li44761520153452"></a>Contact the O&M personnel to restore the network.</li><li id="alm_16004__en-us_topic_0191813910_li40393336153452">In the alarm list, check whether ALM-16004 Hive Service Unavailable is cleared.<ul class="subitemlist" id="alm_16004__en-us_topic_0191813910_ul11944688153452"><li id="alm_16004__en-us_topic_0191813910_li200497153452">If yes, no further action is required.</li><li id="alm_16004__en-us_topic_0191813910_li16240268153452">If no, go to <a href="#alm_16004__en-us_topic_0191813910_li572522141314">7</a>.</li></ul>
</li></ol>
</p></li><li id="alm_16004__en-us_topic_0191813910_li572522141314"><a name="alm_16004__en-us_topic_0191813910_li572522141314"></a><a name="en-us_topic_0191813910_li572522141314"></a><span>Collect fault information.</span><p><ol type="a" id="alm_16004__en-us_topic_0191813910_en-us_topic_0191813935_ol6089206913036"><li id="alm_16004__en-us_topic_0191813910_en-us_topic_0191813935_li4478836213036">On MRS Manager, choose <strong id="alm_16004__b79171017201217">System</strong> > <strong id="alm_16004__b1492319171121">Export Log</strong>.</li><li id="alm_16004__li18574327401">Contact technical support engineers for help. For details, see <a href="https://docs.otc.t-systems.com/en-us/public/learnmore.html" target="_blank" rel="noopener noreferrer">technical support</a>.</li></ol>
</p></li></ol>
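<p>The <strong>ping</strong> check in step 6 can be scripted when several hosts need to be tested from the HiveServer host. The following is a minimal sketch, assuming a Linux host whose <strong>ping</strong> command accepts the <strong>-c</strong> (count) and <strong>-W</strong> (timeout in seconds) options. The helper names <strong>ping_once</strong> and <strong>unreachable_hosts</strong> are hypothetical, and the IP addresses in the comment are placeholders to be replaced with the addresses recorded on the host status pages.</p>
<pre class="screen">
```python
import subprocess
from typing import Callable, Dict, List


def ping_once(host: str) -> bool:
    """Return True if a single ICMP echo to host succeeds.

    Assumes a Linux ping that supports -c (count) and -W (timeout, seconds).
    """
    result = subprocess.run(
        ["ping", "-c", "1", "-W", "2", host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def unreachable_hosts(
    hosts: Dict[str, str],
    probe: Callable[[str], bool] = ping_once,
) -> List[str]:
    """Return the service names whose host did not answer the probe.

    hosts maps a service name to the IP address of a host that runs it.
    """
    return [name for name, ip in hosts.items() if not probe(ip)]


# Example call with placeholder addresses (replace with the recorded ones):
# unreachable_hosts({
#     "ZooKeeper": "192.0.2.11",
#     "HDFS": "192.0.2.12",
#     "Yarn": "192.0.2.13",
#     "DBService": "192.0.2.14",
# })
```
</pre>
<p>An empty list corresponds to the "If yes" branch of step 6.g. Any service name in the result indicates a connection to report to the O&M personnel as described in step 6.h.</p>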
</div>
<div class="section" id="alm_16004__en-us_topic_0191813910_section5847780"><h4 class="sectiontitle">Reference</h4><p id="alm_16004__en-us_topic_0191813910_p25482430">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_0241.html">Alarm Reference (Applicable to Versions Earlier Than MRS 3.x)</a></div>
</div>
</div>