Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-45000"></a><a name="ALM-45000"></a>
<h1 class="topictitle1">ALM-45000 HetuEngine Service Unavailable</h1>
<div id="body8662426"><div class="section" id="ALM-45000__en-us_topic_0254455541_section114220291078"><h4 class="sectiontitle">Description</h4><p id="ALM-45000__en-us_topic_0254455541_p138313155710">The system checks the <span id="ALM-45000__text719333394">HetuEngine</span> service status every 300 seconds. This alarm is generated when the <span id="ALM-45000__text15464413399">HetuEngine</span> service is unavailable.</p>
<p id="ALM-45000__en-us_topic_0254455541_p163832155718">This alarm is cleared when the <span id="ALM-45000__text14115563915">HetuEngine</span> service recovers.</p>
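<p>The generate-and-clear behavior described above can be sketched as a small state transition. This is only an illustration: the function name <code>evaluate_alarm</code> and the event strings are hypothetical and are not part of MRS Manager; only the 300-second period and the alarm ID come from this topic.</p>

```python
from typing import Tuple

# Check period stated in the description above.
CHECK_INTERVAL_SECONDS = 300


def evaluate_alarm(alarm_active: bool, service_available: bool) -> Tuple[bool, str]:
    """Return the new alarm state and the event produced by one periodic check.

    Hypothetical helper: it only models the rule "generate when unavailable,
    clear when recovered" and does not call any real MRS interface.
    """
    if not service_available and not alarm_active:
        return True, "generate ALM-45000"
    if service_available and alarm_active:
        return False, "clear ALM-45000"
    # Nothing changes if the service state matches the current alarm state.
    return alarm_active, "no change"
```

<p>For example, a check that finds the service unavailable while no alarm is active returns <code>(True, "generate ALM-45000")</code>, and a later check that finds the service available again returns <code>(False, "clear ALM-45000")</code>.</p>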
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section1981920439715"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45000__en-us_topic_0254455541_table8767151877" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45000__en-us_topic_0254455541_row2383515576"><th align="left" class="cellrowborder" valign="top" width="33.65336533653365%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-45000__en-us_topic_0254455541_p438381520719">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.01330133013302%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-45000__en-us_topic_0254455541_p23831156713">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-45000__en-us_topic_0254455541_p838421513713">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45000__en-us_topic_0254455541_row538412158712"><td class="cellrowborder" valign="top" width="33.65336533653365%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-45000__en-us_topic_0254455541_p113842015872">45000</p>
</td>
<td class="cellrowborder" valign="top" width="33.01330133013302%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-45000__en-us_topic_0254455541_p19384131518713">Critical</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-45000__en-us_topic_0254455541_p3384215370">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section15821659972"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45000__en-us_topic_0254455541_table8924154716" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45000__en-us_topic_0254455541_row538401519719"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-45000__en-us_topic_0254455541_p1838411151978">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-45000__en-us_topic_0254455541_p1038481516719">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45000__en-us_topic_0254455541_row10384815775"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45000__en-us_topic_0254455541_p238491510718">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45000__en-us_topic_0254455541_p173848156712">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45000__en-us_topic_0254455541_row153848151576"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45000__en-us_topic_0254455541_p11384151514714">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45000__en-us_topic_0254455541_p17384131512717">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45000__en-us_topic_0254455541_row5384131513717"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45000__en-us_topic_0254455541_p03857158720">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45000__en-us_topic_0254455541_p43855151719">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45000__en-us_topic_0254455541_row93851615478"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45000__en-us_topic_0254455541_p13385915673">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45000__en-us_topic_0254455541_p538561517718">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section329212131082"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-45000__en-us_topic_0254455541_p650910164811"><span id="ALM-45000__text88316593915">HetuEngine</span> tasks fail to execute.</p>
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section1552442916817"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-45000__en-us_topic_0254455541_ul93855151277"><li id="ALM-45000__en-us_topic_0254455541_li1738516151370">The KrbServer service is abnormal.</li><li id="ALM-45000__en-us_topic_0254455541_li03851157716">The ZooKeeper service is abnormal.</li><li id="ALM-45000__en-us_topic_0254455541_li1938520157719">The HDFS service is abnormal.</li><li id="ALM-45000__en-us_topic_0254455541_li17385201515710">The YARN service is abnormal.</li><li id="ALM-45000__en-us_topic_0254455541_li038510151274">The DBService service is abnormal.</li><li id="ALM-45000__en-us_topic_0254455541_li184631630122114">The Hive service is abnormal.</li><li id="ALM-45000__li54121934122616">There are no HSBroker instances in <span id="ALM-45000__text199245512218">HetuEngine</span>.</li></ul>
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section1318114712919"><h4 class="sectiontitle">Procedure</h4><p id="ALM-45000__en-us_topic_0254455541_p1738517151711"><strong id="ALM-45000__b19372163361118">Check the KrbServer service status.</strong></p>
<ol id="ALM-45000__en-us_topic_0254455541_ol1568902571013"><li id="ALM-45000__en-us_topic_0254455541_li18689122521018"><span>On <span id="ALM-45000__text34789336432">MRS</span> Manager, choose <strong id="ALM-45000__en-us_topic_0254455541_b159941399319">O&M</strong> > <strong id="ALM-45000__en-us_topic_0254455541_b18314111333120">Alarm</strong> > <strong id="ALM-45000__en-us_topic_0254455541_b16826131563120">Alarm</strong>.</span></li><li id="ALM-45000__en-us_topic_0254455541_li13400154141114"><span>In the alarm list, check whether the "ALM-25500 KrbServer Service Unavailable" alarm is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul8406185417117"><li id="ALM-45000__en-us_topic_0254455541_li1640685415116">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li6449155931114">3</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li5406105417113">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li9811171991313">5</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li6449155931114"><a name="ALM-45000__en-us_topic_0254455541_li6449155931114"></a><a name="en-us_topic_0254455541_li6449155931114"></a><span>Clear "ALM-25500 KrbServer Service Unavailable" according to the alarm help.</span></li><li id="ALM-45000__li688822017426"><span>In the alarm list, check whether the alarm "ALM-45000 <span id="ALM-45000__text38871820194211">HetuEngine</span> Service Unavailable" is cleared.</span><p><ul id="ALM-45000__ul8888152020422"><li id="ALM-45000__li7888112034214">If yes, no further action is required.</li><li id="ALM-45000__li588822094211">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li9811171991313">5</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p438631511716"><strong id="ALM-45000__b536326193813">Check the ZooKeeper service status.</strong></p>
<ol start="5" id="ALM-45000__en-us_topic_0254455541_ol1581191981313"><li id="ALM-45000__en-us_topic_0254455541_li9811171991313"><a name="ALM-45000__en-us_topic_0254455541_li9811171991313"></a><a name="en-us_topic_0254455541_li9811171991313"></a><span>In the alarm list, check whether the alarm "ALM-12007 Process Fault" is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul538641512717"><li id="ALM-45000__en-us_topic_0254455541_li18386715479">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li078811388136">6</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li1038614151179">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1453212224251">9</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li078811388136"><a name="ALM-45000__en-us_topic_0254455541_li078811388136"></a><a name="en-us_topic_0254455541_li078811388136"></a><span>In the alarm list, click <span><img id="ALM-45000__en-us_topic_0254455541_image46371955175420" src="en-us_image_0000001532767734.png"></span> in the row that contains the "ALM-12007 Process Fault" alarm. In <strong id="ALM-45000__en-us_topic_0254455541_b10945214163613">Location Information</strong>, check whether the service for which the alarm is generated is ZooKeeper.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul7386111511717"><li id="ALM-45000__en-us_topic_0254455541_li638620151378">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li8279173711910">7</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li838619151079">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1453212224251">9</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li8279173711910"><a name="ALM-45000__en-us_topic_0254455541_li8279173711910"></a><a name="en-us_topic_0254455541_li8279173711910"></a><span>Clear "ALM-12007 Process Fault" according to the alarm help.</span></li><li id="ALM-45000__en-us_topic_0254455541_li6789124102017"><span>In the alarm list, check whether the alarm "ALM-45000 <span id="ALM-45000__text1626811259552">HetuEngine</span> Service Unavailable" is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul738616151779"><li id="ALM-45000__en-us_topic_0254455541_li163869157716">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li43865157720">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1453212224251">9</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p10386315771"><strong id="ALM-45000__b1189611123818">Check the HDFS service status.</strong></p>
<ol start="9" id="ALM-45000__en-us_topic_0254455541_ol145329229250"><li id="ALM-45000__en-us_topic_0254455541_li1453212224251"><a name="ALM-45000__en-us_topic_0254455541_li1453212224251"></a><a name="en-us_topic_0254455541_li1453212224251"></a><span>In the alarm list, check whether the "ALM-14000 HDFS Service Unavailable" alarm is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul138720156717"><li id="ALM-45000__en-us_topic_0254455541_li838719151074">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li11186103716269">10</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li6387315171">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li164797109298">12</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li11186103716269"><a name="ALM-45000__en-us_topic_0254455541_li11186103716269"></a><a name="en-us_topic_0254455541_li11186103716269"></a><span>Clear "ALM-14000 HDFS Service Unavailable" according to the alarm help.</span></li><li id="ALM-45000__en-us_topic_0254455541_li15474519152812"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text1061316295569">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul03870151775"><li id="ALM-45000__en-us_topic_0254455541_li153876151874">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li12387101513719">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li164797109298">12</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p1138713153715"><strong id="ALM-45000__b12333716133817">Check the YARN service status.</strong></p>
<ol start="12" id="ALM-45000__en-us_topic_0254455541_ol2479110132911"><li id="ALM-45000__en-us_topic_0254455541_li164797109298"><a name="ALM-45000__en-us_topic_0254455541_li164797109298"></a><a name="en-us_topic_0254455541_li164797109298"></a><span>In the alarm list, check whether the "ALM-18000 YARN Service Unavailable" alarm is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul1738719151376"><li id="ALM-45000__en-us_topic_0254455541_li15387141514718">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li850063073216">13</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li1038721510718">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1336315336331">15</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li850063073216"><a name="ALM-45000__en-us_topic_0254455541_li850063073216"></a><a name="en-us_topic_0254455541_li850063073216"></a><span>Clear "ALM-18000 YARN Service Unavailable" according to the alarm help.</span></li><li id="ALM-45000__en-us_topic_0254455541_li963973433219"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text103942419574">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul14387615172"><li id="ALM-45000__en-us_topic_0254455541_li163872015476">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li4387161511720">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1336315336331">15</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p138711151713"><strong id="ALM-45000__b59761220103817">Check the DBService service status.</strong></p>
<ol start="15" id="ALM-45000__en-us_topic_0254455541_ol14363193319338"><li id="ALM-45000__en-us_topic_0254455541_li1336315336331"><a name="ALM-45000__en-us_topic_0254455541_li1336315336331"></a><a name="en-us_topic_0254455541_li1336315336331"></a><span>In the alarm list, check whether the "ALM-27001 DBService Service Unavailable" alarm is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul103871415373"><li id="ALM-45000__en-us_topic_0254455541_li17387141519717">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li1427826153416">16</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li03883151178">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li652418772713">18</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li1427826153416"><a name="ALM-45000__en-us_topic_0254455541_li1427826153416"></a><a name="en-us_topic_0254455541_li1427826153416"></a><span>Clear "ALM-27001 DBService Service Unavailable" according to the alarm help.</span></li><li id="ALM-45000__en-us_topic_0254455541_li475201017347"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text25911865819">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul1438810156714"><li id="ALM-45000__en-us_topic_0254455541_li1738816155714">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li1938819152718">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li652418772713">18</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p9253204312612"><strong id="ALM-45000__b1374772619382">Check the Hive service status.</strong></p>
<ol start="18" id="ALM-45000__en-us_topic_0254455541_ol2046875816264"><li id="ALM-45000__en-us_topic_0254455541_li652418772713"><span>In the alarm list, check whether the "ALM-16004 Hive Service Unavailable" alarm is generated.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul14524373272"><li id="ALM-45000__en-us_topic_0254455541_li75242718279">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li552411772716">19</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li552407162717">If no, go to <a href="#ALM-45000__li9867630175315">21</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li552411772716"><a name="ALM-45000__en-us_topic_0254455541_li552411772716"></a><a name="en-us_topic_0254455541_li552411772716"></a><span>Clear "ALM-16004 Hive Service Unavailable" according to the alarm help.</span></li><li id="ALM-45000__en-us_topic_0254455541_li45241762717"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text1788917439585">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul1452512718271"><li id="ALM-45000__en-us_topic_0254455541_li8525127122712">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li45259762716">If no, go to <a href="#ALM-45000__li9867630175315">21</a>.</li></ul>
</p></li></ol>
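<p>The dependency checks above follow one pattern: look for a dependency alarm, clear it first, and then re-check ALM-45000. A minimal sketch of that ordering is shown below. The alarm IDs and service names are taken from this procedure; the input format (pairs of alarm ID and service name from Location Information) and the function name are assumptions made for illustration, not an MRS Manager API.</p>

```python
from typing import Iterable, List, Tuple

# Dependency alarms in the order this procedure checks them.
# ALM-12007 is relevant here only when its Location Information names ZooKeeper.
DEPENDENCY_ALARMS: List[Tuple[str, str]] = [
    ("ALM-25500", "KrbServer"),
    ("ALM-12007", "ZooKeeper"),
    ("ALM-14000", "HDFS"),
    ("ALM-18000", "YARN"),
    ("ALM-27001", "DBService"),
    ("ALM-16004", "Hive"),
]


def alarms_to_clear_first(active: Iterable[Tuple[str, str]]) -> List[str]:
    """Return the dependency alarms to clear, in procedure order.

    `active` is a hypothetical list of (alarm ID, service name) pairs copied
    by hand from the alarm list; entries unrelated to HetuEngine are ignored.
    """
    active_set = set(active)
    return [
        f"{alarm_id} ({service})"
        for alarm_id, service in DEPENDENCY_ALARMS
        if (alarm_id, service) in active_set
    ]
```

<p>An empty result means that none of the dependency alarms is active, so troubleshooting continues with the HSBroker instance check.</p>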
<p id="ALM-45000__p83486419536"><strong id="ALM-45000__b521553273812">Check whether <span id="ALM-45000__text19464812133912">HetuEngine</span> has HSBroker instances.</strong></p>
<ol start="21" id="ALM-45000__ol38681930175312"><li id="ALM-45000__li9867630175315"><a name="ALM-45000__li9867630175315"></a><a name="li9867630175315"></a><span>On <span id="ALM-45000__text3425563195">MRS</span> Manager, choose <strong id="ALM-45000__b18831243424724">Cluster</strong> > <em id="ALM-45000__i50391701224724">Name of the desired cluster</em> > <strong id="ALM-45000__b3737670524724">Services</strong> > <strong id="ALM-45000__b19721235827"><span id="ALM-45000__text65304137397">HetuEngine</span></strong>. On the page that is displayed, click the <strong id="ALM-45000__b1320113574215">Instance</strong> tab.</span></li><li id="ALM-45000__li1786813015319"><span>Check whether there are no HSBroker instances.</span><p><ul id="ALM-45000__ul17867123016536"><li id="ALM-45000__li586773014537">If yes, click <strong id="ALM-45000__b136458271940">Add Instance</strong> to add one.</li><li id="ALM-45000__li1186713303537">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1994811814357">24</a>.</li></ul>
</p></li><li id="ALM-45000__li386818302532"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text101503919717">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__ul3868830135311"><li id="ALM-45000__li48681030175314">If yes, no further action is required.</li><li id="ALM-45000__li10868103013536">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li1994811814357">24</a>.</li></ul>
</p></li></ol>
<p id="ALM-45000__en-us_topic_0254455541_p173881415978"><strong id="ALM-45000__b6682163793810">Check the network connection between <span id="ALM-45000__text1940901563916">HetuEngine</span> and ZooKeeper, HDFS, YARN, DBService, and Hive.</strong></p>
<ol start="24" id="ALM-45000__en-us_topic_0254455541_ol1294717817350"><li id="ALM-45000__en-us_topic_0254455541_li1994811814357"><a name="ALM-45000__en-us_topic_0254455541_li1994811814357"></a><a name="en-us_topic_0254455541_li1994811814357"></a><span>On <span id="ALM-45000__text11313157121919">MRS</span> Manager, choose <strong id="ALM-45000__b52521339715">Cluster</strong> > <em id="ALM-45000__i925419331379">Name of the desired cluster</em> > <strong id="ALM-45000__b325518331478">Services</strong> > <strong id="ALM-45000__b14257433379"><span id="ALM-45000__text1425617331671">HetuEngine</span></strong>. On the page that is displayed, click the <strong id="ALM-45000__b425812331072">Instance</strong> tab.</span></li><li id="ALM-45000__en-us_topic_0254455541_li7948128193518"><a name="ALM-45000__en-us_topic_0254455541_li7948128193518"></a><a name="en-us_topic_0254455541_li7948128193518"></a><span>Click the host name in the <strong id="ALM-45000__en-us_topic_0254455541_b176549114211">HSBroker</strong> row and record the management IP address in the <strong id="ALM-45000__en-us_topic_0254455541_b17363183234211">Basic Information</strong> area.</span></li><li id="ALM-45000__en-us_topic_0254455541_li012010364511"><span>Log in to the host where HSBroker resides as user <strong id="ALM-45000__en-us_topic_0254455541_b193887151474">omm</strong> using the IP address obtained in <a href="#ALM-45000__en-us_topic_0254455541_li7948128193518">25</a>.</span></li></ol><ol start="27" id="ALM-45000__en-us_topic_0254455541_ol35439211467"><li id="ALM-45000__en-us_topic_0254455541_li854392104615"><span>Run the <strong id="ALM-45000__en-us_topic_0254455541_b16951117114511">ping</strong> command to check whether the network connection between the host where HSBroker resides and the hosts where ZooKeeper, HDFS, YARN, DBService, and Hive reside is normal.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul10388015172"><li id="ALM-45000__en-us_topic_0254455541_li103881151271">If yes, go to <a href="#ALM-45000__en-us_topic_0254455541_li760014619484">30</a>.</li><li id="ALM-45000__en-us_topic_0254455541_li1438817153714">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li10151810164812">28</a>.</li></ul>
</p></li><li id="ALM-45000__en-us_topic_0254455541_li10151810164812"><a name="ALM-45000__en-us_topic_0254455541_li10151810164812"></a><a name="en-us_topic_0254455541_li10151810164812"></a><span>Contact the network administrator to restore the network.</span></li><li id="ALM-45000__en-us_topic_0254455541_li7665181304814"><span>In the alarm list, check whether the "ALM-45000 <span id="ALM-45000__text10647216911">HetuEngine</span> Service Unavailable" alarm is cleared.</span><p><ul id="ALM-45000__en-us_topic_0254455541_ul113891215378"><li id="ALM-45000__en-us_topic_0254455541_li173894151273">If yes, no further action is required.</li><li id="ALM-45000__en-us_topic_0254455541_li1238913151712">If no, go to <a href="#ALM-45000__en-us_topic_0254455541_li760014619484">30</a>.</li></ul>
</p></li></ol>
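<p>The ping check in the steps above can be repeated for several hosts with a short script. The sketch below assumes a Linux host with the standard <code>ping</code> command and Python 3 available; the function names are hypothetical, and the host addresses must be replaced with the addresses of the nodes where the dependent services run.</p>

```python
import subprocess
from typing import Callable, Dict, Iterable


def ping_once(host: str, timeout_s: int = 5) -> bool:
    """Send one ICMP echo request using the Linux ping command.

    Returns True only if ping exits with status 0 within the timeout.
    """
    try:
        result = subprocess.run(
            ["ping", "-c", "1", host],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
        )
    except (subprocess.TimeoutExpired, OSError):
        # Timeout, or the ping binary is missing or not executable.
        return False
    return result.returncode == 0


def check_hosts(
    hosts: Iterable[str], ping: Callable[[str], bool] = ping_once
) -> Dict[str, bool]:
    """Map each host to True (reachable) or False (unreachable)."""
    return {host: ping(host) for host in hosts}
```

<p>Calling <code>check_hosts</code> with the recorded IP addresses returns one result per host. Any host mapped to <code>False</code> indicates a connection that the network administrator needs to restore.</p>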
<p id="ALM-45000__en-us_topic_0254455541_p1738913151711"><strong id="ALM-45000__b37839620919">Collect fault information.</strong></p>
<ol start="30" id="ALM-45000__en-us_topic_0254455541_ol1560064614813"><li id="ALM-45000__en-us_topic_0254455541_li760014619484"><a name="ALM-45000__en-us_topic_0254455541_li760014619484"></a><a name="en-us_topic_0254455541_li760014619484"></a><span>On <span id="ALM-45000__text16705586196">MRS</span> Manager, choose <strong id="ALM-45000__en-us_topic_0254455541_b181432183474">O&M</strong> > <strong id="ALM-45000__en-us_topic_0254455541_b1414461814472">Log</strong> > <strong id="ALM-45000__en-us_topic_0254455541_b1214461844711">Download</strong>.</span></li><li id="ALM-45000__en-us_topic_0254455541_li160094604812"><span>Expand the <strong id="ALM-45000__b1369717793">Service</strong> drop-down list. In the <strong id="ALM-45000__b837814178911">Services</strong> dialog box that is displayed, select <strong id="ALM-45000__b160712311097"><span id="ALM-45000__text1928021811394">HetuEngine</span></strong> under the target cluster name, and click <strong id="ALM-45000__b113798178912">OK</strong>.</span></li><li id="ALM-45000__li159311279437"><span>Expand the <strong id="ALM-45000__b03111443991">Hosts</strong> drop-down list. In the <strong id="ALM-45000__b18311114318916">Select Host</strong> dialog box that is displayed, select the hosts to which the role belongs, and click <strong id="ALM-45000__b231212437912">OK</strong>.</span></li><li id="ALM-45000__en-us_topic_0254455541_li76001546164810"><span>Click <span><img id="ALM-45000__en-us_topic_0254455541_image0875184517513" src="en-us_image_0000001532927662.png"></span> in the upper right corner, and set <strong id="ALM-45000__b1286912541497">Start Date</strong> and <strong id="ALM-45000__b887005411919">End Date</strong> for log collection to 30 minutes before and 30 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-45000__b1987114541195">Download</strong>.</span></li><li id="ALM-45000__en-us_topic_0254455541_li14600164664818"><span>Contact <span id="ALM-45000__text1798016631019">O&M personnel</span> and provide the collected logs.</span></li></ol>
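<p>The start and end of the log collection window can be worked out from the alarm generation time. The following is a small sketch; the function name <code>log_window</code> is hypothetical, and the 30-minute margin is the value given in the steps above.</p>

```python
from datetime import datetime, timedelta
from typing import Tuple


def log_window(alarm_time: datetime, margin_minutes: int = 30) -> Tuple[datetime, datetime]:
    """Return (start, end) covering margin_minutes before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Example with a made-up alarm generation time.
start, end = log_window(datetime(2024, 1, 1, 12, 0))
print(start.isoformat(sep=" "), "to", end.isoformat(sep=" "))
```

<p>The two returned values are the ones to enter as <strong>Start Date</strong> and <strong>End Date</strong>.</p>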
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section5452182514811"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-45000__en-us_topic_0254455541_p545262513819">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-45000__en-us_topic_0254455541_section1070673311818"><h4 class="sectiontitle">Reference</h4><p id="ALM-45000__en-us_topic_0254455541_p19470124113816">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>