forked from docs/doc-exports
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
82 lines
9.4 KiB
HTML
82 lines
9.4 KiB
HTML
<a name="ALM-50402"></a><a name="ALM-50402"></a>
|
|
|
|
<h1 class="topictitle1">ALM-50402 JobGateway Service Unavailable</h1>
|
|
<div id="body55187498"><div class="note" id="ALM-50402__note14744151615401"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-50402__p1431519566">This section applies only to MRS 3.3.0 or later.</p>
|
|
</div></div>
|
|
<div class="section" id="ALM-50402__section8280367"><h4 class="sectiontitle"><span id="ALM-50402__text8183144004216">Alarm Description</span></h4><p id="ALM-50402__p610514381811">The system checks the JobGateway service status every 60 seconds. This alarm is generated when the JobGateway service is abnormal.</p>
|
|
<p id="ALM-50402__p11105143101817">This alarm is cleared when the JobGateway service recovers.</p>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section7414445"><h4 class="sectiontitle"><span id="ALM-50402__text817617154720">Alarm Attributes</span></h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-50402__table45079949" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-50402__row5683496"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.1"><p id="ALM-50402__p57710042"><span id="ALM-50402__text1978535731719">Alarm ID</span></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.2"><p id="ALM-50402__p44001849"><span id="ALM-50402__text32561148182">Alarm Severity</span></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.3"><p id="ALM-50402__p7380012"><span id="ALM-50402__text165810151810">Auto Cleared</span></p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-50402__row60910108"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-50402__p16488194717492">50402</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-50402__p588994817496">Critical</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.3 "><p id="ALM-50402__p34071398">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section66730009"><h4 class="sectiontitle"><span id="ALM-50402__text118491115201814">Alarm Parameters</span></h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-50402__table8319831" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-50402__row40868022"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.1"><p id="ALM-50402__p21975462"><span id="ALM-50402__text18881122111811">Parameter</span></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.2"><p id="ALM-50402__p35182007"><span id="ALM-50402__text17679183001818">Description</span></p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-50402__row594512751512"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-50402__p8838358184914">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-50402__p837170125015">Specifies the cluster or system for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-50402__row31170320"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-50402__p39123317">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-50402__p172628810500">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-50402__row13127105964111"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-50402__p8127135964119">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-50402__p212715599414">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-50402__row722366124213"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-50402__p522314610427">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-50402__p222314615429">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section63699172"><h4 class="sectiontitle"><span id="ALM-50402__text964023613187">Impact on the System</span></h4><p id="ALM-50402__p1131514217182">No job submission operation can be performed on JobGateway in the cluster. The components that depend on JobGateway in the cluster will become faulty.</p>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section36421639"><h4 class="sectiontitle"><span id="ALM-50402__text8444154310180">Possible Causes</span></h4><p id="ALM-50402__p5131335101812">The node where the JobGateway service locates is faulty.</p>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section633182013020"><h4 class="sectiontitle"><span id="ALM-50402__text1420165031818">Handling Procedure</span></h4><ol id="ALM-50402__ol143181257111813"><li id="ALM-50402__li431925741816"><a name="ALM-50402__li431925741816"></a><a name="li431925741816"></a><span>Log in to FusionInsight Manager, choose <strong id="ALM-50402__b9977191363316">Cluster</strong> > <strong id="ALM-50402__b129781213163319">Services</strong> > <strong id="ALM-50402__b597813134330">JobGateway</strong>, and click the <strong id="ALM-50402__b1997821311339">Instance</strong> tab. Check for JobServer or JobBalancer instances that are faulty or not started and view the host names of these instances.</span></li><li id="ALM-50402__li7429180201910"><span>On the <strong id="ALM-50402__b330583816319">Alarm</strong> page of FusionInsight Manager, check whether the <strong id="ALM-50402__b3305103843113">NodeAgent Process Is Abnormal</strong> alarm is generated.</span><p><ul id="ALM-50402__ul122054535225"><li id="ALM-50402__li720595316220">If yes, go to <a href="#ALM-50402__li854418235198">3</a>.</li><li id="ALM-50402__li82051953152213">If no, go to <a href="#ALM-50402__li2051913258201">6</a>.</li></ul>
|
|
</p></li><li id="ALM-50402__li854418235198"><a name="ALM-50402__li854418235198"></a><a name="li854418235198"></a><span>Check whether the host name in the alarm information is the same as the host name in <a href="#ALM-50402__li431925741816">1</a>.</span><p><ul id="ALM-50402__ul1037155613222"><li id="ALM-50402__li193711556142211">If yes, go to <a href="#ALM-50402__li952112310198">4</a>.</li><li id="ALM-50402__li133711156102219">If no, go to <a href="#ALM-50402__li2051913258201">6</a>.</li></ul>
|
|
</p></li><li id="ALM-50402__li952112310198"><a name="ALM-50402__li952112310198"></a><a name="li952112310198"></a><span>Clear the alarm by following the instructions provided in <strong id="ALM-50402__b294954353910">ALM-12006 NodeAgent Process Is Abnormal</strong>.</span></li><li id="ALM-50402__li1152216311193"><span>In the alarm list, check whether alarm <strong id="ALM-50402__b10989113324114">JobGateway Service Unavailable</strong> is cleared.</span><p><ul id="ALM-50402__ul7373175912222"><li id="ALM-50402__li737310593229">If yes, no further action is required.</li><li id="ALM-50402__li1637315913222">If no, go to <a href="#ALM-50402__li2051913258201">6</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p id="ALM-50402__p1155424631914"><strong id="ALM-50402__b1680964811419">Collect fault information.</strong></p>
|
|
<ol start="6" id="ALM-50402__ol95599822111"><li id="ALM-50402__li2051913258201"><a name="ALM-50402__li2051913258201"></a><a name="li2051913258201"></a><span>On FusionInsight Manager, choose <strong id="ALM-50402__b43011751154112">O&M</strong>. In the navigation pane on the left, choose <strong id="ALM-50402__b930214511419">Log</strong> > <strong id="ALM-50402__b530210513410">Download</strong>.</span></li><li id="ALM-50402__li7226428132014"><span>Expand the <strong id="ALM-50402__b1272475218415">Service</strong> drop-down list, and select <strong id="ALM-50402__b1672585216416">JobGateway</strong> for the target cluster.</span></li><li id="ALM-50402__li10716230122015"><span>Click <span><img id="ALM-50402__image149001122173310" src="en-us_image_0000002007533201.png"></span> in the upper right corner, and set <strong id="ALM-50402__b1284935744117">Start Date</strong> and <strong id="ALM-50402__b584913579419">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-50402__b9850185717412">Download</strong>.</span></li><li id="ALM-50402__li1631620210209"><span>Contact <span id="ALM-50402__text68090417421">O&M personnel</span> and provide the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section169311343318"><h4 class="sectiontitle"><span id="ALM-50402__text107871207193">Alarm Clearance</span></h4><p id="ALM-50402__p55781648135011">This alarm is automatically cleared after the fault is rectified.</p>
|
|
</div>
|
|
<div class="section" id="ALM-50402__section53362350"><h4 class="sectiontitle"><span id="ALM-50402__text13981012171915">Related Information</span></h4><p id="ALM-50402__p7522741"><span id="ALM-50402__text19872115196">None.</span></p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|