Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="alm_12010"></a><a name="alm_12010"></a>
<h1 class="topictitle1">ALM-12010 Manager Heartbeat Interruption Between the Active and Standby Nodes</h1>
<div id="body8662426"><div class="section" id="alm_12010__en-us_topic_0191813932_section3336541317510"><h4 class="sectiontitle">Description</h4><p id="alm_12010__en-us_topic_0191813932_p61029329115950">This alarm is generated when the active Manager does not receive any heartbeat signal from the standby Manager within 7 seconds.</p>
<p id="alm_12010__en-us_topic_0191813932_p2573829115950">This alarm is cleared when the active Manager receives heartbeat signals from the standby Manager.</p>
</div>
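<p>The trigger condition in the description can be sketched as a small shell check. This is only an illustration, not the Manager implementation: the function name <strong>heartbeat_overdue</strong>, its arguments, and the choice to treat exactly 7 seconds of silence as overdue are assumptions made for the example.</p>

```shell
# Sketch: decide whether a heartbeat is overdue.
# heartbeat_overdue is a hypothetical helper, not part of MRS Manager.
# Arguments:
#   $1  epoch seconds at which the last heartbeat was received
#   $2  current epoch seconds
#   $3  optional timeout in seconds (defaults to 7, the value in the description)
heartbeat_overdue() {
    last_seen="$1"
    now="$2"
    timeout="${3:-7}"
    elapsed=$(( now - last_seen ))
    # Assumption: a gap equal to the timeout already counts as overdue.
    [ "$elapsed" -ge "$timeout" ]
}
```

<p>For example, <strong>heartbeat_overdue 100 107</strong> succeeds (the alarm condition holds), while <strong>heartbeat_overdue 100 106</strong> fails (the heartbeat is still considered current).</p>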
<div class="section" id="alm_12010__en-us_topic_0191813932_section6589867417620"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_12010__en-us_topic_0191813932_table51335144115950" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_12010__en-us_topic_0191813932_row22457334115950"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="alm_12010__en-us_topic_0191813932_p31502817115950"><strong id="alm_12010__b938618581702">Alarm ID</strong></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="alm_12010__en-us_topic_0191813932_p22174081115950"><strong id="alm_12010__b12563591015">Alarm Severity</strong></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="alm_12010__en-us_topic_0191813932_p27776904115950"><strong id="alm_12010__b197240591802">Auto Clear</strong></p>
</th>
</tr>
</thead>
<tbody><tr id="alm_12010__en-us_topic_0191813932_row1945234115950"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="alm_12010__en-us_topic_0191813932_p26013574115950">12010</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="alm_12010__en-us_topic_0191813932_p21947919115950">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="alm_12010__en-us_topic_0191813932_p22246739115950">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_12010__en-us_topic_0191813932_section4656225517628"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_12010__en-us_topic_0191813932_table30473597115950" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_12010__en-us_topic_0191813932_row3202901115950"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="alm_12010__en-us_topic_0191813932_p51699739115950"><strong id="alm_12010__b678219418113">Parameter</strong></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="alm_12010__en-us_topic_0191813932_p3813318115950"><strong id="alm_12010__b546111516117">Description</strong></p>
</th>
</tr>
</thead>
<tbody><tr id="alm_12010__en-us_topic_0191813932_row63329947115950"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12010__en-us_topic_0191813932_p37698384115950">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12010__en-us_topic_0191813932_p38746956115950">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_12010__en-us_topic_0191813932_row57870114115950"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12010__en-us_topic_0191813932_p26943429115950">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12010__en-us_topic_0191813932_p12845059115950">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_12010__en-us_topic_0191813932_row10748285115950"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12010__en-us_topic_0191813932_p62476968115950">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12010__en-us_topic_0191813932_p31473313115950">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_12010__en-us_topic_0191813932_row50174577115950"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12010__en-us_topic_0191813932_p32369325115950">Local Manager HA Name</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12010__en-us_topic_0191813932_p63610061115950">Specifies a local Manager HA.</p>
</td>
</tr>
<tr id="alm_12010__en-us_topic_0191813932_row46406907115950"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12010__en-us_topic_0191813932_p33396547115950">Peer Manager HA Name</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12010__en-us_topic_0191813932_p3666786115950">Specifies a peer Manager HA.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_12010__en-us_topic_0191813932_section6050290517637"><h4 class="sectiontitle">Impact on the System</h4><p id="alm_12010__en-us_topic_0191813932_p66386893115950">If the active Manager process becomes abnormal while the heartbeat is interrupted, an active/standby failover cannot be performed and services are affected.</p>
</div>
<div class="section" id="alm_12010__en-us_topic_0191813932_section852502017642"><h4 class="sectiontitle">Possible Causes</h4><p id="alm_12010__en-us_topic_0191813932_p13611868115950">The link between the active and standby Manager servers is abnormal.</p>
</div>
<div class="section" id="alm_12010__en-us_topic_0191813932_section2641184617833"><h4 class="sectiontitle">Procedure</h4><ol id="alm_12010__en-us_topic_0191813932_ol6529643917647"><li id="alm_12010__en-us_topic_0191813932_li4623234517647"><span>Check whether the network between the active and standby Manager servers is normal.</span><p><ol type="a" id="alm_12010__en-us_topic_0191813932_ol5367122517818"><li id="alm_12010__en-us_topic_0191813932_li1860412317818">Go to the MRS cluster details page. In the alarm list on the alarm management tab page, click the row that contains the alarm. In the alarm details, view the address of the standby Manager server.</li><li id="alm_12010__en-us_topic_0191813932_li5774108217818">Log in to the active management node.</li><li id="alm_12010__en-us_topic_0191813932_li842093317838">Run the following command to check whether the standby Manager is reachable:<p id="alm_12010__en-us_topic_0191813932_p637480517846"><a name="alm_12010__en-us_topic_0191813932_li842093317838"></a><a name="en-us_topic_0191813932_li842093317838"></a><strong id="alm_12010__b14711122717">ping</strong> <em id="alm_12010__i14712521172">heartbeat IP address of the standby Manager</em></p>
<ul id="alm_12010__en-us_topic_0191813932_ul4202515117939"><li id="alm_12010__en-us_topic_0191813932_li320616617939">If yes, go to <a href="#alm_12010__li7265151273516">2</a>.</li><li id="alm_12010__en-us_topic_0191813932_li3058285917939">If no, go to <a href="#alm_12010__en-us_topic_0191813932_li233941717940">1.d</a>.</li></ul>
</li><li id="alm_12010__en-us_topic_0191813932_li233941717940"><a name="alm_12010__en-us_topic_0191813932_li233941717940"></a><a name="en-us_topic_0191813932_li233941717940"></a>Contact O&amp;M personnel to check whether the network is faulty.<ul id="alm_12010__en-us_topic_0191813932_ul1585026317104"><li id="alm_12010__en-us_topic_0191813932_li2205430717104">If yes, go to <a href="#alm_12010__en-us_topic_0191813932_li4279289717106">1.e</a>.</li><li id="alm_12010__en-us_topic_0191813932_li1160370917104">If no, go to <a href="#alm_12010__li7265151273516">2</a>.</li></ul>
</li><li id="alm_12010__en-us_topic_0191813932_li4279289717106"><a name="alm_12010__en-us_topic_0191813932_li4279289717106"></a><a name="en-us_topic_0191813932_li4279289717106"></a>Rectify the network fault and check whether the alarm is cleared from the alarm list.<ul id="alm_12010__en-us_topic_0191813932_ul48365907171018"><li id="alm_12010__en-us_topic_0191813932_li65605641171018">If yes, no further action is required.</li><li id="alm_12010__en-us_topic_0191813932_li2362146171018">If no, go to <a href="#alm_12010__li7265151273516">2</a>.</li></ul>
</li></ol>
</p></li><li id="alm_12010__li7265151273516"><a name="alm_12010__li7265151273516"></a><a name="li7265151273516"></a><span>Log in to each master node in the cluster, run the following commands to find all <strong id="alm_12010__b0776637125317">sed</strong><em id="alm_12010__i5542183919534">xxx</em> files, and then delete the files that are found:</span><p><p id="alm_12010__p2909105943715"><strong id="alm_12010__b1580025163819">find /srv/BigData/ -name "sed*"</strong></p>
<p id="alm_12010__p20882193511370"><strong id="alm_12010__b1180255143812">find /opt -name "sed*"</strong></p>
</p></li><li id="alm_12010__en-us_topic_0191813932_li572522141314"><span>Collect fault information.</span><p><ol type="a" id="alm_12010__en-us_topic_0191813932_en-us_topic_0191813935_ol6089206913036"><li id="alm_12010__en-us_topic_0191813932_en-us_topic_0191813935_li4478836213036">On MRS Manager, choose <span class="menucascade" id="alm_12010__menucascade61691649389"><b><span class="uicontrol" id="alm_12010__uicontrol19164104913811">System</span></b> > <b><span class="uicontrol" id="alm_12010__uicontrol31696491185">Export Log</span></b></span>.</li><li id="alm_12010__li18574327401">Contact technical support engineers for help. For details, see <a href="https://docs.otc.t-systems.com/en-us/public/learnmore.html" target="_blank" rel="noopener noreferrer">technical support</a>.</li></ol>
</p></li></ol>
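<p>The reachability check in step 1.c can be wrapped in a small shell function, sketched below under stated assumptions. The function name <strong>check_standby_reachable</strong> and the <strong>PING_BIN</strong> override are hypothetical and exist only for illustration; the sketch relies on the common behavior of <strong>ping -c</strong>, which exits with status 0 when at least one reply is received.</p>

```shell
# Sketch: report whether the standby Manager answers ICMP echo requests.
# check_standby_reachable and PING_BIN are hypothetical names for this example.
# PING_BIN lets a different ping binary be substituted; it defaults to "ping".
check_standby_reachable() {
    standby_ip="$1"   # heartbeat IP address of the standby Manager
    if [ -z "$standby_ip" ]; then
        echo "usage: check_standby_reachable <standby heartbeat IP>" >&2
        return 2
    fi
    # -c 3: send three echo requests, then exit.
    if "${PING_BIN:-ping}" -c 3 "$standby_ip" >/dev/null 2>&1; then
        echo "reachable: continue with step 2"
    else
        echo "unreachable: continue with step 1.d"
        return 1
    fi
}
```

<p>On the active management node, the function would be called with the heartbeat IP address shown in the alarm details. A successful ping only shows that the network path works; it does not by itself clear the alarm.</p>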
</div>
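<p>For step 2 of the procedure, the two <strong>find</strong> commands can be combined into one helper that lists the candidate files for review before anything is deleted. This is a cautious sketch, not a prescribed method: the function name <strong>list_sed_files</strong> is hypothetical, and restricting the search to regular files with <strong>-type f</strong> is an assumption that the original commands do not make.</p>

```shell
# Sketch: list files whose names start with "sed" under the given directories
# so that they can be reviewed before being deleted manually.
# list_sed_files is a hypothetical helper name used only in this example.
list_sed_files() {
    for dir in "$@"; do
        # Skip directories that do not exist on this node.
        [ -d "$dir" ] || continue
        # Assumption: only regular files are of interest (-type f).
        find "$dir" -type f -name "sed*" 2>/dev/null
    done
}

# Example invocation on a master node (paths taken from the procedure):
#   list_sed_files /srv/BigData/ /opt
```

<p>Because the pattern <strong>sed*</strong> can also match unrelated files, the listing is meant to be checked by hand; the sketch deliberately does not delete anything.</p>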
<div class="section" id="alm_12010__en-us_topic_0191813932_section55635852162510"><h4 class="sectiontitle">Reference</h4><p id="alm_12010__en-us_topic_0191813932_p40031700162511">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_0241.html">Alarm Reference (Applicable to Versions Earlier Than MRS 3.x)</a></div>
</div>
</div>