forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
107 lines
14 KiB
HTML
107 lines
14 KiB
HTML
<a name="alm_12002"></a><a name="alm_12002"></a>
|
|
|
|
<h1 class="topictitle1">ALM-12002 HA Resource Is Abnormal</h1>
|
|
<div id="body8662426"><div class="section" id="alm_12002__en-us_topic_0191813914_section448827651381"><h4 class="sectiontitle">Description</h4><p id="alm_12002__en-us_topic_0191813914_p65328378115518">The high availability (HA) software periodically checks the WebService floating IP addresses and databases of Manager. This alarm is generated when the HA software detects that the WebService floating IP addresses or databases are abnormal.</p>
|
|
<p id="alm_12002__en-us_topic_0191813914_p52996736115518">This alarm is cleared when the HA software detects that the floating IP addresses or databases are normal.</p>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section610286913819"><h4 class="sectiontitle"><strong id="alm_12002__en-us_topic_0191813914_b12540622115518">Attribute</strong></h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_12002__en-us_topic_0191813914_table41020271115518" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_12002__en-us_topic_0191813914_row20195275115518"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="alm_12002__en-us_topic_0191813914_p64486337115518"><strong id="alm_12002__en-us_topic_0191813914_b18218632115518">Alarm ID</strong></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="alm_12002__en-us_topic_0191813914_p48371754115518"><strong id="alm_12002__en-us_topic_0191813914_b14511135115518">Alarm Severity</strong></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="alm_12002__en-us_topic_0191813914_p57987160115518"><strong id="alm_12002__en-us_topic_0191813914_b37124549115518">Auto Clear</strong></p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="alm_12002__en-us_topic_0191813914_row62885452115518"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="alm_12002__en-us_topic_0191813914_p39401635115518">12002</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="alm_12002__en-us_topic_0191813914_p11553158115518">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="alm_12002__en-us_topic_0191813914_p35457693115518">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section4392026713828"><h4 class="sectiontitle">Parameter</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_12002__en-us_topic_0191813914_table25861131115518" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_12002__en-us_topic_0191813914_row42913945115518"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="alm_12002__en-us_topic_0191813914_p29893129115518"><strong id="alm_12002__en-us_topic_0191813914_b60298881115518">Parameter</strong></p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="alm_12002__en-us_topic_0191813914_p18815149115518"><strong id="alm_12002__en-us_topic_0191813914_b62040571115518">Description</strong></p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="alm_12002__en-us_topic_0191813914_row57825289115518"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12002__en-us_topic_0191813914_p41738492115518">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12002__en-us_topic_0191813914_p67058542115518">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="alm_12002__en-us_topic_0191813914_row61020013115518"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12002__en-us_topic_0191813914_p40821002115518">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12002__en-us_topic_0191813914_p55863226115518">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="alm_12002__en-us_topic_0191813914_row48564051115518"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12002__en-us_topic_0191813914_p7869652115518">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12002__en-us_topic_0191813914_p60772316115518">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="alm_12002__en-us_topic_0191813914_row38584145115518"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_12002__en-us_topic_0191813914_p55160031115518">RESName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_12002__en-us_topic_0191813914_p9951817115518">Specifies the resource for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section4853205713841"><h4 class="sectiontitle">Impact on the System</h4><p id="alm_12002__en-us_topic_0191813914_p14825960115518">If the WebService floating IP addresses of Manager are abnormal, users cannot log in to or use Manager. If databases of Manager are abnormal, all core services and related service processes, such as alarms and monitoring functions, are affected.</p>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section2646833913851"><h4 class="sectiontitle">Possible Causes</h4><ul id="alm_12002__en-us_topic_0191813914_ul259757181395"><li id="alm_12002__en-us_topic_0191813914_li330266481395">The floating IP address is abnormal.</li><li id="alm_12002__en-us_topic_0191813914_li588557991395">The database is abnormal.</li></ul>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section24350321397"><h4 class="sectiontitle">Procedure</h4><ol id="alm_12002__en-us_topic_0191813914_ol1324174013915"><li id="alm_12002__en-us_topic_0191813914_li1436970913915"><span>Check the floating IP address status of the active management node.</span><p><ol type="a" id="alm_12002__en-us_topic_0191813914_ol5631024613939"><li id="alm_12002__en-us_topic_0191813914_li1935016213939">Go to the MRS cluster details page. In the alarm list on the alarm management tab page, click the row that contains the alarm. In the alarm details, view the host address and resource name of the alarm.</li><li id="alm_12002__en-us_topic_0191813914_li1906944913939">Log in to the active management node. Run the following commands to switch the user:<p id="alm_12002__en-us_topic_0191813914_p6617592102925"><a name="alm_12002__en-us_topic_0191813914_li1906944913939"></a><a name="en-us_topic_0191813914_li1906944913939"></a><strong id="alm_12002__en-us_topic_0191813914_b26930074102939">sudo su - root</strong></p>
|
|
<p id="alm_12002__en-us_topic_0191813914_p31047252102932"><strong id="alm_12002__en-us_topic_0191813914_b46848621141256">su - omm</strong></p>
|
|
</li><li id="alm_12002__en-us_topic_0191813914_li15088482131021">Go to the <strong id="alm_12002__b180113810233630">${BIGDATA_HOME}/om-0.0.1/sbin/</strong> directory, run the <strong id="alm_12002__b190249075233630">status-oms.sh</strong> script to check whether the floating IP address of the active Manager is normal. View the command output, locate the row where <strong id="alm_12002__b196287516433630">ResName</strong> is <strong id="alm_12002__b52060402333630">floatip</strong>, and check whether the following information is displayed.<p id="alm_12002__en-us_topic_0191813914_p1641552115518">Example:</p>
|
|
<pre class="screen" id="alm_12002__en-us_topic_0191813914_screen53274402131514">10-10-10-160 floatip Normal Normal Single_active</pre>
|
|
<ul id="alm_12002__en-us_topic_0191813914_ul55061142131751"><li id="alm_12002__en-us_topic_0191813914_li9833109131751">If yes, go to <a href="#alm_12002__en-us_topic_0191813914_li50663096131636">2</a>.</li><li id="alm_12002__en-us_topic_0191813914_li62620129131751">If no, go to <a href="#alm_12002__en-us_topic_0191813914_li41799423131631">1.d</a>.</li></ul>
|
|
</li><li id="alm_12002__en-us_topic_0191813914_li41799423131631"><a name="alm_12002__en-us_topic_0191813914_li41799423131631"></a><a name="en-us_topic_0191813914_li41799423131631"></a>Contact the O&M personnel to check whether the floating IP NIC exists.<ul id="alm_12002__en-us_topic_0191813914_ul66564617131753"><li id="alm_12002__en-us_topic_0191813914_li30629667131753">If yes, go to <a href="#alm_12002__en-us_topic_0191813914_li50663096131636">2</a>.</li><li id="alm_12002__en-us_topic_0191813914_li42574111131753">If no, go to <a href="#alm_12002__en-us_topic_0191813914_li6978622131725">1.e</a>.</li></ul>
|
|
</li><li id="alm_12002__en-us_topic_0191813914_li6978622131725"><a name="alm_12002__en-us_topic_0191813914_li6978622131725"></a><a name="en-us_topic_0191813914_li6978622131725"></a>Contact O&M personnel to rectify the NIC fault.<p id="alm_12002__en-us_topic_0191813914_p6208368131615"><a name="alm_12002__en-us_topic_0191813914_li6978622131725"></a><a name="en-us_topic_0191813914_li6978622131725"></a>Wait 5 minutes and check whether the alarm is cleared.</p>
|
|
<ul id="alm_12002__en-us_topic_0191813914_ul9500213131756"><li id="alm_12002__en-us_topic_0191813914_li55982140131756">If yes, no further action is required.</li><li id="alm_12002__en-us_topic_0191813914_li9782942131756">If no, go to <a href="#alm_12002__en-us_topic_0191813914_li50663096131636">2</a>.</li></ul>
|
|
</li></ol>
|
|
</p></li><li id="alm_12002__en-us_topic_0191813914_li50663096131636"><a name="alm_12002__en-us_topic_0191813914_li50663096131636"></a><a name="en-us_topic_0191813914_li50663096131636"></a><span>Check the database status of the active and standby management nodes.</span><p><ol type="a" id="alm_12002__en-us_topic_0191813914_ol58702441142223"><li id="alm_12002__en-us_topic_0191813914_li52660603142223">Log in to the active and standby management nodes, run the <strong id="alm_12002__b160764785333630">sudo su - root</strong> and <strong id="alm_12002__b36957355133630">su - ommdba</strong> commands to switch to user <strong id="alm_12002__b153542149033630">ommdba</strong>, and run the <strong id="alm_12002__b174492773433630">gs_ctl query</strong> command to check whether the following information is displayed in the command output.<p id="alm_12002__en-us_topic_0191813914_p7617767115518">Command output of the active management node:</p>
|
|
<pre class="screen" id="alm_12002__en-us_topic_0191813914_screen21483773132111">Ha state:
|
|
LOCAL_ROLE: Primary
|
|
STATIC_CONNECTIONS: 1
|
|
DB_STATE: Normal
|
|
DETAIL_INFORMATION: user/password invalid
|
|
Senders info:
|
|
No information
|
|
Receiver info:
|
|
No information</pre>
|
|
<p id="alm_12002__en-us_topic_0191813914_p62214538115518">Command output of the standby management node:</p>
|
|
<pre class="screen" id="alm_12002__en-us_topic_0191813914_screen18614098132137">Ha state:
|
|
LOCAL_ROLE: Standby
|
|
STATIC_CONNECTIONS: 1
|
|
DB_STATE : Normal
|
|
DETAIL_INFORMATION: user/password invalid
|
|
Senders info:
|
|
No information
|
|
Receiver info:
|
|
No information</pre>
|
|
<ul id="alm_12002__en-us_topic_0191813914_ul26122474141242"><li id="alm_12002__en-us_topic_0191813914_li45189483141242">If yes, go to <a href="#alm_12002__en-us_topic_0191813914_li55696398142240">2.c</a>.</li><li id="alm_12002__en-us_topic_0191813914_li9528054141242">If no, go to <a href="#alm_12002__en-us_topic_0191813914_li40232703142216">2.b</a>.</li></ul>
|
|
</li></ol><ol type="a" start="2" id="alm_12002__en-us_topic_0191813914_ol34519563142238"><li id="alm_12002__en-us_topic_0191813914_li40232703142216"><a name="alm_12002__en-us_topic_0191813914_li40232703142216"></a><a name="en-us_topic_0191813914_li40232703142216"></a>Contact the O&M personnel to check whether a network fault occurs and rectify the fault.<ul id="alm_12002__en-us_topic_0191813914_ul17427239141244"><li id="alm_12002__en-us_topic_0191813914_li46203793141244">If yes, go to <a href="#alm_12002__en-us_topic_0191813914_li55696398142240">2.c</a>.</li><li id="alm_12002__en-us_topic_0191813914_li42315102141244">If no, go to <a href="#alm_12002__en-us_topic_0191813935_li2924012813025">3</a>.</li></ul>
|
|
</li></ol><ol type="a" start="3" id="alm_12002__en-us_topic_0191813914_ol3105449142243"><li id="alm_12002__en-us_topic_0191813914_li55696398142240"><a name="alm_12002__en-us_topic_0191813914_li55696398142240"></a><a name="en-us_topic_0191813914_li55696398142240"></a>Wait 5 minutes and check whether the alarm is cleared.<ul id="alm_12002__en-us_topic_0191813914_ul57005670141248"><li id="alm_12002__en-us_topic_0191813914_li34086749141248">If yes, no further action is required.</li><li id="alm_12002__en-us_topic_0191813914_li10397534141248">If no, go to <a href="#alm_12002__en-us_topic_0191813935_li2924012813025">3</a>.</li></ul>
|
|
</li></ol>
|
|
</p></li><li id="alm_12002__en-us_topic_0191813935_li2924012813025"><a name="alm_12002__en-us_topic_0191813935_li2924012813025"></a><a name="en-us_topic_0191813935_li2924012813025"></a><span>Collect fault information.</span><p><ol type="a" id="alm_12002__ol785716327403"><li id="alm_12002__li19857532184016">On MRS Manager, choose <span class="menucascade" id="alm_12002__menucascade98247193445"><b><span class="uicontrol" id="alm_12002__uicontrol1982361984412">System</span></b> > <b><span class="uicontrol" id="alm_12002__uicontrol10824111944419">Export Log</span></b></span>.</li><li id="alm_12002__li18574327401">Contact technical support engineers for help. For details, see <a href="https://docs.otc.t-systems.com/en-us/public/learnmore.html" target="_blank" rel="noopener noreferrer">technical support</a>.</li></ol>
|
|
</p></li></ol>
|
|
</div>
|
|
<div class="section" id="alm_12002__en-us_topic_0191813914_section27750880144445"><h4 class="sectiontitle">Reference</h4><p id="alm_12002__en-us_topic_0191813914_p6763044144452">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_0241.html">Alarm Reference (Applicable to Versions Earlier Than MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|