doc-exports/docs/mrs/umn/ALM-12062.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

98 lines
12 KiB
HTML

<a name="ALM-12062"></a><a name="ALM-12062"></a>
<h1 class="topictitle1">ALM-12062 OMS Parameter Configurations Mismatch with the Cluster Scale</h1>
<div id="body1546915438900"><div class="section" id="ALM-12062__section2747821101717"><h4 class="sectiontitle">Description</h4><p id="ALM-12062__p53271255205214">The system checks whether the OMS parameter configurations match with the cluster scale at each top hour. If the OMS parameter configurations do not meet the cluster scale requirements, the system generates this alarm. This alarm is automatically cleared when the OMS parameter configurations are modified.</p>
</div>
<div class="section" id="ALM-12062__section127478213171"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12062__table7749721191719" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12062__row6867152161714"><th align="left" class="cellrowborder" valign="top" width="34.37343734373437%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12062__p03908133538">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="34.31343134313431%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-12062__p239001375320">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="31.313131313131308%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-12062__p1939041395319">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12062__row586722117171"><td class="cellrowborder" valign="top" width="34.37343734373437%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-12062__p33906131535">12062</p>
</td>
<td class="cellrowborder" valign="top" width="34.31343134313431%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-12062__p73902013155315">Major</p>
</td>
<td class="cellrowborder" valign="top" width="31.313131313131308%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-12062__p1539021315312">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12062__section14755172115173"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12062__table17756521131714" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12062__row18671421131712"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-12062__p786772121719">Parameter</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-12062__p286742191711">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12062__row15959022415"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12062__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12062__p692551319435">Specifies the cluster or system for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12062__row786710211177"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12062__p58673218178">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12062__p4868821191713">Specifies the name of the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12062__row286819215174"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12062__p186818216176">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12062__p7868721131716">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12062__row14868221161713"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12062__p1986842116171">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12062__p10868132118175">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12062__section1776462111715"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12062__p13799111525419">The OMS configuration is not modified when the cluster is installed or the system capacity is expanded.</p>
</div>
<div class="section" id="ALM-12062__section6765152119174"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12062__p12567152916541">The OMS parameter configurations mismatch with the cluster scale.</p>
</div>
<div class="section" id="ALM-12062__section87667210173"><h4 class="sectiontitle">Procedure</h4><p id="ALM-12062__p456793710548"><strong id="ALM-12062__b356720377542">Check whether the OMS parameter configurations match with the cluster scale.</strong></p>
<ol id="ALM-12062__ol87012317557"><li id="ALM-12062__li489962395514"><span>In the alarm list on FusionInsight Manager, locate the row that contains the alarm, and view the IP address of the host for which the alarm is generated.</span></li><li id="ALM-12062__li152261503555"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12062__b022675065516">root</strong>. <span id="ALM-12062__text985593916354"></span></span></li><li id="ALM-12062__li95861858185515"><span>Run the <strong id="ALM-12062__b19586105865511">su - omm</strong> command to switch to user <strong id="ALM-12062__b6602115865515">omm</strong>.</span></li><li id="ALM-12062__li960214583555"><span>Run the <strong id="ALM-12062__b660235865514">vi $BIGDATA_LOG_HOME/controller/scriptlog/modify_manager_param.log</strong> command to open the log file and search for the log file containing the following information: Current oms configurations cannot support <em id="ALM-12062__i260210581552">xx</em> nodes. In the information, <em id="ALM-12062__i1760210587558">xx</em> indicates the number of nodes in the cluster.</span></li><li id="ALM-12062__li1895714113811"><span>Optimize the current cluster configuration by following the instructions in <a href="#ALM-12062__section117861721171717">Optimizing Manager Configurations Based on the Number of Cluster Nodes</a>.</span></li><li id="ALM-12062__li199275175618"><span>One hour later, check whether the alarm is cleared.</span><p><ul id="ALM-12062__ul65231712185619"><li id="ALM-12062__li4861118105614">If it is, no further action is required.</li><li id="ALM-12062__li152720248562">If it is not, go to <a href="#ALM-12062__li8140111212587">7</a>.</li></ul>
</p></li></ol>
<p id="ALM-12062__p13421113195811"><strong id="ALM-12062__b204218131586">Collect fault information.</strong></p>
<ol start="7" id="ALM-12062__ol1514001219584"><li id="ALM-12062__li8140111212587"><a name="ALM-12062__li8140111212587"></a><a name="li8140111212587"></a><span>On FusionInsight Manager, choose <strong id="ALM-12062__b12140112175816">O&amp;M</strong> &gt; <strong id="ALM-12062__b114011127584">Log</strong> &gt; <strong id="ALM-12062__b141404121585">Download</strong>.</span></li><li id="ALM-12062__li9140101216585"><span>Select <strong id="ALM-12062__b15140101214581">Controller</strong> from the <strong id="ALM-12062__b214071255817">Service</strong> and click <strong id="ALM-12062__b3991118545">OK</strong>.</span></li><li id="ALM-12062__li121401712195814"><span>Click <span><img id="ALM-12062__image1914021213589" src="en-us_image_0269383907.png"></span> in the upper right corner, and set <strong id="ALM-12062__b15140101215811">Start Date</strong> and <strong id="ALM-12062__b121408123588">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-12062__b1214091210583">Download</strong>.</span></li><li id="ALM-12062__li495644512588"><span>Contact the <span id="ALM-12062__text4614151421417">O&amp;M personnel</span> and send the collected log information.</span></li></ol>
</div>
<div class="section" id="ALM-12062__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12062__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-12062__section117861721171717"><a name="ALM-12062__section117861721171717"></a><a name="section117861721171717"></a><h4 class="sectiontitle">Related Information</h4><p id="ALM-12062__p6786101413374"><strong id="ALM-12062__b1539194154415">Optimizing Manager Configurations Based on the Number of Cluster Nodes</strong></p>
<ol id="ALM-12062__ol05141233717"><li id="ALM-12062__en-us_topic_0165590374_li26979450111823"><span>Log in to the active Manager node as user <strong id="ALM-12062__en-us_topic_0165590374_b51895181112128">omm</strong>.</span></li><li id="ALM-12062__en-us_topic_0165590374_li37226470112023"><span>Run the following command to switch the directory:</span><p><p id="ALM-12062__en-us_topic_0165590374_p57368764112211"><strong id="ALM-12062__en-us_topic_0165590374_b25553026112214">cd ${BIGDATA_HOME}/om-server/om/sbin</strong></p>
</p></li><li id="ALM-12062__en-us_topic_0165590374_li42402569112040"><span>Run the following command to view the current Manager configurations.</span><p><p id="ALM-12062__en-us_topic_0165590374_p49977307112647"><strong id="ALM-12062__en-us_topic_0165590374_b52915491112650">sh oms_config_info.sh -q</strong></p>
</p></li><li id="ALM-12062__en-us_topic_0165590374_li49167719112555"><span>Run the following command to specify the number of nodes in the current cluster.</span><p><p id="ALM-12062__en-us_topic_0165590374_p64566987112853">Command format: <strong id="ALM-12062__en-us_topic_0165590374_b7323796112918">sh oms_config_info.sh -s </strong><em id="ALM-12062__en-us_topic_0165590374_i45810750112920">number of nodes</em></p>
<p id="ALM-12062__en-us_topic_0165590374_p13336332113026">Example:</p>
<p id="ALM-12062__en-us_topic_0165590374_p28514502112923"><strong id="ALM-12062__en-us_topic_0165590374_b34554882153757">sh oms_config_info.sh -s 10</strong><strong id="ALM-12062__en-us_topic_0165590374_b42558486153757">00</strong></p>
<p id="ALM-12062__en-us_topic_0165590374_p56151975113352">Enter <span class="parmname" id="ALM-12062__en-us_topic_0165590374_parmname54856661113358"><b>y</b></span> as prompted.</p>
<pre class="screen" id="ALM-12062__en-us_topic_0165590374_screen1838109215412">The following configurations will be modified:
Module Parameter Current Target
Controller controller.Xmx 4096m =&gt; 16384m
Controller controller.Xms 1024m =&gt; 8192m
Controller controller.node.heartbeat.error.threshold 30000 =&gt; 60000
Pms pms.mem 8192m =&gt; 10240m
Do you really want to do this operation? (y/n):</pre>
<p id="ALM-12062__en-us_topic_0165590374_p33978317113511">The configurations are updated successfully if the following information is displayed:</p>
<pre class="screen" id="ALM-12062__en-us_topic_0165590374_screen66711405113653">...
Operation has been completed. Now restarting OMS server. [done]
Restarted oms server successfully.</pre>
<div class="note" id="ALM-12062__en-us_topic_0165590374_note26248943114621"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-12062__en-us_topic_0165590374_ul56308272114644"><li id="ALM-12062__en-us_topic_0165590374_li30858860114644">OMS is automatically restarted during the configuration update process.</li><li id="ALM-12062__en-us_topic_0165590374_li28603951114646">Clusters with similar quantities of nodes have same Manager configurations. For example, when the number of nodes is changed from 100 to 101, no configuration item needs to be updated.</li></ul>
</div></div>
</p></li></ol>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>