Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-12027"></a>
<h1 class="topictitle1">ALM-12027 Host PID Usage Exceeds the Threshold</h1>
<div id="body15885728"><div class="section" id="ALM-12027__s4ff73d9b6e5e4103a3820abbc876532e"><h4 class="sectiontitle">Description</h4><p id="ALM-12027__en-us_topic_0070543581_p5836323">The system checks the PID usage every 30 seconds and compares the actual PID usage with the default PID usage threshold. This alarm is generated when the system detects that the PID usage exceeds the threshold.</p>
<p id="ALM-12027__p4934657911347">When the <strong id="ALM-12027__b44134084101639">Trigger Count</strong> is 1, this alarm is cleared when the PID usage is less than or equal to the threshold. When the <strong id="ALM-12027__b1741410113352">Trigger Count</strong> is greater than 1, this alarm is cleared when the PID usage is less than or equal to 90% of the threshold.</p>
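<p>The clearing rule above can be sketched as a small function. This is only an illustration of the rule as described, not the product's implementation: the function name <strong>should_clear</strong> and the sample values are hypothetical, and usage and threshold are assumed to be expressed in the same unit (for example, percent).</p>

```python
def should_clear(usage: float, threshold: float, trigger_count: int) -> bool:
    """Return True if the alarm would be cleared under the rule described above.

    Hypothetical helper: usage and threshold share one unit (e.g. percent).
    """
    if trigger_count < 1:
        raise ValueError("trigger_count must be at least 1")
    if trigger_count == 1:
        # Trigger Count is 1: clear once usage is at or below the threshold.
        return usage <= threshold
    # Trigger Count is greater than 1: clear only at or below 90% of the threshold.
    return usage <= 0.9 * threshold
```

<p>With a hypothetical threshold of 90, a usage of 85 clears the alarm when <strong>Trigger Count</strong> is 1, but not when it is 3, because 85 is still above 90% of the threshold.</p>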
</div>
<div class="section" id="ALM-12027__s100ea51423104e978209f1955534fa27"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12027__en-us_topic_0070543581_table26821252" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12027__en-us_topic_0070543581_row57828837"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-12027__en-us_topic_0070543581_p53624206">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-12027__en-us_topic_0070543581_p48593428">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-12027__en-us_topic_0070543581_p43753566">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12027__en-us_topic_0070543581_row54377940"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-12027__en-us_topic_0070543581_p42537056">12027</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-12027__en-us_topic_0070543581_p22949479">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-12027__en-us_topic_0070543581_p46968531">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12027__s0e56b478a67a4be0bc1ff52da93ed720"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-12027__en-us_topic_0070543581_table46354700" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-12027__en-us_topic_0070543581_row29477662"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-12027__en-us_topic_0070543581_p38880413">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-12027__en-us_topic_0070543581_p62305773">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-12027__row4837155215414"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12027__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12027__p692551319435">Specifies the cluster or system for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12027__en-us_topic_0070543581_row13602870"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12027__en-us_topic_0070543581_p28090655">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12027__en-us_topic_0070543581_p60750544">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12027__en-us_topic_0070543581_row9883987"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12027__en-us_topic_0070543581_p62405508">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12027__en-us_topic_0070543581_p21681426">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12027__en-us_topic_0070543581_row60915111"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12027__en-us_topic_0070543581_p35176971">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12027__en-us_topic_0070543581_p30762366">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-12027__en-us_topic_0070543581_row8425846"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-12027__en-us_topic_0070543581_p11404934">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-12027__en-us_topic_0070543581_p51384442">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-12027__s0fe82127f2e84450a24be46e715835ca"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-12027__en-us_topic_0070543581_p1390267">No PIDs are available for new processes, so service processes become unavailable.</p>
</div>
<div class="section" id="ALM-12027__s3ddd6cfc758a404a82adc3dfe898bd66"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-12027__p681753145417">Too many processes are running on the node for the current value of <strong id="ALM-12027__en-us_topic_0070543581_b61845569">pid_max</strong>, so the value needs to be increased.</p>
</div>
<div class="section" id="ALM-12027__s9445b6fc399a470295ea751769713fde"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-12027__en-us_topic_0070543581_p55372696"><strong id="ALM-12027__b360029529747">Increase the value of pid_max.</strong></p>
<ol id="ALM-12027__ol240915109757"><li id="ALM-12027__li639798269750"><span>In the alarm list on FusionInsight Manager, click <span><img id="ALM-12027__image168221113135319" src="en-us_image_0269383832.png"></span> in the row where the alarm is located to view the address of the affected host in the alarm details.</span></li><li id="ALM-12027__li149834549750"><span>Log in to the host where the alarm is generated as user <strong id="ALM-12027__b389475309750">root</strong>. <span id="ALM-12027__text43649449460"></span></span></li><li id="ALM-12027__li513020679750"><span>Run the <strong id="ALM-12027__b6333589750">cat /proc/sys/kernel/pid_max</strong> command to check the value of <strong id="ALM-12027__b57002299750">pid_max</strong>.</span></li><li id="ALM-12027__li205272659750"><span>If the PID usage exceeds the threshold, run the <strong id="ALM-12027__b590654259750">echo </strong><em id="ALM-12027__i618267859750">new value </em><strong id="ALM-12027__b195701549750">> /proc/sys/kernel/pid_max</strong> command to increase the value of <strong id="ALM-12027__b419136639750">pid_max</strong>.</span><p><p class="litext" id="ALM-12027__p395635099750">Example: <strong id="ALM-12027__b416786479750">echo 65536 > /proc/sys/kernel/pid_max</strong></p>
<div class="note" id="ALM-12027__note163571615102916"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-12027__p10664145203015">The maximum value of <span class="parmname" id="ALM-12027__parmname1566455103015"><b>pid_max</b></span> is as follows:</p>
<ul id="ALM-12027__ul13990143413014"><li id="ALM-12027__li7990034173015">On 32-bit systems: 32768</li><li id="ALM-12027__li799018345307">On 64-bit systems: 4194304 (2^22)</li></ul>
</div></div>
</p></li><li id="ALM-12027__li148339459750"><span>Wait for 5 minutes, and check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-12027__ul590069549750"><li id="ALM-12027__li505276609750">If yes, no further action is required.</li><li id="ALM-12027__li662086519750">If no, go to <a href="#ALM-12027__li377225729750">6</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-12027__p61837339750"><strong id="ALM-12027__b361001479817">Collect fault information.</strong></p>
<ol start="6" id="ALM-12027__ol116595289821"><li id="ALM-12027__li377225729750"><a name="ALM-12027__li377225729750"></a><a name="li377225729750"></a><span>On the FusionInsight Manager home page of the active cluster, choose <strong id="ALM-12027__b311203779750">O&amp;M</strong> > <strong id="ALM-12027__b116479379750">Log > Download</strong>.</span></li><li id="ALM-12027__li3107269750"><span>In <strong id="ALM-12027__b356295299750">Service</strong>, select all services, and click <strong id="ALM-12027__b3991118545">OK</strong>.</span></li><li id="ALM-12027__li1145664103113"><span>Click <span><img id="ALM-12027__image1945644173117" src="en-us_image_0269383834.png"></span> in the upper right corner, and set <strong id="ALM-12027__b6456941173117">Start Date</strong> and <strong id="ALM-12027__b11456154113318">End Date</strong> for log collection to 30 minutes before and 30 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-12027__b13456164113319">Download</strong>.</span></li><li id="ALM-12027__li495644512588"><span>Contact <span id="ALM-12027__text4614151421417">O&amp;M personnel</span> and send them the collected logs.</span></li></ol>
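<p>The log collection window in step 8 is simple date arithmetic. The sketch below shows it with a hypothetical helper named <strong>log_window</strong>. The alarm time is whatever appears in the alarm details, and the 30-minute margin comes from the step above.</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time: datetime, margin_minutes: int = 30):
    """Return (start, end) covering margin_minutes before and after alarm_time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin
```

<p>For an alarm generated at 12:00, the function returns 11:30 as the start and 12:30 as the end.</p>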
</div>
<div class="section" id="ALM-12027__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-12027__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-12027__s99f69a6c05834c85bf47a731f55376c2"><h4 class="sectiontitle">Related Information</h4><p id="ALM-12027__en-us_topic_0070543581_p32793969">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>