Files
doc-exports/docs/mrs/umn/alm_43006.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

71 lines
9.9 KiB
HTML

<a name="alm_43006"></a><a name="alm_43006"></a>
<h1 class="topictitle1">ALM-43006 Heap Memory Usage of the JobHistory Process Exceeds the Threshold</h1>
<div id="body8662426"><div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section43920869"><h4 class="sectiontitle">Description</h4><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039367_p24102752">The system checks the JobHistory process status every 30 seconds. The alarm is generated when the heap memory usage of the JobHistory process exceeds the threshold (90% of the maximum memory).</p>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section59743502"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_table64843092" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row10409628"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p37873528">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p47856888">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p51202692">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row53777413"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p61003235">43006</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p42315013">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p4964052">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section820607"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_table66543927" frame="border" border="1" rules="all"><thead align="left"><tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row61284534"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p65100236">Parameter</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p38627770">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row41841705"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p33734977">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p48178601">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row30954226"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p24264406">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p19259870">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_row39121107"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p14693133">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p49293152">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section7385465"><h4 class="sectiontitle">Impact on the System</h4><p id="alm_43006__en-us_topic_0191813968_p64414474151214">If the available JobHistory process heap memory is insufficient, a memory overflow occurs and the service breaks down.</p>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section66469189"><h4 class="sectiontitle">Possible Causes</h4><p id="alm_43006__en-us_topic_0191813968_p18959449181526">The heap memory of the JobHistory process is overused or the heap memory is inappropriately allocated.</p>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section61351797"><h4 class="sectiontitle">Procedure</h4><ol id="alm_43006__en-us_topic_0191813968_ol20218781175629"><li id="alm_43006__en-us_topic_0191813968_li47751305175629"><span>Check the heap memory usage.</span><p><ol type="a" id="alm_43006__en-us_topic_0191813968_ol17980506181612"><li id="alm_43006__en-us_topic_0191813968_li1487713813414">Go to the cluster details page and choose <strong id="alm_43006__b103432358254555">Alarms</strong>.</li><li id="alm_43006__en-us_topic_0191813968_li58727490181612">Select the alarm whose <strong id="alm_43006__b23459885954555">Alarm ID</strong> is <strong id="alm_43006__b82347847854555">43006</strong> and view the IP address and role name of the instance in <strong id="alm_43006__b12814901254555">Location</strong>.</li><li id="alm_43006__en-us_topic_0191813968_li37461388181615">Choose <strong id="alm_43006__b31661941654555">Components</strong> &gt; <strong id="alm_43006__b95571814554555">Spark</strong> &gt; <strong id="alm_43006__b158383996654555">Instance</strong> &gt; <strong id="alm_43006__b164666889154555">JobHistory</strong> (IP address of the instance for which the alarm is generated) &gt; <strong id="alm_43006__b16402060554555">Customize</strong> &gt; <strong id="alm_43006__b16580335054555">Heap Memory Statistics of the JobHistory Process</strong>. Click <strong id="alm_43006__b173219205054555">OK</strong> to view the heap memory usage.</li><li id="alm_43006__en-us_topic_0191813968_li5803814181617">Check whether the used heap memory of JobHistory reaches 90% of the maximum heap memory specified for JobHistory.<ul id="alm_43006__en-us_topic_0191813968_ul2889331181625"><li id="alm_43006__en-us_topic_0191813968_li5345176181624">If yes, go to <a href="#alm_43006__en-us_topic_0191813968_li1011493181634">1.e</a>.</li><li id="alm_43006__en-us_topic_0191813968_li14368473181624">If no, go to <a href="#alm_43006__en-us_topic_0191813968_li572522141314">2</a>.</li></ul>
</li><li id="alm_43006__en-us_topic_0191813968_li1011493181634"><a name="alm_43006__en-us_topic_0191813968_li1011493181634"></a><a name="en-us_topic_0191813968_li1011493181634"></a>Choose <strong id="alm_43006__b176451504318">Components</strong> &gt; <strong id="alm_43006__b664570123115">Spark</strong> &gt; <strong id="alm_43006__b1264519003112">Service Configuration</strong>. Set <strong id="alm_43006__b264510193115">Type</strong> to <strong id="alm_43006__b7645110143116">All</strong> and choose <strong id="alm_43006__b4646701312">JobHistory</strong> &gt; <strong id="alm_43006__b564600103110">Default</strong>. Increase the value of <span class="parmname" id="alm_43006__en-us_topic_0191813968_parmname48625566112446"><b>SPARK_DAEMON_MEMORY</b></span> as required.</li><li id="alm_43006__li671751111018">Click <strong id="alm_43006__b53852882215">Save Configuration</strong> and select <strong id="alm_43006__b1139228122216">Restart the affected services or instances</strong>. Click <strong id="alm_43006__b3864286529409">OK</strong>.</li><li id="alm_43006__en-us_topic_0191813968_li11969688181637">Check whether the alarm is cleared.<ul id="alm_43006__en-us_topic_0191813968_ul51315766181641"><li id="alm_43006__en-us_topic_0191813968_li54070239181641">If yes, no further action is required.</li><li id="alm_43006__en-us_topic_0191813968_li17383192181641">If no, go to <a href="#alm_43006__en-us_topic_0191813968_li572522141314">2</a>.</li></ul>
</li></ol>
</p></li><li id="alm_43006__en-us_topic_0191813968_li572522141314"><a name="alm_43006__en-us_topic_0191813968_li572522141314"></a><a name="en-us_topic_0191813968_li572522141314"></a><span>Collect fault information.</span><p><ol type="a" id="alm_43006__en-us_topic_0191813968_en-us_topic_0191813935_ol6089206913036"><li id="alm_43006__en-us_topic_0191813968_en-us_topic_0191813935_li4478836213036">On MRS Manager, choose <span class="menucascade" id="alm_43006__menucascade208385165954555"><b><span class="uicontrol" id="alm_43006__uicontrol104862443154555">System</span></b> &gt; <b><span class="uicontrol" id="alm_43006__uicontrol75010173554555">Export Log</span></b></span>.</li><li id="alm_43006__li18574327401">Contact technical support engineers for help. For details, see <a href="https://docs.otc.t-systems.com/en-us/public/learnmore.html" target="_blank" rel="noopener noreferrer">technical support</a>.</li></ol>
</p></li></ol>
</div>
<div class="section" id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_section15295265"><h4 class="sectiontitle">Reference</h4><p id="alm_43006__en-us_topic_0191813968_en-us_topic_0087039425_p7510612">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_0241.html">Alarm Reference (Applicable to Versions Earlier Than MRS 3.x)</a></div>
</div>
</div>