forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
88 lines
11 KiB
HTML
88 lines
11 KiB
HTML
<a name="ALM-17005"></a><a name="ALM-17005"></a>
|
|
|
|
<h1 class="topictitle1">ALM-17005 Oozie Non Heap Memory Usage Exceeds the Threshold</h1>
|
|
<div id="body4303804"><div class="section" id="ALM-17005__section41852036"><h4 class="sectiontitle">Description</h4><p id="ALM-17005__p13063849">The system checks the non heap memory usage of Oozie every 30 seconds. This alarm is reported if the non heap memory usage of Oozie exceeds the threshold (80%). This alarm is cleared if the non heap memory usage is lower than the threshold.</p>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section41124012"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-17005__table51538867" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-17005__row55464715"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-17005__p63456919">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-17005__p39736775">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-17005__p64562179">Auto Clear</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-17005__row62154036"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-17005__p1312194">17005</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-17005__p39178900">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-17005__p19374355">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section34571792"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-17005__table25818918" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-17005__row2151805"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-17005__p40078531">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-17005__p25135585">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-17005__row1143823817213"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17005__p156438591896">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17005__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-17005__row22716544"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17005__p65062640">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17005__p61572960">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-17005__row17285729"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17005__p35626567">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17005__p64802221">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-17005__row46349082"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17005__p51620924">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17005__p26070279">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-17005__row33305921"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-17005__p13425051">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-17005__p13687329">Specifies the threshold for triggering the alarm.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section42710672"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-17005__p34931901">Non-heap memory overflow may cause service breakdown.</p>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section48851735"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-17005__p10911700">The non-heap memory of the Oozie instance is overused or the non-heap memory is inappropriately allocated.</p>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section37012437"><h4 class="sectiontitle">Procedure</h4><p id="ALM-17005__p11432528"><strong id="ALM-17005__b26039876191326">Check non-heap memory usage.</strong></p>
|
|
<ol id="ALM-17005__ol35783896"><li id="ALM-17005__li53619609"><span>On FusionInsight Manager, choose <strong id="ALM-17005__b3978900926917">O&M</strong> > <strong id="ALM-17005__b19578228986917">Alarm</strong> > <strong id="ALM-17005__b13915297806917">Alarms</strong> > <strong id="ALM-17005__b12063639536917">Oozie Non Heap Memory Usage Exceeds the Threshold</strong>. On the displayed page, check the location information of the alarm. Check the name of the instance host for which the alarm is generated.</span></li><li id="ALM-17005__li12814438"><span>On FusionInsight Manager, choose <strong id="ALM-17005__b11994192361810">Cluster</strong> > <em id="ALM-17005__i1042501363319">Name of the target cluster</em> > <strong id="ALM-17005__b1371618311184">Services</strong> > <strong id="ALM-17005__b29486376186">Oozie</strong> and click the <strong id="ALM-17005__b3687559171815">Instance</strong> tab. On the displayed page, select the role corresponding to the host name for which the alarm is generated and select <strong id="ALM-17005__b18721759173111">Customize</strong> from the drop-down list in the upper right corner of the chart area. Choose <strong id="ALM-17005__b1221965222019">Memory</strong> and select <strong id="ALM-17005__b76583161215">Oozie Non Heap Memory Resource Percentage</strong>. Click <strong id="ALM-17005__b1977217015229">OK</strong>.</span></li><li id="ALM-17005__li48221078"><span>Check whether the non-heap memory used by Oozie reaches the threshold (80% of the maximum non-heap memory by default).</span><p><ul id="ALM-17005__ul31336524"><li id="ALM-17005__li13593264">If yes, go to <a href="#ALM-17005__l3672051debd1416aa3b54541a7a480cb">4</a>.</li><li id="ALM-17005__li27312590">If no, go to <a href="#ALM-17005__d0e31729">6</a>.</li></ul>
|
|
</p></li><li id="ALM-17005__l3672051debd1416aa3b54541a7a480cb"><a name="ALM-17005__l3672051debd1416aa3b54541a7a480cb"></a><a name="l3672051debd1416aa3b54541a7a480cb"></a><span>On FusionInsight Manager, choose <strong id="ALM-17005__b575817682417">Cluster</strong> > <em id="ALM-17005__i089412109241">Name of the target cluster</em> > <strong id="ALM-17005__b13771418152414">Services</strong> > <strong id="ALM-17005__b206169248248">Oozie</strong> and click the <strong id="ALM-17005__b7642113814247">Configurations</strong> and then <strong id="ALM-17005__b0707849172410">All Configurations</strong>. On the displayed page, search for the <strong id="ALM-17005__b22312328254">GC_OPTS</strong> parameter in the search box and check whether it contains <strong id="ALM-17005__b7592750172516">-XX: MaxMetaspaceSize</strong>. If yes, increase the value of <strong id="ALM-17005__b659132517271">-XX: MaxMetaspaceSize</strong> based on the site requirements. If no, manually add <strong id="ALM-17005__b386023616271">-XX: MaxMetaspaceSize</strong> and set its value to 1/8 of the value of <strong id="ALM-17005__b46443815299">-Xmx</strong>. Click <strong id="ALM-17005__b6384786066917">Save</strong>, and then click <strong id="ALM-17005__b11483163386917">OK</strong></span><p><div class="note" id="ALM-17005__note135236213815"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-17005__p11638229889">JDK1.8 does not support the <strong id="ALM-17005__b13580174183017">MaxPermSize</strong> parameter.</p>
|
|
<p id="ALM-17005__p711773516567">Suggestions on GC parameter settings for Oozie:</p>
|
|
<p id="ALM-17005__p1083617504270">Set the value of <strong id="ALM-17005__b16895012296917">-XX:MaxMetaspaceSize</strong> to 1/8 of the value of <strong id="ALM-17005__b3943837476917">-Xmx</strong>. For example, if <strong id="ALM-17005__b1603839256917">-Xmx</strong> is set to 2 GB, <strong id="ALM-17005__b7863337416917">-XX:MaxMetaspaceSize</strong> is set to 256 MB. If <strong id="ALM-17005__b14746299016917">-Xmx</strong> is set to 4 GB, <strong id="ALM-17005__b13244742996917">-XX:MaxMetaspaceSize</strong> is set to 512 MB.</p>
|
|
</div></div>
|
|
</p></li><li id="ALM-17005__li46654754"><span>Restart the affected services or instances and check whether the alarm is cleared.</span><p><ul id="ALM-17005__ul17239608"><li id="ALM-17005__li20938747">If yes, no further action is required.</li><li id="ALM-17005__li54230996">If no, go to <a href="#ALM-17005__d0e31729">6</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p id="ALM-17005__p30634516"><strong id="ALM-17005__b21649344191335">Collect the fault information.</strong></p>
|
|
<ol start="6" id="ALM-17005__ol29627899191348"><li id="ALM-17005__d0e31729"><a name="ALM-17005__d0e31729"></a><a name="d0e31729"></a><span>On FusionInsight Manager, choose <strong id="ALM-17005__b52681523142810">O&M</strong>. In the navigation pane on the left, choose <strong id="ALM-17005__b427092362817">Log</strong> > <strong id="ALM-17005__b1827120233287">Download</strong>.</span></li><li id="ALM-17005__li52419431"><span>Expand the <strong id="ALM-17005__b1523887146917">Service</strong> drop-down list, and select <strong id="ALM-17005__b21350890886917">Oozie</strong> for the target cluster.</span></li><li id="ALM-17005__li2012833"><span>Click <span><img id="ALM-17005__image104601319175315" src="en-us_image_0263895663.png"></span> in the upper right corner, and set <strong id="ALM-17005__b11743172413214">Start Date</strong> and <strong id="ALM-17005__b974422419324">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-17005__b9744122483219">Download</strong>.</span></li><li id="ALM-17005__li18115505"><span>Contact <span id="ALM-17005__text126301214142412">O&M personnel</span> and provide the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-17005__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
|
|
</div>
|
|
<div class="section" id="ALM-17005__section56407894"><h4 class="sectiontitle">Related Information</h4><p id="ALM-17005__p40534999">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|