forked from docs/doc-exports
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
86 lines
13 KiB
HTML
86 lines
13 KiB
HTML
<a name="ALM-43020"></a><a name="ALM-43020"></a>
|
|
|
|
<h1 class="topictitle1">ALM-43020 Non-Heap Memory Usage of the IndexServer2x Process Exceeds the Threshold</h1>
|
|
<div id="body1573719169454"><div class="section" id="ALM-43020__s654199794cb646f5baa4518aefce49a3"><h4 class="sectiontitle">Description</h4><p id="ALM-43020__ab2a25e177015443ca3ded37167e5fc4d">The system checks the IndexServer2x process status every 30 seconds. The alarm is generated when the non-heap memory usage of the IndexServer2x process exceeds the threshold (95% of the maximum memory).</p>
|
|
</div>
|
|
<div class="section" id="ALM-43020__sa26ae86d3dad41409f83a1377a9ffcfa"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43020__tcf229e81dd344017b6e4cffa8812ea38" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43020__r71b321b2dfa44544b9d38c31a7c564c0"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-43020__a8c676edf82c34fe9ac00e771db46396a">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-43020__a49e228ba3e9744a9ba62df793cb9f48a">Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-43020__acf332948bf634701b2eb985488faaf8b">Auto Clear</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43020__r97968a0e761c4d90b952c9bfc25f44f9"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-43020__a441c0b910dff45a3822b47f0c38788b2">43020</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-43020__a3ea3972a589749afb36b6cb0998f8acf">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-43020__a14026af7cc9948e494ecb783327d2acd">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43020__s8847e557c6b3453aaee9f6581c60c7f0"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-43020__ta4b460384c754c91b30862b9ff824f4f" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-43020__r99aa4ff1011848d48389427aebb04c06"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-43020__aecb6aec3722f463da013cd6a9681d943">Parameter</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-43020__ab6eca64948c8482f8161c84f93f75401">Description</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-43020__row88669469128"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43020__p17935380415">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43020__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43020__r6d7be8e0e35a4cf08993625fc62e4301"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43020__p41293795">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43020__afadd76ad17914fd18b2494f51b17997f">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43020__r5e5fd6c56f564e7ea629ac99dc22bcce"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43020__p23892775">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43020__ad6d4ca57de514e04b1eb96e905a2938f">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43020__r9a551a22222e4651b10174987c965499"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43020__p14847206">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43020__aa6ded27644dd42aba2f76e0ecd52a010">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-43020__rb39a34fe9a5d406d8ffdb11e868ddecd"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-43020__a22165862128b459a910b476586ac7149">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-43020__a082c50ad862e4407b31c3d7e28fc781a">Specifies the threshold for triggering the alarm.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-43020__s7e65a524bbfd4a28ae8d0ad568b2d9bd"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-43020__af87172183dc64942a222a190ec490a2f">If the available IndexServer2x process non-heap memory is insufficient, a memory overflow occurs and the service breaks down.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43020__s01cd2ae89bd34527b0a20a4ae96da722"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-43020__a5f14f3167e2848079676c82ff8ea6912">The non-heap memory of the IndexServer2x process is overused or the non-heap memory is inappropriately allocated.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43020__section39311354121312"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-43020__a10260c3a3ddf4e069f6222fd6e97959f"><strong id="ALM-43020__b1676733103119">Check non-heap memory usage.</strong></p>
|
|
<ol id="ALM-43020__ol6115520193113"><li id="ALM-43020__li10114182010315"><span>On FusionInsight Manager, choose <strong id="ALM-43020__b838111632017">O&M</strong> > <strong id="ALM-43020__b1688165991316">Alarm</strong><strong id="ALM-43020__b27872374104950"> > Alarms</strong>. In the displayed alarm list, choose the alarm for which the ID is <strong id="ALM-43020__b1439121614204">43020</strong>, and check the <strong id="ALM-43020__b1955573445015">RoleName</strong> in <strong id="ALM-43020__b052583712505">Location</strong> and confirm the IP address of <strong id="ALM-43020__b1241513413507">HostName</strong>.</span></li><li id="ALM-43020__li5114122073117"><span>On FusionInsight Manager, choose <strong id="ALM-43020__b369453112017">Cluster</strong> > <span id="ALM-43020__text96951316208"><em id="ALM-43020__i369433122019">Name of the desired cluster</em></span> > <strong id="ALM-43020__b369593118208">Services</strong> > <strong id="ALM-43020__b669673111208">Spark2x</strong> > <strong id="ALM-43020__b669603182018">Instance</strong>. Click the IndexServer2x that reported the alarm to go to the <strong id="ALM-43020__b659464922315">Dashboard</strong> page. Click the drop-down list in the upper right corner of the chart area, and choose <strong id="ALM-43020__b11697123115209">Customize</strong> > <strong id="ALM-43020__b16698143142020">Memory > IndexServer2x Memory Usage Statistics</strong> > <strong id="ALM-43020__b5698131152011">OK</strong>. Check whether the non-heap memory used by the IndexServer2x process reaches the maximum non-heap memory threshold (95% by default).</span><p><ul class="subitemlist" id="ALM-43020__ul51141720103115"><li id="ALM-43020__li1411417208314">If the threshold is reached, go to <a href="#ALM-43020__li311482053120">3</a>.</li><li id="ALM-43020__li411419202318">If the threshold is not reached, go to <a href="#ALM-43020__li141131720123116">7</a>.</li></ul>
|
|
</p></li><li id="ALM-43020__li311482053120"><a name="ALM-43020__li311482053120"></a><a name="li311482053120"></a><span>On FusionInsight Manager, choose <strong id="ALM-43020__b223865710239">Cluster</strong> > <span id="ALM-43020__text181141520113113"><em id="ALM-43020__i81141020103112">Name of the desired cluster</em></span> > <strong id="ALM-43020__b6240115712239">Services</strong> > <strong id="ALM-43020__b024117575230">Spark2x</strong> > <strong id="ALM-43020__b1224155792312">Instance</strong>. Click the IndexServer2x that reported the alarm to go to the <strong id="ALM-43020__b28381556192411">Dashboard</strong> page. Click the drop-down list in the upper right corner of the chart area, and choose <strong id="ALM-43020__b324212579235">Customize</strong> ><strong id="ALM-43020__b1698729151719"> Memory</strong> > <strong id="ALM-43020__b10243105713236">Statistics for the non-heap memory of the IndexServer2x Process</strong> > <strong id="ALM-43020__b3243185722316">OK</strong>. Based on the alarm generation time, check the values of the used non-heap memory of the IndexServer2x process in the corresponding period and obtain the maximum value.</span></li><li id="ALM-43020__li1115192011314"><span>On FusionInsight Manager, choose <strong id="ALM-43020__b1701413122619">Cluster</strong> > <span id="ALM-43020__text127031315269"><em id="ALM-43020__i97031314262">Name of the desired cluster</em></span> > <strong id="ALM-43020__b1670713132610">Services</strong> > <strong id="ALM-43020__b117041332614">Spark2x</strong> > <strong id="ALM-43020__b1707135263">Configurations</strong> > <strong id="ALM-43020__b1470161332610">All Configuration</strong><strong id="ALM-43020__b1964616347434">s</strong> > <strong id="ALM-43020__b1771111313268">IndexServer2x</strong>> <strong id="ALM-43020__b171111311266">Tuning</strong>. You can change the value of <strong id="ALM-43020__b6699154811277">XX:MaxMetaspaceSize</strong> in the <strong id="ALM-43020__b162571302282">spark.driver.extraJavaOptions</strong> parameter based on the ratio of the maximum non-heap memory used by the IndexServer2x process to the threshold specified by <strong id="ALM-43020__b772161322612">IndexServer2x Non-Heap Memory Usage Statistics (IndexServer2x)</strong> in the alarm period.</span><p><div class="note" id="ALM-43020__note18115182020315"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-43020__p011532019315">On FusionInsight Manager, you can choose <strong id="ALM-43020__b6817124115282">O&M</strong> > <strong id="ALM-43020__b19818104132813">Alarm</strong> > <strong id="ALM-43020__b2819541192813">Thresholds</strong> > <em id="ALM-43020__i8820341112810">Name of the desired cluster</em> > <strong id="ALM-43020__b282084182813">Spark2x</strong> > <strong id="ALM-43020__b1982134116286">Memory</strong> > <strong id="ALM-43020__b1782274110282">IndexServer2x Non-Heap Memory Usage Statistics (IndexServer2x)</strong> to view the threshold.</p>
|
|
</div></div>
|
|
</p></li><li id="ALM-43020__li656515289295"><span>Restart all IndexServer2x instances.</span></li><li id="ALM-43020__li211582033115"><span>After 10 minutes, check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-43020__ul12115320123119"><li id="ALM-43020__li61151220153112">If the alarm is cleared, no further action is required.</li><li id="ALM-43020__li911512012316">If the alarm is not cleared, go to <a href="#ALM-43020__li141131720123116">7</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p id="ALM-43020__a5f67c845ae194d15b4e5da41d8afcf80"><strong id="ALM-43020__b665112153315">Collect fault information</strong>.</p>
|
|
<ol start="7" id="ALM-43020__ol5113320203117"><li id="ALM-43020__li141131720123116"><a name="ALM-43020__li141131720123116"></a><a name="li141131720123116"></a><span>On FusionInsight Manager, choose <strong id="ALM-43020__b1854624844415">O&M</strong> > <strong id="ALM-43020__b1547144819446">Log</strong> > <strong id="ALM-43020__b115474485449">Download</strong>.</span></li><li id="ALM-43020__li17113122012310"><span>Expand the <strong id="ALM-43020__b1899192712389">Service</strong> drop-down list, and select <strong id="ALM-43020__b71306317335">Spark2x</strong> for the target cluster.</span></li><li id="ALM-43020__li13113620203117"><span>Click <span><img id="ALM-43020__image11113620133114" src="en-us_image_0269417546.png"></span> in the upper right corner, and set <strong id="ALM-43020__b18831547123618">Start Date</strong> and <strong id="ALM-43020__b1483434743616">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-43020__b583564713611">Download</strong>.</span></li><li id="ALM-43020__li211320200310"><span>Contact the <span id="ALM-43020__text4614151421417">O&M personnel</span> and provide the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-43020__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-43020__p754913417333">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-43020__s9121af30e9174ff4a8ea197579ce835d"><h4 class="sectiontitle">Reference</h4><p id="ALM-43020__ab49c3640168848f38fc60ef20476b006">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|