doc-exports/docs/mrs/umn/ALM-45586.html
Yang, Tong 2195db241c MRS UMN 20231220 version update
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-05-16 09:40:21 +00:00


<a name="ALM-45586"></a><a name="ALM-45586"></a>
<h1 class="topictitle1">ALM-45586 IoTDBServer Heap Memory Usage Exceeds the Threshold</h1>
<div id="body0000001154314995"><div class="section" id="ALM-45586__section13447226"><h4 class="sectiontitle">Description</h4><p id="ALM-45586__p61726833">The system checks the heap memory usage of the IoTDBServer process every 60 seconds. The alarm is generated when the heap memory usage exceeds the threshold (by default, 90% of the maximum heap memory).</p>
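<p>The threshold check described above can be sketched as follows. This is a minimal illustration in Python, not MRS Manager code: the function names and the sample values are hypothetical, and it assumes that "exceeds" means strictly greater than the threshold.</p>

```python
def heap_usage_percent(used_bytes: int, max_bytes: int) -> float:
    """Return heap usage as a percentage of the maximum heap size."""
    if max_bytes <= 0:
        raise ValueError("max_bytes must be positive")
    return 100.0 * used_bytes / max_bytes


def exceeds_threshold(used_bytes: int, max_bytes: int,
                      threshold_percent: float = 90.0) -> bool:
    """Return True when usage is strictly above the threshold (assumed semantics)."""
    return heap_usage_percent(used_bytes, max_bytes) > threshold_percent


if __name__ == "__main__":
    GIB = 1024 ** 3
    # Hypothetical sample: 1.9 GiB used out of a 2 GiB maximum heap.
    print(exceeds_threshold(int(1.9 * GIB), 2 * GIB))
```

<p>With a 2 GiB maximum heap, the sample usage of about 1.9 GiB is roughly 95% and is above the default threshold.</p>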
</div>
<div class="section" id="ALM-45586__section53916176"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45586__table33817547" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45586__row8931076"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-45586__p52328576">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-45586__p10756297">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-45586__p65953734">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45586__row40652256"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-45586__p16829917173515">45586</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-45586__p28828553">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-45586__p53411432">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45586__section15483537"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45586__table31358724" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45586__row33518103"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-45586__p30611809">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-45586__p63637484">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45586__row163311621185116"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45586__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45586__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45586__row54362592"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45586__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45586__p56463136">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45586__row38406179"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45586__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45586__p56266616">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45586__row36637496"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45586__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45586__p61773077">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45586__section5134112"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-45586__p42871020143312">If the IoTDBServer process runs out of available heap memory, a memory overflow may occur and the service may become unavailable.</p>
</div>
<div class="section" id="ALM-45586__section46207013"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-45586__p62452010">The heap memory of the IoTDBServer process is overused or the heap memory is inappropriately allocated.</p>
</div>
<div class="section" id="ALM-45586__section69009731417"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-45586__p25448063"><strong id="ALM-45586__b501332858554">Check the heap memory usage.</strong></p>
<ol id="ALM-45586__ol64950458558"><li id="ALM-45586__li823572385459"><span>Log in to <span id="ALM-45586__text34789336432">MRS</span> Manager and choose <strong id="ALM-45586__b75361028304">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45586__b1454492812015">Alarm</strong> &gt; <strong id="ALM-45586__b1654515289015">Alarms</strong>. On the page that is displayed, locate the row containing the alarm whose <strong id="ALM-45586__b254512286016">Alarm ID</strong> is <strong id="ALM-45586__b4546328908">45586</strong>, view the role name in <strong id="ALM-45586__b1554612288016">Location</strong>, and check the instance IP address.</span></li><li id="ALM-45586__li701264785459"><span>Choose <strong id="ALM-45586__b12187125218422">Cluster</strong> &gt; <em id="ALM-45586__i1018865294216">Name of the desired cluster</em> &gt; <strong id="ALM-45586__b918875211426">Service</strong> &gt; <strong id="ALM-45586__b6189105213429">IoTDB</strong> &gt; <strong id="ALM-45586__b13189155219425">Instance</strong>. Click the IoTDBServer for which the alarm is generated to go to <strong id="ALM-45586__b3190552134210">Dashboard</strong>. Click the drop-down list in the upper right corner of the chart area and choose <strong id="ALM-45586__b15107114131317">Customize</strong> &gt; <strong id="ALM-45586__b13315129101416">Memory</strong>. In the dialog box that is displayed, select <strong id="ALM-45586__b1410724118137">IoTDBServer Heap Memory Resource Percentage</strong>, and click <strong id="ALM-45586__b1010710414139">OK</strong>. Check whether the used heap memory of the IoTDBServer process reaches 90% (by default) of the maximum heap memory specified for IoTDBServer.</span><p><ul id="ALM-45586__ul651740114116"><li id="ALM-45586__li17511540194110">If yes, go to <a href="#ALM-45586__li1141514131368">3</a>.</li><li id="ALM-45586__li677103994314">If no, go to <a href="#ALM-45586__li4749473185459">5</a>.</li></ul>
</p></li><li id="ALM-45586__li1141514131368"><a name="ALM-45586__li1141514131368"></a><a name="li1141514131368"></a><span>Choose <strong id="ALM-45586__b1265015532551">Cluster</strong> &gt; <em id="ALM-45586__i122285665512">Name of the desired cluster</em> &gt; <strong id="ALM-45586__b1738175814554">Service</strong> &gt; <strong id="ALM-45586__b119971735564">IoTDB</strong> &gt; <strong id="ALM-45586__b659261815562">Configuration</strong>, click <strong id="ALM-45586__b1022932313569">All Configurations</strong>, choose <strong id="ALM-45586__b10134203610564">IoTDBServer</strong> &gt; <strong id="ALM-45586__b0924133805618">System</strong>, and increase the value of <strong id="ALM-45586__b1039864510574">-Xmx</strong> in the <strong id="ALM-45586__b635819479576">GC_OPTS</strong> parameter.</span><p><div class="note" id="ALM-45586__note07452211661"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-45586__ul945169983"><li id="ALM-45586__li14454917813">The default value of <strong id="ALM-45586__b6237101585815">-Xmx</strong> is <strong id="ALM-45586__b15700181795810">2G</strong>.</li><li id="ALM-45586__li345191981">If this alarm is generated occasionally, increase the value by 0.5 times (that is, to 1.5 times the current value). If this alarm is generated frequently, double the value.</li><li id="ALM-45586__li8451090817">If the service volume and service concurrency are high, you are advised to add instances.</li></ul>
</div></div>
</p></li><li id="ALM-45586__li5001643585459"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-45586__ul5612728785459"><li id="ALM-45586__li4030548885459">If yes, no further action is required.</li><li id="ALM-45586__li4351906785459">If no, go to <a href="#ALM-45586__li4749473185459">5</a>.</li></ul>
</p></li></ol>
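<p>The sizing guidance in the note for step 3 can be sketched as a small calculation. This is an illustrative Python sketch only: the helper names and the sample <strong>GC_OPTS</strong> string are hypothetical, it assumes that <strong>-Xmx</strong> is written with a K, M, or G suffix, and it reads "increase the value by 0.5 times" as multiplying the current value by 1.5.</p>

```python
import re

# Unit multipliers in bytes for the suffixes this sketch understands.
_UNITS = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}
_XMX_PATTERN = re.compile(r"-Xmx(\d+)([KkMmGg])")


def parse_xmx_mib(gc_opts: str) -> int:
    """Return the -Xmx value found in a JVM options string, in MiB."""
    match = _XMX_PATTERN.search(gc_opts)
    if match is None:
        raise ValueError("no -Xmx option with a K/M/G suffix found")
    value, unit = int(match.group(1)), match.group(2).upper()
    return value * _UNITS[unit] // _UNITS["M"]


def suggest_xmx_mib(current_mib: int, frequent: bool) -> int:
    """Double the heap for frequent alarms, otherwise grow it by half."""
    factor = 2.0 if frequent else 1.5
    return int(current_mib * factor)


def replace_xmx(gc_opts: str, new_mib: int) -> str:
    """Return gc_opts with the first -Xmx option rewritten in MiB."""
    return _XMX_PATTERN.sub(f"-Xmx{new_mib}M", gc_opts, count=1)


if __name__ == "__main__":
    # Hypothetical GC_OPTS value; only the 2G default comes from this topic.
    opts = "-Xms2G -Xmx2G"
    current = parse_xmx_mib(opts)
    print(replace_xmx(opts, suggest_xmx_mib(current, frequent=False)))
```

<p>Starting from the 2G default, this yields 3072M for an occasional alarm and 4096M for a frequent one. Check that the host has enough physical memory before applying a larger value.</p>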
<p class="tableheading" id="ALM-45586__p3538354385459"><strong id="ALM-45586__b6160463585522">Collect the fault information.</strong></p>
<ol start="5" id="ALM-45586__ol4790308885524"><li id="ALM-45586__li4749473185459"><a name="ALM-45586__li4749473185459"></a><a name="li4749473185459"></a><span>On <span id="ALM-45586__text1024710127233">MRS</span> Manager, choose <strong id="ALM-45586__b980452253312">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45586__b78051422143311">Log</strong> &gt; <strong id="ALM-45586__b10807182243316">Download</strong>.</span></li><li id="ALM-45586__li2648019085459"><span>Expand the <strong id="ALM-45586__b107731555204212">Service</strong> drop-down list, select <strong id="ALM-45586__b377917558423">IoTDB</strong> for the target cluster, and click <strong id="ALM-45586__b37801655154216">OK</strong>.</span></li><li id="ALM-45586__li3699511985459"><span>Click <span><img id="ALM-45586__image104601319175315" src="en-us_image_0000001582927541.png"></span> in the upper right corner, set <strong id="ALM-45586__b105181640194716">Start Date</strong> to 30 minutes before the alarm generation time and <strong id="ALM-45586__b251834044718">End Date</strong> to 30 minutes after it, and then click <strong id="ALM-45586__b951820402479">Download</strong>.</span></li><li id="ALM-45586__li4381466885459"><span>Contact <span id="ALM-45586__text727143394012">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
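<p>The log collection window in step 7 is a 30-minute margin on either side of the alarm generation time. The following Python sketch shows the arithmetic; the function name and the sample timestamp are hypothetical and are not part of MRS Manager.</p>

```python
from datetime import datetime, timedelta
from typing import Tuple


def log_collection_window(alarm_time: datetime,
                          margin_minutes: int = 30) -> Tuple[datetime, datetime]:
    """Return (start, end) spanning margin_minutes before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


if __name__ == "__main__":
    # Hypothetical alarm generation time, for illustration only.
    start, end = log_collection_window(datetime(2024, 1, 1, 12, 0, 0))
    print("Start Date:", start.isoformat(sep=" "))
    print("End Date:  ", end.isoformat(sep=" "))
```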
</div>
<div class="section" id="ALM-45586__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-45586__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-45586__section51780573"><h4 class="sectiontitle">Related Information</h4><p id="ALM-45586__p54528917">None</p>
</div>
<p id="ALM-45586__p8060118"></p>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>