Files
doc-exports/docs/mrs/umn/ALM-45587.html
Yang, Tong 2195db241c MRS UMN 20231220 version update
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-05-16 09:40:21 +00:00

86 lines
11 KiB
HTML

<a name="ALM-45587"></a><a name="ALM-45587"></a>
<h1 class="topictitle1">ALM-45587 IoTDBServer GC Duration Exceeds the Threshold</h1>
<div id="body0000001154395079"><div class="section" id="ALM-45587__section13447226"><h4 class="sectiontitle">Description</h4><p id="ALM-45587__p1320416552418">The system checks the GC duration of the IoTDBServer process every 60 seconds. This alarm is generated when the GC duration exceeds the threshold (12 seconds by default) for three consecutive times. You can choose <strong id="ALM-45587__b362717574556">O&amp;M</strong> &gt; <strong id="ALM-45587__b4315959165516">Alarm</strong> &gt; <strong id="ALM-45587__b935418235614">Threshold Configuration</strong> &gt; <em id="ALM-45587__i16503618562">Name of the desired cluster</em> &gt; <strong id="ALM-45587__b1593241218566">IoTDB</strong> &gt; <strong id="ALM-45587__b20266131410568">GC</strong> &gt; <strong id="ALM-45587__b19500191412589">Total GC duration of IoTDBServer process (IoTDBServer)</strong> to change the threshold. This alarm is cleared when the GC duration is less than the threshold.</p>
</div>
<div class="section" id="ALM-45587__section53916176"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45587__table33817547" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45587__row8931076"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-45587__p52328576">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-45587__p10756297">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-45587__p65953734">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45587__row40652256"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-45587__p179511920165120">45587</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-45587__p28828553">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-45587__p53411432">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45587__section15483537"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45587__table31358724" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45587__row33518103"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-45587__p30611809">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-45587__p63637484">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45587__row163311621185116"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45587__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45587__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45587__row54362592"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45587__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45587__p56463136">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45587__row38406179"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45587__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45587__p56266616">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45587__row36637496"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45587__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45587__p61773077">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45587__row183811022195213"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45587__p338211225529">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45587__p1738242213528">Specifies the threshold for triggering the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45587__section5134112"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-45587__p42871020143312">A long GC duration of the IoTDBServer process may interrupt the services.</p>
</div>
<div class="section" id="ALM-45587__section46207013"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-45587__p62452010">The heap memory of the IoTDBServer process is overused or inappropriately allocated, causing frequent occurrence of the GC process.</p>
</div>
<div class="section" id="ALM-45587__section207482016103817"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-45587__p25448063"><strong id="ALM-45587__b4446828118">Check the GC duration.</strong></p>
<ol id="ALM-45587__ol64950458558"><li id="ALM-45587__li823572385459"><span>Log in to <span id="ALM-45587__text34789336432">MRS</span> Manager and choose <strong id="ALM-45587__b1391125194511">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45587__b1597162574514">Alarm</strong> &gt; <strong id="ALM-45587__b1698102594518">Alarms</strong>. On the page that is displayed, locate the row containing the alarm whose <strong id="ALM-45587__b15112113625614">Alarm ID</strong> is <strong id="ALM-45587__b233324315565">45587</strong>, view the role name in <strong id="ALM-45587__b1522415375817">Location</strong>, and check the instance IP address.</span></li><li id="ALM-45587__li701264785459"><span>Choose <strong id="ALM-45587__b2613029435852">Cluster</strong> &gt; <em id="ALM-45587__i20133844615852">Name of the desired cluster</em> &gt; <strong id="ALM-45587__b20447374275852">Service</strong> &gt; <strong id="ALM-45587__b2458828025852">IoTDB</strong> &gt; <strong id="ALM-45587__b4188270515852">Instance</strong>. Click the IoTDBServer for which the alarm is generated to go to <strong id="ALM-45587__b18534253115852">Dashboard</strong>. Click the drop-down list in the upper right corner of the chart area and choose <strong id="ALM-45587__b8972536155915">Customize</strong> &gt; <strong id="ALM-45587__b19722036105911">GC</strong>. In the dialog box that is displayed, select <strong id="ALM-45587__b0972193645913">Garbage Collection (GC) Time of IoTDBServer</strong>, and click <strong id="ALM-45587__b14972436125912">OK</strong>. Check whether the GC time of the IoTDBServer process is greater than 12 seconds.</span><p><ul id="ALM-45587__ul14960656105713"><li id="ALM-45587__li096019561574">If yes, go to <a href="#ALM-45587__li1141514131368">3</a>.</li><li id="ALM-45587__li1632819106595">If no, go to <a href="#ALM-45587__li4749473185459">5</a>.</li></ul>
</p></li><li id="ALM-45587__li1141514131368"><a name="ALM-45587__li1141514131368"></a><a name="li1141514131368"></a><span>Choose <strong id="ALM-45587__b1265015532551">Cluster</strong> &gt; <em id="ALM-45587__i122285665512">Name of the desired cluster</em> &gt; <strong id="ALM-45587__b1738175814554">Service</strong> &gt; <strong id="ALM-45587__b119971735564">IoTDB</strong> &gt; <strong id="ALM-45587__b659261815562">Configuration</strong>, click <strong id="ALM-45587__b1022932313569">All Configurations</strong>, choose <strong id="ALM-45587__b10134203610564">IoTDBServer</strong> &gt; <strong id="ALM-45587__b0924133805618">System</strong>, and increase the value of <strong id="ALM-45587__b1039864510574">-Xmx</strong> in the <strong id="ALM-45587__b635819479576">GC_OPTS</strong> parameter.</span><p><div class="note" id="ALM-45587__note07452211661"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-45587__ul945169983"><li id="ALM-45587__li14454917813">The default value of -<strong id="ALM-45587__b6237101585815">Xmx</strong> is <strong id="ALM-45587__b15700181795810">2G</strong>.</li><li id="ALM-45587__li345191981">If this alarm is occasionally generated, increase the value by 0.5 times. If this alarm is frequently generated, double the value.</li><li id="ALM-45587__li8451090817">In the case of large service volume and high service concurrency, you are advised to add instances.</li></ul>
</div></div>
</p></li><li id="ALM-45587__li5001643585459"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-45587__ul5612728785459"><li id="ALM-45587__li4030548885459">If yes, no further action is required.</li><li id="ALM-45587__li4351906785459">If no, go to <a href="#ALM-45587__li4749473185459">5</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-45587__p3538354385459"><strong id="ALM-45587__b6160463585522">Collect the fault information.</strong></p>
<ol start="5" id="ALM-45587__ol4790308885524"><li id="ALM-45587__li4749473185459"><a name="ALM-45587__li4749473185459"></a><a name="li4749473185459"></a><span>On <span id="ALM-45587__text4621151520235">MRS</span> Manager, choose <strong id="ALM-45587__b89121328133312">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45587__b1391652813332">Log</strong> &gt; <strong id="ALM-45587__b1591852812331">Download</strong>.</span></li><li id="ALM-45587__li2648019085459"><span>Expand the <strong id="ALM-45587__b92880034613">Service</strong> drop-down list, select <strong id="ALM-45587__b112941305464">IoTDB</strong> for the target cluster, and click <strong id="ALM-45587__b1329417014618">OK</strong>.</span></li><li id="ALM-45587__li3699511985459"><span>Click <span><img id="ALM-45587__image104601319175315" src="en-us_image_0000001532448186.png"></span> in the upper right corner, and set <strong id="ALM-45587__b105181640194716">Start Date</strong> and <strong id="ALM-45587__b251834044718">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-45587__b951820402479">Download</strong>.</span></li><li id="ALM-45587__li4381466885459"><span>Contact <span id="ALM-45587__text482963954014">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-45587__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-45587__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-45587__section51780573"><h4 class="sectiontitle">Related Information</h4><p id="ALM-45587__p54528917">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>