forked from docs/doc-exports
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
89 lines
12 KiB
HTML
89 lines
12 KiB
HTML
<a name="ALM-38004"></a><a name="ALM-38004"></a>
|
|
|
|
<h1 class="topictitle1">ALM-38004 Kafka Direct Memory Usage Exceeds the Threshold</h1>
|
|
<div id="body64553962"><div class="section" id="ALM-38004__s3f9abcf61a5343c18cbdfd213c494862"><h4 class="sectiontitle">Description</h4><p id="ALM-38004__en-us_topic_0070543588_p20705175">The system checks the direct memory usage of the Kafka service every 30 seconds. This alarm is generated when the direct memory usage of a Kafka instance exceeds the threshold (80% of the maximum memory) for 10 consecutive times.</p>
|
|
<p id="ALM-38004__p3405783145625">When the <strong id="ALM-38004__b1855881691815">Trigger Count</strong> is 1, this alarm is cleared when the direct memory usage is less than or equal to the threshold. When the <strong id="ALM-38004__b33035973145626"><strong id="ALM-38004__b168411222201818">Trigger Count</strong></strong> is greater than 1, this alarm is cleared when the direct memory usage is less than or equal to 90% of the threshold.</p>
|
|
</div>
|
|
<div class="section" id="ALM-38004__saa77c56e57c5469a901087da3d6e31ce"><h4 class="sectiontitle">Attribute</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-38004__en-us_topic_0070543588_table61687666" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-38004__en-us_topic_0070543588_row4236722"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-38004__en-us_topic_0070543588_p7630192">Alarm ID</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-38004__en-us_topic_0070543588_p14065819">Alarm Severity</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-38004__en-us_topic_0070543588_p65589530">Automatically Cleared</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-38004__en-us_topic_0070543588_row11151702"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-38004__en-us_topic_0070543588_p30872630">38004</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-38004__en-us_topic_0070543588_p17655074">Major</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-38004__en-us_topic_0070543588_p20774915">Yes</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-38004__se2b3440da9a44cffa579989c2806d657"><h4 class="sectiontitle">Parameters</h4>
|
|
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-38004__en-us_topic_0070543588_table5046593" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-38004__en-us_topic_0070543588_row32950917"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-38004__en-us_topic_0070543588_p51778658">Name</p>
|
|
</th>
|
|
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-38004__en-us_topic_0070543588_p33321759">Meaning</p>
|
|
</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody><tr id="ALM-38004__row1430911625811"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38004__p192431315431">Source</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38004__p692551319435">Specifies the cluster for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-38004__en-us_topic_0070543588_row14707940"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38004__en-us_topic_0070543588_p50492506">ServiceName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38004__en-us_topic_0070543588_p63361192">Specifies the service for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-38004__en-us_topic_0070543588_row33379824"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38004__en-us_topic_0070543588_p19411255">RoleName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38004__en-us_topic_0070543588_p28807785">Specifies the role for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-38004__en-us_topic_0070543588_row57943479"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38004__en-us_topic_0070543588_p62910199">HostName</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38004__en-us_topic_0070543588_p62561368">Specifies the host for which the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
<tr id="ALM-38004__en-us_topic_0070543588_row26181404"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38004__en-us_topic_0070543588_p40318993">Trigger Condition</p>
|
|
</td>
|
|
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38004__en-us_topic_0070543588_p44612986">Specifies the threshold triggering the alarm. If the current indicator value exceeds this threshold, the alarm is generated.</p>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
</div>
|
|
<div class="section" id="ALM-38004__sf50a65789f3c4605a27204bc93c2d7a8"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-38004__en-us_topic_0070543588_p56882112">If the available direct memory of the Kafka service is insufficient, a memory overflow occurs and the service breaks down.</p>
|
|
</div>
|
|
<div class="section" id="ALM-38004__s0e95022b33334729a02b671376645b83"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-38004__en-us_topic_0070543588_p44048330">The direct memory of the Kafka instance is overused or the direct memory is inappropriately allocated.</p>
|
|
</div>
|
|
<div class="section" id="ALM-38004__se13ba0d0863347558cd9cc9e82807f4f"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-38004__en-us_topic_0070543588_p11144960"><strong id="ALM-38004__b37706666155347">Check the direct memory usage.</strong></p>
|
|
<ol id="ALM-38004__ol2154620155356"><li id="ALM-38004__li601548155343"><span>On the <span id="ALM-38004__text34789336432">MRS</span> Manager portal, choose <strong id="ALM-38004__b20452193312620">O&M </strong>><strong id="ALM-38004__b07222354262"> Alarm </strong>><strong id="ALM-38004__b87221035162614"> Alarm</strong><strong id="ALM-38004__b1286016379296">s</strong><strong id="ALM-38004__b58621037162915"> </strong>> <strong id="ALM-38004__b115691849143">Kafka Direct Memory Usage Exceeds the Threshold</strong> > <strong id="ALM-38004__b14979919155343">Location</strong> to check the host name of the instance for which the alarm is generated.</span></li><li id="ALM-38004__li28837961155343"><a name="ALM-38004__li28837961155343"></a><a name="li28837961155343"></a><span>On the <span id="ALM-38004__text15741551141618">MRS</span> Manager portal, choose <strong id="ALM-38004__b4831185113332">Cluster</strong> > <em id="ALM-38004__i14240109123418">Name of the desired cluster</em><strong id="ALM-38004__b9831125114333"> </strong>><strong id="ALM-38004__b1365827182720"> Services</strong> > <strong id="ALM-38004__b48725417155343">Kafka</strong> > <strong id="ALM-38004__b35875571155343">Instance</strong>. Click the instance for which the alarm is generated to go to the page for the instance. Click the drop-down menu in the Chart area and choose <strong id="ALM-38004__b19474172052218">Customize</strong> > <strong id="ALM-38004__b16996122482220">Process</strong> > <strong id="ALM-38004__b1063672922217">Kafka</strong> <strong id="ALM-38004__b1588463093117">Direct Memory Usage</strong>, and click <strong id="ALM-38004__b3504172818136">OK</strong>.</span></li><li id="ALM-38004__li23677411155343"><span>Check whether the used direct memory of Kafka reaches 80% of the maximum direct memory specified for Kafka.</span><p><ul class="subitemlist" id="ALM-38004__ul32456985155343"><li id="ALM-38004__li54173482155343">If yes, go to <a href="#ALM-38004__li11113491818">4</a>.</li><li id="ALM-38004__li25975953155343">If no, go to <a href="#ALM-38004__li52892950155343">7</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="subitemlist" id="ALM-38004__p79966481912"><strong id="ALM-38004__b14679165714114">Check the direct memory size configured for the Kafka.</strong></p>
|
|
<ol start="4" id="ALM-38004__ol7111494119"><li id="ALM-38004__li11113491818"><a name="ALM-38004__li11113491818"></a><a name="li11113491818"></a><span>On the <span id="ALM-38004__text936154131618">MRS</span> Manager portal, choose <strong id="ALM-38004__b61112492120">Cluster</strong> > <em id="ALM-38004__i31119491211">Name of the desired cluster</em><strong id="ALM-38004__b41115491914"> </strong>><strong id="ALM-38004__b10111249613"> Services</strong> > <strong id="ALM-38004__b1111649211">Kafka</strong> > <strong id="ALM-38004__b17111549616">Configurations</strong> > <strong id="ALM-38004__b13111649719">All</strong> <strong id="ALM-38004__b151115491115">Configurations</strong> > <strong id="ALM-38004__b911154915115">Broker(Role)</strong> > <strong id="ALM-38004__b411184910119">Environment </strong>to increase the value of <strong id="ALM-38004__b3114491616">-Xmx </strong>configured in the <strong id="ALM-38004__b17117491111">KAFKA_HEAP_OPTS </strong>parameter by referring to the Note.</span><p><div class="note" id="ALM-38004__note711449515"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="ALM-38004__ul61104919114"><li id="ALM-38004__li11111491115">It is recommended that <strong id="ALM-38004__b10111749914">-Xmx</strong> and <strong id="ALM-38004__b171124918116">-Xms</strong> be set to the same value.</li><li id="ALM-38004__li1211164910116">You are advised to view <strong id="ALM-38004__b31112491919">Kafka</strong> <strong id="ALM-38004__b161154910118">Direct Memory Usage</strong> by referring to <a href="#ALM-38004__li28837961155343">2</a>, and set the value of <strong id="ALM-38004__b1411104918110">KAFKA_HEAP_OPTS</strong> to twice the value of <strong id="ALM-38004__b81116491110">Direct Memory Used by Kafka.</strong></li></ul>
|
|
</div></div>
|
|
</p></li><li id="ALM-38004__li11119490115"><span>Save the configuration and restart the Kafka service.</span></li><li id="ALM-38004__li191194917114"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-38004__ul11111249617"><li id="ALM-38004__li14111949312">If yes, no further action is required.</li><li id="ALM-38004__li15117499113">If no, go to <a href="#ALM-38004__li52892950155343">7</a>.</li></ul>
|
|
</p></li></ol>
|
|
<p class="tableheading" id="ALM-38004__p3280138315540"><strong id="ALM-38004__b2677699615540">Collect fault information.</strong></p>
|
|
<ol start="7" id="ALM-38004__ol1251527815543"><li id="ALM-38004__li52892950155343"><a name="ALM-38004__li52892950155343"></a><a name="li52892950155343"></a><span>On the <span id="ALM-38004__text17793105531612">MRS</span> Manager portal, choose <strong id="ALM-38004__b66161443351">O&M</strong> > <strong id="ALM-38004__b230513219449">Log </strong>><strong id="ALM-38004__b330532154413"> Download</strong>.</span></li><li id="ALM-38004__li10714857155343"><span>Select <strong id="ALM-38004__b6274509155343">Kafka</strong> in the required cluster from the <strong id="ALM-38004__b56470587155343">Service</strong> drop-down list.</span></li><li id="ALM-38004__li1145664103113"><span>Click <span><img id="ALM-38004__image1945644173117" src="en-us_image_0000001583087461.png"></span> in the upper right corner, and set <strong id="ALM-38004__b6456941173117">Start Date</strong> and <strong id="ALM-38004__b11456154113318">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-38004__b13456164113319">Download</strong>.</span></li><li id="ALM-38004__li60192052155343"><span>Contact the <span id="ALM-38004__text4614151421417">O&M personnel</span> and send the collected logs.</span></li></ol>
|
|
</div>
|
|
<div class="section" id="ALM-38004__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-38004__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
|
|
</div>
|
|
<div class="section" id="ALM-38004__se459f109ef174947aaaf93ca57201cbe"><h4 class="sectiontitle">Related Information</h4><p id="ALM-38004__en-us_topic_0070543588_p35647420">None</p>
|
|
</div>
|
|
</div>
|
|
<div>
|
|
<div class="familylinks">
|
|
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
|
|
</div>
|
|
</div>
|
|
|