doc-exports/docs/mrs/umn/ALM-38007.html
Yang, Tong 3b1f73dece MRS UMN 2.0.38.SP20 version
Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2022-12-13 12:03:34 +00:00

<a name="ALM-38007"></a>
<h1 class="topictitle1">ALM-38007 Status of Kafka Default User Is Abnormal</h1>
<div id="body1544512676719"><div class="section" id="ALM-38007__s3c5af36d89d44702bd46d9e007a3d832"><h4 class="sectiontitle">Description</h4><p id="ALM-38007__p150415761711">The system checks the default user of Kafka every 60 seconds. This alarm is generated when the system detects that the user status is abnormal.</p>
<p id="ALM-38007__p10504471177"><strong id="ALM-38007__b165048711718">Trigger Count</strong> is set to <strong id="ALM-38007__b15041174177">1</strong>. This alarm is cleared when the user status becomes normal.</p>
</div>
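<p>The raise and clear behavior described above can be pictured with a minimal sketch. It assumes that <strong>Trigger Count</strong> means the number of consecutive abnormal check results needed to raise the alarm, which this page does not state explicitly. The names <code>AlarmState</code> and <code>replay</code> are hypothetical and are not part of MRS or FusionInsight Manager.</p>

```python
from dataclasses import dataclass


@dataclass
class AlarmState:
    """Hypothetical model of trigger-count alarm logic (not MRS source code)."""

    trigger_count: int = 1    # assumed: consecutive abnormal checks needed to raise
    abnormal_streak: int = 0  # abnormal results seen in a row so far
    active: bool = False      # whether the alarm is currently raised

    def observe(self, user_status_normal: bool) -> bool:
        """Feed one periodic check result; return whether the alarm is active."""
        if user_status_normal:
            # A normal result resets the streak and clears the alarm.
            self.abnormal_streak = 0
            self.active = False
        else:
            self.abnormal_streak += 1
            if self.abnormal_streak >= self.trigger_count:
                self.active = True
        return self.active


def replay(results, trigger_count=1):
    """Return the alarm state after each check result in `results`."""
    state = AlarmState(trigger_count=trigger_count)
    return [state.observe(ok) for ok in results]
```

<p>With the default trigger count of 1, a single abnormal result in <code>replay</code> raises the alarm, and the next normal result clears it.</p>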
<div class="section" id="ALM-38007__s6d1548c0ed8e453dad7e8bfa61016bbd"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-38007__en-us_topic_0070543591_table33687968" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-38007__en-us_topic_0070543591_row66116154"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-38007__en-us_topic_0070543591_p53808254">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-38007__en-us_topic_0070543591_p63501347">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-38007__en-us_topic_0070543591_p43335505">Automatically Cleared</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-38007__en-us_topic_0070543591_row20514991"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-38007__en-us_topic_0070543591_p51101541">38007</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-38007__en-us_topic_0070543591_p45584156">Critical</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-38007__en-us_topic_0070543591_p1329147">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-38007__s6aeb3d426eed41b5b22f2df7ea8fff3b"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-38007__en-us_topic_0070543591_table40552107" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-38007__en-us_topic_0070543591_row50031493"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-38007__en-us_topic_0070543591_p26019105">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-38007__en-us_topic_0070543591_p27172786">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-38007__row132554514572"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38007__p192431315431">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38007__p692551319435">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-38007__en-us_topic_0070543591_row53512096"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38007__en-us_topic_0070543591_p39512530">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38007__p854019466171">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-38007__en-us_topic_0070543591_row14932284"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38007__en-us_topic_0070543591_p1555529">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38007__p1754034613176">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-38007__row18331234191414"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38007__p134163471413">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38007__p28521383146">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-38007__en-us_topic_0070543591_row60239080"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-38007__en-us_topic_0070543591_p47527305">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-38007__p254013468174">Specifies the condition that triggers the alarm, namely that the Kafka default user status is abnormal.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-38007__s6246d25e195c44c89205c3294c587977"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-38007__en-us_topic_0070543591_p38866383">If the Kafka default user status is abnormal, metadata synchronization between Brokers and interaction between Kafka and ZooKeeper are affected. This in turn affects service production and consumption as well as topic creation and deletion.</p>
</div>
<div class="section" id="ALM-38007__s073f4bd54c5f43498ef7607a06660555"><h4 class="sectiontitle">Possible Causes</h4><ul id="ALM-38007__ul692424610156"><li id="ALM-38007__li1292494614157">The Sssd service is abnormal.</li><li id="ALM-38007__li1888224815153">Some Broker instances stop running.</li></ul>
</div>
<div class="section" id="ALM-38007__section1386954531616"><h4 class="sectiontitle">Procedure</h4><p class="tableheading" id="ALM-38007__en-us_topic_0070543591_p55770280"><strong id="ALM-38007__b1869012561818">Check whether the Sssd service is abnormal.</strong></p>
<ol id="ALM-38007__ol46864976154410"><li id="ALM-38007__li127431833202512"><span>On the FusionInsight Manager portal, choose <strong id="ALM-38007__b20452193312620">O&amp;M </strong>&gt;<strong id="ALM-38007__b07222354262"> Alarm </strong>&gt;<strong id="ALM-38007__b87221035162614"> Alarm</strong><strong id="ALM-38007__b1286016379296">s</strong><strong id="ALM-38007__b58621037162915"> </strong>&gt; <strong id="ALM-38007__b144947711263">Status of Kafka Default User Is Abnormal</strong> &gt; <strong id="ALM-38007__b14979919155343">Location</strong> to check the host name of the instance for which the alarm is generated.</span></li><li id="ALM-38007__li2868309615440"><span>Log in to the host identified in the alarm information.</span></li><li id="ALM-38007__li1654108315440"><span>Run the <strong id="ALM-38007__b295319241194">id -Gn kafka</strong> command and check whether "No such user" is displayed in the command output.</span><p><ul class="subitemlist" id="ALM-38007__ul1394143651911"><li id="ALM-38007__li7957365192">If yes, record the host name of the node and go to <a href="#ALM-38007__li1465202115440">4</a>.</li><li id="ALM-38007__li297436101920">If no, go to <a href="#ALM-38007__li48097108205">5</a>.</li></ul>
</p></li><li id="ALM-38007__li1465202115440"><a name="ALM-38007__li1465202115440"></a><a name="li1465202115440"></a><span>On the FusionInsight Manager home page, choose <strong id="ALM-38007__b1056975612018">O&amp;M</strong> &gt; <strong id="ALM-38007__b1456513439539">Alarm </strong>&gt;<strong id="ALM-38007__b4565843155310"> Alarm</strong><strong id="ALM-38007__b932085253011">s</strong>. Check whether the <strong id="ALM-38007__b129761281657">Sssd Service Exception</strong> alarm is reported. If it is, handle that alarm by following its alarm information.</span></li></ol>
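<p>The user check in the steps above can also be scripted. The following is a minimal sketch, assuming Python 3.7 or later is available on the host and that the <code>id</code> command exits with a non-zero status for an unknown user. The function names are hypothetical. The message is compared case-insensitively because its exact capitalization can differ between systems.</p>

```python
import subprocess


def user_missing(returncode: int, output: str) -> bool:
    """Interpret the result of `id -Gn <user>`.

    The user is treated as missing when the command failed and its
    output mentions "no such user" (compared case-insensitively).
    """
    return returncode != 0 and "no such user" in output.lower()


def next_action(returncode: int, output: str) -> str:
    """Map the check result to the branch described in the procedure."""
    if user_missing(returncode, output):
        return "record host name and check for the Sssd Service Exception alarm"
    return "check the running status of the Broker instances"


def check_kafka_user(user: str = "kafka") -> bool:
    """Run `id -Gn <user>` on the local host; return True if the user is missing."""
    proc = subprocess.run(["id", "-Gn", user], capture_output=True, text=True)
    # The error message is written to stderr, so inspect both streams.
    return user_missing(proc.returncode, proc.stdout + proc.stderr)
```

<p>Calling <code>check_kafka_user()</code> on the host named in the alarm returns <code>True</code> when the <code>kafka</code> user cannot be resolved. In that case the Sssd service is the first thing to examine.</p>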
<p id="ALM-38007__p157911410192011"><strong id="ALM-38007__b1820181615201">Check the running status of the Broker instance.</strong></p>
<ol start="5" id="ALM-38007__ol6809161015205"><li id="ALM-38007__li48097108205"><span>On the FusionInsight Manager home page, choose <strong id="ALM-38007__b108086103208">Cluster</strong> &gt; <em id="ALM-38007__i38091510102020">Name of the desired cluster</em><strong id="ALM-38007__b178090108201"> </strong>&gt; <strong id="ALM-38007__b3809181082017">Services</strong> &gt; <strong id="ALM-38007__b280917108206">Kafka</strong> &gt; <strong id="ALM-38007__b178091210132016">Instance</strong>. The Kafka instance page is displayed.</span></li><li id="ALM-38007__li14809510142018"><a name="ALM-38007__li14809510142018"></a><a name="li14809510142018"></a><span>Check whether any Broker instance is in the stopped state.</span><p><ul class="subitemlist" id="ALM-38007__ul38091610202019"><li id="ALM-38007__li380971022013">If yes, go to <a href="#ALM-38007__li9809111022018">7</a>.</li><li id="ALM-38007__li4809141052010">If no, go to <a href="#ALM-38007__li4809151011206">8</a>.</li></ul>
</p></li><li id="ALM-38007__li9809111022018"><a name="ALM-38007__li9809111022018"></a><a name="li9809111022018"></a><span>Select all stopped Broker instances and click <strong id="ALM-38007__b1180941018206">Start Instance</strong>.</span></li><li id="ALM-38007__li4809151011206"><a name="ALM-38007__li4809151011206"></a><a name="li4809151011206"></a><span>Check whether the alarm is cleared.</span><p><ul id="ALM-38007__ul1380901014209"><li id="ALM-38007__li9809131022017">If yes, no further action is required.</li><li id="ALM-38007__li19809181082017">If no, go to <a href="#ALM-38007__li783366415440">9</a>.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-38007__p10382276154414"><strong id="ALM-38007__b55150482154417">Collect fault information.</strong></p>
<ol start="9" id="ALM-38007__ol33468126154421"><li id="ALM-38007__li783366415440"><a name="ALM-38007__li783366415440"></a><a name="li783366415440"></a><span>On FusionInsight Manager, choose <strong id="ALM-38007__b4551182792713">O&amp;M </strong>&gt;<strong id="ALM-38007__b1655392720277"> Log</strong> &gt; <strong id="ALM-38007__b4560964915440">Download</strong>.</span></li><li id="ALM-38007__li5839163215440"><span>In the <strong id="ALM-38007__b128041418162315">Service </strong>area, select <strong id="ALM-38007__b10804418112316">Kafka</strong> in the required cluster.</span></li><li id="ALM-38007__li1145664103113"><span>Click <span><img id="ALM-38007__image1945644173117" src="en-us_image_0269417505.png"></span> in the upper right corner, and set <strong id="ALM-38007__b6456941173117">Start Date</strong> to 10 minutes before the alarm generation time and <strong id="ALM-38007__b11456154113318">End Date</strong> to 10 minutes after it. Then, click <strong id="ALM-38007__b13456164113319">Download</strong>.</span></li><li id="ALM-38007__li3186770515440"><span>Contact <span id="ALM-38007__text4614151421417">O&amp;M personnel</span> and send them the collected logs.</span></li></ol>
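<p>The log collection window from the steps above is simple date arithmetic. The sketch below shows one way to compute it. The function name <code>log_window</code> and the example timestamp are hypothetical, chosen only for illustration.</p>

```python
from datetime import datetime, timedelta


def log_window(alarm_time: datetime, margin_minutes: int = 10):
    """Return (start, end) covering `margin_minutes` before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time, for illustration only.
start, end = log_window(datetime(2022, 12, 13, 12, 3, 34))
print(start.isoformat(sep=" "))  # 2022-12-13 11:53:34
print(end.isoformat(sep=" "))    # 2022-12-13 12:13:34
```

<p>The two values correspond to <strong>Start Date</strong> and <strong>End Date</strong> on the log download page.</p>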
</div>
<div class="section" id="ALM-38007__section1529716184534"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-38007__p4677152685316">After the fault is rectified, the system automatically clears this alarm.</p>
</div>
<div class="section" id="ALM-38007__s469a3fa614484b529bfca51a77ce5e1d"><h4 class="sectiontitle">Related Information</h4><p id="ALM-38007__en-us_topic_0070543591_p64814367">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>