Files
doc-exports/docs/mrs/umn/ALM-45654.html
Yang, Tong 5914b67d13 MRS UMN Doc 20240802 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-09-28 19:04:58 +00:00

83 lines
11 KiB
HTML

<a name="ALM-45654"></a><a name="ALM-45654"></a>
<h1 class="topictitle1">ALM-45654 Flink HA Certificate Is About to Expire</h1>
<div id="body0000001971621310"><p id="ALM-45654__p12261122253615">This section applies to MRS 3.3.0 or later.</p>
<div class="section" id="ALM-45654__section238231914117"><h4 class="sectiontitle"><span id="ALM-45654__text516373020197">Alarm Description</span></h4><p id="ALM-45654__p5166194361117">Flink checks whether the HA certificate file is about to expire in the first health check or at 01:00:00 every day. This alarm is generated when the remaining validity period is less than or equal to 30 days. This alarm is automatically cleared when the remaining validity period is greater than 30 days.</p>
</div>
<div class="section" id="ALM-45654__section10101081213"><h4 class="sectiontitle"><span id="ALM-45654__text20591447192117">Alarm Attributes</span></h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45654__table33817547" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45654__row8931076"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.1"><p id="ALM-45654__p17386810"><span id="ALM-45654__text1864783145211">Alarm ID</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.2"><p id="ALM-45654__p66154394"><span id="ALM-45654__text297913110521">Alarm Severity</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.3"><p id="ALM-45654__p49230886"><span id="ALM-45654__text0890175712305">Auto Cleared</span></p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45654__row40652256"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-45654__p4498430">45654</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-45654__p28828553">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.3 "><p id="ALM-45654__p53411432">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45654__section15483537"><h4 class="sectiontitle"><span id="ALM-45654__text18171442142214">Alarm Parameters</span></h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45654__table31358724" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45654__row33518103"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.1"><p id="ALM-45654__p42699947"><span id="ALM-45654__text6203173410617">Parameter</span></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.2"><p id="ALM-45654__p36143663"><span id="ALM-45654__text10819164319610">Description</span></p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45654__row163311621185116"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45654__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45654__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45654__row54362592"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45654__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45654__p56463136">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45654__row38406179"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45654__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45654__p56266616">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45654__row36637496"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45654__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45654__p61773077">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45654__section722973311121"><h4 class="sectiontitle"><span id="ALM-45654__text98201443182317">Impact on the System</span></h4><p id="ALM-45654__p1068283611125">Currently, there is no impact on the system.</p>
</div>
<div class="section" id="ALM-45654__section46207013"><h4 class="sectiontitle"><span id="ALM-45654__text11871546172411">Possible Causes</span></h4><p id="ALM-45654__p62452010">The HA certificate is about to expire.</p>
</div>
<div class="section" id="ALM-45654__section1932086181519"><h4 class="sectiontitle"><span id="ALM-45654__text79051154102518">Handling Procedure</span></h4><p id="ALM-45654__p9765201181513"><strong id="ALM-45654__b4400121914158">View alarm information.</strong></p>
<ol id="ALM-45654__ol2985547105618"><li id="ALM-45654__li13985847105619"><span>Log in to FusionInsight Manager, choose <strong id="ALM-45654__b5775164031211">O&amp;M</strong> &gt; <strong id="ALM-45654__b07761040101215">Alarm</strong> &gt; <strong id="ALM-45654__b9776340121217">Alarms</strong> &gt; <strong id="ALM-45654__b677624015121">ALM-45654 Flink HA Certificate Is About to Expire</strong>, view <strong id="ALM-45654__b16776440101216">Location</strong>, obtain the name of the host for which the alarm is generated, and click the host name to view its IP address.</span></li></ol>
<p id="ALM-45654__p72121810184"><strong id="ALM-45654__b814815542082">Check whether the HA certificate file in the system is valid. If it is not, generate a new one.</strong></p>
<ol start="2" id="ALM-45654__ol093713227375"><li id="ALM-45654__li39371222113715"><span>Log in to the host for which the alarm is generated as user <strong id="ALM-45654__b313521218911">omm</strong>.</span></li><li id="ALM-45654__li4937132214379"><span>Run the <strong id="ALM-45654__b17809113141611">cd ${BIGDATA_HOME}/FusionInsight_Flink_*/install/FusionInsight-Flink-*/ha/local/cert</strong> command to go to the directory where the HA certificate is stored.</span></li><li id="ALM-45654__li893717220371"><span>Run the <strong id="ALM-45654__b11224330391">openssl x509 -noout -text -in server.crt</strong> command to query the effective time and due time of the HA certificate.</span></li><li id="ALM-45654__li15688521425"><span>Perform <a href="#ALM-45654__li99371922103716">6</a> to <a href="#ALM-45654__li16937182293715">7</a> during off-peak hours to update the certificate file as needed.</span></li><li id="ALM-45654__li99371922103716"><a name="ALM-45654__li99371922103716"></a><a name="li99371922103716"></a><span>Run the <strong id="ALM-45654__b69372229376">cd </strong><strong id="ALM-45654__b9937202243717">${BIGDATA_HOME}/</strong><strong id="ALM-45654__b1241723519">FusionInsight_Flink_*/install/FusionInsight-Flink-*/flink/s</strong><strong id="ALM-45654__b1624152155115">bin</strong> command to go to the Flink script directory.</span></li><li id="ALM-45654__li16937182293715"><a name="ALM-45654__li16937182293715"></a><a name="li16937182293715"></a><span>Run the <strong id="ALM-45654__b2912183914103">sh proceed_ha_ssl_cert.sh</strong> command to generate a new HA certificate. Then, check whether the alarm is cleared 1 minute later.</span><p><ul id="ALM-45654__ul770182010388"><li id="ALM-45654__li127011820153816">If yes, go to <a href="#ALM-45654__li127861713811">9</a>.</li><li id="ALM-45654__li1260435810588">If no, go to <a href="#ALM-45654__li6673192244411">8</a>.</li></ul>
</p></li><li id="ALM-45654__li6673192244411"><a name="ALM-45654__li6673192244411"></a><a name="li6673192244411"></a><span>On the node where the standby FlinkServer instance is located, repeat <a href="#ALM-45654__li99371922103716">6</a> to <a href="#ALM-45654__li16937182293715">7</a>. Then, check whether the alarm is cleared 1 minute later.</span><p><ul id="ALM-45654__ul15162135612215"><li id="ALM-45654__li18162056621">If yes, go to <a href="#ALM-45654__li127861713811">9</a>.</li><li id="ALM-45654__li101622056322">If no, go to <a href="#ALM-45654__li593632253716">10</a>.</li></ul>
</p></li><li id="ALM-45654__li127861713811"><a name="ALM-45654__li127861713811"></a><a name="li127861713811"></a><span>Check whether this alarm is generated again during periodic system check.</span><p><ul id="ALM-45654__ul478613131112"><li id="ALM-45654__li14786121315111">If yes, go to <a href="#ALM-45654__li593632253716">10</a>.</li><li id="ALM-45654__li16786201316114">If no, no further action is required.</li></ul>
</p></li></ol>
<p class="tableheading" id="ALM-45654__p3538354385459"><strong id="ALM-45654__b6160463585522">Collect fault information.</strong></p>
<ol start="10" id="ALM-45654__ol093762283713"><li id="ALM-45654__li593632253716"><a name="ALM-45654__li593632253716"></a><a name="li593632253716"></a><span>On FusionInsight Manager, choose <strong id="ALM-45654__b2372184331113">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45654__b137234316112">Log</strong> &gt; <strong id="ALM-45654__b2037313437116">Download</strong>.</span></li><li id="ALM-45654__li19327535427"><span>Expand the <strong id="ALM-45654__b1417044611117">Service</strong> drop-down list, and select <strong id="ALM-45654__b6170346191113">Flink</strong> for the target cluster.</span></li><li id="ALM-45654__li293692216373"><span>Click <span><img id="ALM-45654__image13936322143711" src="en-us_image_0000002008129165.png"></span> in the upper right corner, and set <strong id="ALM-45654__b164991050101118">Start Date</strong> and <strong id="ALM-45654__b1650011506110">End Date</strong> for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click <strong id="ALM-45654__b165007501118">Download</strong>.</span></li><li id="ALM-45654__li1993762233711"><span>Contact <span id="ALM-45654__text126301214142412">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-45654__section169311343318"><h4 class="sectiontitle"><span id="ALM-45654__text195945622616">Alarm Clearance</span></h4><p id="ALM-45654__p754913417333">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-45654__section20027245"><h4 class="sectiontitle"><span id="ALM-45654__text143698488285">Related Information</span></h4><p id="ALM-45654__p16257720"><span id="ALM-45654__text19275105817121">None.</span></p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>