Reviewed-by: Hasko, Vladimir <vladimir.hasko@t-systems.com> Co-authored-by: Yang, Tong <yangtong2@huawei.com> Co-committed-by: Yang, Tong <yangtong2@huawei.com>
<a name="ALM-45180"></a>
<h1 class="topictitle1">ALM-45180 Number of Failed OBS read API Calls Exceeds the Threshold</h1>
<div id="body0000001343562025"><div class="section" id="ALM-45180__section13447226"><h4 class="sectiontitle">Description</h4><p id="ALM-45180__p61726833">Every 30 seconds, the system checks whether the number of failed OBS read API calls exceeds the threshold. This alarm is generated when the number of failed calls exceeds the threshold.</p>
<p id="ALM-45180__p1798194523016">This alarm is automatically cleared when the number of failed OBS read API calls is less than the threshold.</p>
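<p>The generate-and-clear behavior described above can be sketched as a small state machine. This is an illustration only, not the FusionInsight Manager implementation: the names <code>AlarmState</code> and <code>evaluate</code> and the sample values are hypothetical, and leaving the state unchanged when the count equals the threshold is an assumption, because the description only covers "exceeds" and "less than".</p>

```python
from dataclasses import dataclass

CHECK_INTERVAL_SECONDS = 30  # check period stated in the description


@dataclass
class AlarmState:
    """Tracks whether the alarm is currently active (illustrative only)."""
    active: bool = False


def evaluate(state: AlarmState, failed_calls: int, threshold: int) -> str:
    """Apply one periodic check and return 'raised', 'cleared', or 'unchanged'."""
    if not state.active and failed_calls > threshold:
        state.active = True
        return "raised"
    if state.active and failed_calls < threshold:
        state.active = False
        return "cleared"
    # Count equals the threshold, or nothing changed: keep the state (assumption).
    return "unchanged"


# Hypothetical failed-call counts from four consecutive checks.
state = AlarmState()
results = [evaluate(state, n, threshold=10) for n in (3, 12, 10, 4)]
print(results)
```

<p>In this sketch the alarm is raised on the second check and cleared automatically on the fourth, which matches the <strong>Auto Clear</strong> attribute of this alarm.</p>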
</div>
<div class="section" id="ALM-45180__section53916176"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45180__table169581736123413" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45180__row16959183615347"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.1"><p id="ALM-45180__p295933673413">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.2"><p id="ALM-45180__p1995993618344">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.2.2.1.4.1.3"><p id="ALM-45180__p9959123614348">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45180__row19959236103419"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.1 "><p id="ALM-45180__p179511920165120">45180</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.2 "><p id="ALM-45180__p28828553">Minor</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.2.2.1.4.1.3 "><p id="ALM-45180__p53411432">Yes</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45180__section15483537"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45180__table10459824153412" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45180__row18459112413411"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.1"><p id="ALM-45180__p1145962433419">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.3.2.1.3.1.2"><p id="ALM-45180__p6459132417341">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45180__row245982493413"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45180__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45180__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45180__row74591242346"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45180__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45180__p56463136">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45180__row34591724103420"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45180__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45180__p56266616">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45180__row3459224193411"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45180__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45180__p61773077">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45180__row745972433415"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.1 "><p id="ALM-45180__p57854422">Trigger Condition</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.3.2.1.3.1.2 "><p id="ALM-45180__p55696635">Specifies the threshold for triggering the alarm.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45180__section5134112"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-45180__p42871020143312">Certain upper-layer big data computing tasks will fail to execute.</p>
</div>
<div class="section" id="ALM-45180__section46207013"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-45180__p594902420217">An execution exception or severe timeout occurs on the OBS server.</p>
</div>
<div class="section" id="ALM-45180__section112391141155320"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-45180__ol132281510272"><li id="ALM-45180__li6321815112713"><span>Log in to FusionInsight Manager and choose <strong id="ALM-45180__b196406444034641">O&amp;M</strong> &gt; <strong id="ALM-45180__b185835851034641">Alarm</strong> &gt; <strong id="ALM-45180__b33596937234641">Thresholds</strong>. On the <strong id="ALM-45180__b20902528834641">Thresholds</strong> page, choose <strong id="ALM-45180__b189460550434641">meta</strong> &gt; <strong id="ALM-45180__b100682625534641">Number of failed calls to the OBS read interface</strong>. In the right pane, increase <strong id="ALM-45180__b46566919934641">Threshold</strong> or <strong id="ALM-45180__b102098816634641">Trigger Count</strong> as required.</span></li><li id="ALM-45180__li33221215162715"><span>Check whether the alarm is cleared.</span><p><ul class="subitemlist" id="ALM-45180__ul183228153270"><li id="ALM-45180__li123222150276">If yes, no further action is required.</li><li id="ALM-45180__li83221715192713">If no, go to <a href="#ALM-45180__li6758181214398">3</a>.</li></ul>
</p></li><li class="subitemlist" id="ALM-45180__li6758181214398"><a name="ALM-45180__li6758181214398"></a><a name="li6758181214398"></a><span>Contact OBS <span id="ALM-45180__text11991105105311">O&amp;M personnel</span> to check whether the OBS service is running properly.</span><p><ul id="ALM-45180__ul891993717344"><li id="ALM-45180__li18919237193411">If yes, go to <a href="#ALM-45180__li1591285591014">4</a>.</li><li id="ALM-45180__li79611713356">If no, contact OBS <span id="ALM-45180__text11448162102514">O&amp;M personnel</span> to restore the OBS service.</li></ul>
</p></li></ol>
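<p>To see why increasing <strong>Threshold</strong> or <strong>Trigger Count</strong> in step 1 can clear the alarm, consider the following rough sketch. It assumes that <strong>Trigger Count</strong> is the number of consecutive checks that must exceed the threshold before the alarm is generated. This page does not define that semantics, so treat it as an assumption. The function name <code>alarm_fires</code> and the sample counts are hypothetical.</p>

```python
def alarm_fires(samples, threshold, trigger_count):
    """Return True if `trigger_count` consecutive samples exceed `threshold`.

    Assumed semantics for illustration; not taken from the product source.
    """
    consecutive = 0
    for value in samples:
        # Extend the run on a breach, reset it on any sample at or below the threshold.
        consecutive = consecutive + 1 if value > threshold else 0
        if consecutive >= trigger_count:
            return True
    return False


# Hypothetical failed-call counts from six consecutive 30-second checks.
samples = [12, 15, 9, 14, 16, 11]

fires_default = alarm_fires(samples, threshold=10, trigger_count=2)
fires_higher_count = alarm_fires(samples, threshold=10, trigger_count=4)
fires_higher_threshold = alarm_fires(samples, threshold=20, trigger_count=1)
print(fires_default, fires_higher_count, fires_higher_threshold)
```

<p>With these sample values, the alarm fires under the first setting only. Raising either value suppresses it, which is useful when the failures are occasional and expected, but it does not fix a real OBS fault.</p>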
<p class="tableheading" id="ALM-45180__p3538354385459"><strong id="ALM-45180__b6160463585522">Collect fault information.</strong></p>
<ol start="4" id="ALM-45180__ol03214154279"><li id="ALM-45180__li1591285591014"><a name="ALM-45180__li1591285591014"></a><a name="li1591285591014"></a><span>On FusionInsight Manager, choose <strong id="ALM-45180__b54257544134641">Cluster</strong> &gt; <strong id="ALM-45180__b154973557534641">Services</strong> &gt; <strong id="ALM-45180__b101336779634641">meta</strong>. On the page that is displayed, click the <strong id="ALM-45180__b82493643634641">Chart</strong> tab. On this tab page, select <strong id="ALM-45180__b69193702734641">OBS data read operation</strong> in the <strong id="ALM-45180__b23391823734641">Chart Category</strong> area. In the <strong id="ALM-45180__b11758904434641">Number of failed calls to the OBS read interface-All Instances</strong> chart, find the host name of the instance with the most failed OBS read API calls, for example, <strong id="ALM-45180__b179486490334641">node-ana-corevpeO003</strong>.</span><p><p id="ALM-45180__p15762055142515"><span><img id="ALM-45180__image976195582518" src="en-us_image_0000001297838112.png"></span></p>
</p></li><li id="ALM-45180__li41621464915"><span>Choose <strong id="ALM-45180__b69957933834641">O&amp;M</strong> &gt; <strong id="ALM-45180__b124827314734641">Log</strong> &gt; <strong id="ALM-45180__b5835791834641">Download</strong>. For <strong id="ALM-45180__b5142374534641">Service</strong>, select <strong id="ALM-45180__b128788906734641">meta</strong> and then <strong id="ALM-45180__b188869377834641">meta</strong> under it.</span></li><li id="ALM-45180__li19768556512"><span>Select the host obtained in <a href="#ALM-45180__li1591285591014">4</a> for <strong id="ALM-45180__b18065837134641">Hosts</strong>.</span></li><li id="ALM-45180__li173219159277"><span>Click <span><img id="ALM-45180__image1032171514272" src="en-us_image_0000001296365606.png"></span> in the upper right corner, and set <strong id="ALM-45180__b198351423549">Start Date</strong> and <strong id="ALM-45180__b1683817217545">End Date</strong> for log collection to 30 minutes before and 30 minutes after the alarm generation time, respectively. Then, click <strong id="ALM-45180__b1084011218547">Download</strong>.</span></li><li id="ALM-45180__li8321181513277"><span>Contact <span id="ALM-45180__text16321201552711">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
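<p>The host selection and the log collection window from the steps above can be worked out as in the following minimal sketch. The helper names <code>busiest_host</code> and <code>log_window</code>, the per-host counts, the second host name, and the alarm time are all hypothetical. In practice, the values come from the chart and from the alarm record in FusionInsight Manager.</p>

```python
from datetime import datetime, timedelta


def busiest_host(failed_calls_by_host):
    """Return the host name with the highest number of failed OBS read API calls."""
    return max(failed_calls_by_host, key=failed_calls_by_host.get)


def log_window(alarm_time, margin=timedelta(minutes=30)):
    """Return (start, end) covering 30 minutes before and after the alarm time."""
    return alarm_time - margin, alarm_time + margin


# Hypothetical per-host counts as read from the All Instances chart.
counts = {"node-ana-corevpeO003": 42, "node-ana-example-0001": 7}
host = busiest_host(counts)

# Hypothetical alarm generation time.
start, end = log_window(datetime(2024, 1, 1, 12, 0))
print(host, start, end)
```

<p>The resulting <code>start</code> and <code>end</code> values correspond to <strong>Start Date</strong> and <strong>End Date</strong> in the log download dialog.</p>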
<p id="ALM-45180__p846622282611"></p>
</div>
<div class="section" id="ALM-45180__section139011924373"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-45180__p19011213375">This alarm is automatically cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-45180__section92375498287"><h4 class="sectiontitle">Related Information</h4><p id="ALM-45180__p8237124962812">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>