Files
doc-exports/docs/mrs/umn/ALM-45429.html
Yang, Tong 2195db241c MRS UMN 20231220 version update
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Reviewed-by: Rechenburg, Matthias <matthias.rechenburg@t-systems.com>
Co-authored-by: Yang, Tong <yangtong2@huawei.com>
Co-committed-by: Yang, Tong <yangtong2@huawei.com>
2024-05-16 09:40:21 +00:00

85 lines
11 KiB
HTML

<a name="ALM-45429"></a><a name="ALM-45429"></a>
<h1 class="topictitle1">ALM-45429 Table Metadata Synchronization Failed on the Added ClickHouse Node</h1>
<div id="body0000001160478452"><div class="note" id="ALM-45429__note1752311344012"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="ALM-45429__p1152415341202">This section applies only to MRS 3.1.2<span id="ALM-45429__ph174355293719">-LTS.6</span> or later.</p>
</div></div>
<div class="section" id="ALM-45429__section13447226"><h4 class="sectiontitle">Description</h4><p id="ALM-45429__p13518023133811">This alarm is generated when the local table corresponding to the distributed table fails to be created during ClickHouse capacity expansion.</p>
</div>
<div class="section" id="ALM-45429__section53916176"><h4 class="sectiontitle">Attribute</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45429__table33817547" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45429__row8931076"><th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.1"><p id="ALM-45429__p52328576">Alarm ID</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.2"><p id="ALM-45429__p10756297">Alarm Severity</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33333333333333%" id="mcps1.3.3.2.1.4.1.3"><p id="ALM-45429__p65953734">Auto Clear</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45429__row40652256"><td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.1 "><p id="ALM-45429__p8364329184016">45429</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.2 "><p id="ALM-45429__p1236402914406">Major</p>
</td>
<td class="cellrowborder" valign="top" width="33.33333333333333%" headers="mcps1.3.3.2.1.4.1.3 "><p id="ALM-45429__p1536482924019">No</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45429__section15483537"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="ALM-45429__table31358724" frame="border" border="1" rules="all"><thead align="left"><tr id="ALM-45429__row33518103"><th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.1"><p id="ALM-45429__p30611809">Name</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="50%" id="mcps1.3.4.2.1.3.1.2"><p id="ALM-45429__p63637484">Meaning</p>
</th>
</tr>
</thead>
<tbody><tr id="ALM-45429__row163311621185116"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45429__p17935380415">Source</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45429__p187931338134115">Specifies the cluster for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45429__row54362592"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45429__p41293795">ServiceName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45429__p56463136">Specifies the service for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45429__row38406179"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45429__p23892775">RoleName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45429__p56266616">Specifies the role for which the alarm is generated.</p>
</td>
</tr>
<tr id="ALM-45429__row36637496"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.1 "><p id="ALM-45429__p14847206">HostName</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.4.2.1.3.1.2 "><p id="ALM-45429__p61773077">Specifies the host for which the alarm is generated.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="ALM-45429__section5134112"><h4 class="sectiontitle">Impact on the System</h4><p id="ALM-45429__p166918236411">The distributed table fails to be queried.</p>
</div>
<div class="section" id="ALM-45429__section46207013"><h4 class="sectiontitle">Possible Causes</h4><p id="ALM-45429__p12171173144116">A node is stopped or faulty during capacity expansion.</p>
</div>
<div class="section" id="ALM-45429__section02555126619"><h4 class="sectiontitle">Procedure</h4><ol id="ALM-45429__ol85761615337"><li id="ALM-45429__li1957661133312"><span>On <span id="ALM-45429__text34789336432">MRS</span> Manager, choose <strong id="ALM-45429__b1734401615233">Cluster</strong> &gt; <strong id="ALM-45429__b1284911962311">Services</strong> &gt; <strong id="ALM-45429__b819312223236">ClickHouse</strong> &gt; <strong id="ALM-45429__b18394192382314">Instance</strong>.</span></li><li id="ALM-45429__li1957612118337"><span>Check whether an instance is stopped, decommissioned, or faulty.</span><p><ul id="ALM-45429__ul8576101113318"><li id="ALM-45429__li657618163314">If yes, go to <a href="#ALM-45429__li1857611153310">3</a>.</li><li id="ALM-45429__li7576513337">If no, go to <a href="#ALM-45429__li1457618110336">4</a>.</li></ul>
</p></li><li id="ALM-45429__li1857611153310"><a name="ALM-45429__li1857611153310"></a><a name="li1857611153310"></a><span>Start the instance or rectify the instance fault until all instances are running properly.</span></li><li id="ALM-45429__li1457618110336"><a name="ALM-45429__li1457618110336"></a><a name="li1457618110336"></a><span>On <span id="ALM-45429__text8243826122216">MRS</span> Manager, choose <strong id="ALM-45429__b93041612112615">O&amp;M</strong> &gt; <strong id="ALM-45429__b18889161414261">Alarm</strong> &gt; <strong id="ALM-45429__b15945151615261">Alarms</strong>, locate this alarm and the faulty host based on the location information.</span></li></ol><ol start="5" id="ALM-45429__ol3360126142112"><li id="ALM-45429__li1336018269214"><span>Log in to the faulty host as user <strong id="ALM-45429__b207382410276">omm</strong>.</span></li><li id="ALM-45429__li836014262217"><span>Run the following commands to initialize environment variables:</span><p><p id="ALM-45429__p174641651143118"><strong id="ALM-45429__b1419214613448">source </strong><em id="ALM-45429__i549819744414">Cluster installation directory</em><strong id="ALM-45429__b14193186154411">/FusionInsight_ClickHouse_*/*_*_ClickHouseServer/etc/ENV_VARS</strong></p>
<p id="ALM-45429__p1556512012328"><strong id="ALM-45429__b42739125441">source </strong><em id="ALM-45429__i1929191318448">Cluster installation directory</em><strong id="ALM-45429__b827391214445">/FusionInsight_ClickHouse_*/*_*_ClickHouseServer/etc/clickhouse-env.sh</strong></p>
<p id="ALM-45429__p143591053173114"><strong id="ALM-45429__b4225536163310">export CLICKHOUSE_CONF_DIR=${CLICKHOUSE_CONF_DIR}</strong></p>
</p></li><li id="ALM-45429__li16860155143213"><span>Run the following command to run the metadata synchronization tool to synchronize metadata from the existing node to the faulty node:</span><p><p id="ALM-45429__p12979134320336"><strong id="ALM-45429__b11781158115617">sh </strong><em id="ALM-45429__i177884855620">Cluster installation directory</em><strong id="ALM-45429__b5788118145618">/FusionInsight_ClickHouse_*/install/FusionInsight-ClickHouse-*/clickhouse/sbin/clickhouse-create-meta.sh</strong> <strong id="ALM-45429__b1778913819561">true</strong></p>
</p></li><li id="ALM-45429__li32121146145113"><span>Run the following command to view the log information and check whether the metadata has been synchronized:</span><p><div class="p" id="ALM-45429__p1957517547335"><strong id="ALM-45429__b053485810331">vim /var/log/Bigdata/clickhouse/clickhouseServer/start.log</strong><ul id="ALM-45429__ul93493003312"><li id="ALM-45429__li234193093310">If the synchronization is complete, go to <a href="#ALM-45429__li185513552368">9</a>.</li><li id="ALM-45429__li16348300332">If the synchronization fails, go to <a href="#ALM-45429__li4749473185459">10</a>.</li></ul>
</div>
</p></li><li id="ALM-45429__li185513552368"><a name="ALM-45429__li185513552368"></a><a name="li185513552368"></a><span>On <span id="ALM-45429__text14465353229">MRS</span> Manager, choose <strong id="ALM-45429__b290672514299">O&amp;M</strong> &gt; <strong id="ALM-45429__b2963182712919">Alarm</strong> &gt; <strong id="ALM-45429__b9900192932918">Alarms</strong>. In the <strong id="ALM-45429__b13570436102911">Alarm ID</strong> column, locate the corresponding alarm and click <strong id="ALM-45429__b1915725333117">Clear</strong> in the <strong id="ALM-45429__b84577554319">Operation</strong> column. In the displayed dialog box, click <strong id="ALM-45429__b1687314591314">OK</strong> to manually clear the alarm.</span></li></ol>
<p class="tableheading" id="ALM-45429__p3538354385459"><strong id="ALM-45429__b6160463585522">Collect the fault information.</strong></p>
<ol start="10" id="ALM-45429__ol4790308885524"><li id="ALM-45429__li4749473185459"><a name="ALM-45429__li4749473185459"></a><a name="li4749473185459"></a><span>On <span id="ALM-45429__text2105379222">MRS</span> Manager, choose <strong id="ALM-45429__b47018513330">O&amp;M</strong>. In the navigation pane on the left, choose <strong id="ALM-45429__b07115519334">Log</strong> &gt; <strong id="ALM-45429__b16721511338">Download</strong>.</span></li><li id="ALM-45429__li2648019085459"><span>Expand the <strong id="ALM-45429__b93675590742127">Service</strong> drop-down list, select <strong id="ALM-45429__b164007838142127">ClickHouse</strong> for the target cluster, and click <strong id="ALM-45429__b25352123542127">OK</strong>.</span></li><li id="ALM-45429__li1686655576"><span>Choose the corresponding host form the host list.</span></li><li id="ALM-45429__li3699511985459"><span>Click <span><img id="ALM-45429__image104601319175315" src="en-us_image_0000001582927545.png"></span> in the upper right corner, and set <strong id="ALM-45429__b23305076542127">Start Date</strong> and <strong id="ALM-45429__b197429450042127">End Date</strong> for log collection to 30 minutes ahead of and after the alarm generation time respectively. Then, click <strong id="ALM-45429__b204248608042127">Download</strong>.</span></li><li id="ALM-45429__li4381466885459"><span>Contact <span id="ALM-45429__text14795161524010">O&amp;M personnel</span> and provide the collected logs.</span></li></ol>
</div>
<div class="section" id="ALM-45429__section169311343318"><h4 class="sectiontitle">Alarm Clearing</h4><p id="ALM-45429__p754913417333">This alarm needs to be manually cleared after the fault is rectified.</p>
</div>
<div class="section" id="ALM-45429__section51780573"><h4 class="sectiontitle">Related Information</h4><p id="ALM-45429__p54528917">None</p>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="mrs_01_1298.html">Alarm Reference (Applicable to MRS 3.x)</a></div>
</div>
</div>