Files
doc-exports/docs/mrs/umn/admin_guide_000399.html
yangtong c285e88a17 MRS UMN 20250806 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: yangtong <yangtong2@huawei.com>
Co-committed-by: yangtong <yangtong2@huawei.com>
2025-09-02 10:43:57 +00:00

264 lines
36 KiB
HTML

<a name="admin_guide_000399"></a><a name="admin_guide_000399"></a>
<h1 class="topictitle1">Introduction</h1>
<div id="body1530063325280"><div class="section" id="admin_guide_000399__sbbcbb38c757f4ddfb0e69e051a1797c6"><h4 class="sectiontitle">Overview</h4><p id="admin_guide_000399__en-us_topic_0046736760_p21034837"><span id="admin_guide_000399__text67509419010">MRS</span> Manager provides the backup and restoration of system data and user data by component. The system can back up Manager data, component metadata, and service data.</p>
<p id="admin_guide_000399__p150025184916">Data can be backed up to local disks (LocalDir), local HDFS (LocalHDFS), remote HDFS (RemoteHDFS), NAS (NFS/CIFS), Object Storage Service (OBS), and SFTP server (SFTP). For details, see <a href="admin_guide_000201.html">Backing Up Data</a>.</p>
<p id="admin_guide_000399__p18128948203214">For a component that supports multiple services, multiple instances of a service can be backed up and restored. The backup and restoration operations are consistent with those of a service instance.</p>
<div class="note" id="admin_guide_000399__note1782813720109"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="admin_guide_000399__p18741958332">Only MRS 3.1.0 or later supports data backup to OBS.</p>
</div></div>
<p id="admin_guide_000399__en-us_topic_0046736760_p55095806">Backup and restoration tasks are performed in the following scenarios:</p>
<ul id="admin_guide_000399__en-us_topic_0046736760_ul26100208"><li id="admin_guide_000399__en-us_topic_0046736760_li33575284">Routine backup is performed to ensure the data security of the system and components.</li><li id="admin_guide_000399__en-us_topic_0046736760_li33742108">If the system is faulty, the data backup can be used to recover the system.</li><li id="admin_guide_000399__en-us_topic_0046736760_li35243520">If the active cluster is completely faulty, a mirrored cluster identical to the active cluster needs to be created. You can use the backup data to restore the active cluster.</li></ul>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000399__en-us_topic_0046736760_table48756224" frame="border" border="1" rules="all"><caption><b>Table 1 </b>Manager configuration data to be backed up</caption><thead align="left"><tr id="admin_guide_000399__en-us_topic_0046736760_row56070673"><th align="left" class="cellrowborder" valign="top" width="12.24%" id="mcps1.3.1.8.2.4.1.1"><p id="admin_guide_000399__en-us_topic_0046736760_p45430631">Backup Type</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="54.370000000000005%" id="mcps1.3.1.8.2.4.1.2"><p id="admin_guide_000399__en-us_topic_0046736760_p56002494">Backup Content</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.39%" id="mcps1.3.1.8.2.4.1.3"><p id="admin_guide_000399__p89213556315">Backup Directory Type</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000399__en-us_topic_0046736760_row39908157"><td class="cellrowborder" valign="top" width="12.24%" headers="mcps1.3.1.8.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p11335309">OMS</p>
</td>
<td class="cellrowborder" valign="top" width="54.370000000000005%" headers="mcps1.3.1.8.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p45744854">Database data (excluding alarm data) and configuration data in the cluster management system by default</p>
</td>
<td class="cellrowborder" valign="top" width="33.39%" headers="mcps1.3.1.8.2.4.1.3 "><ul id="admin_guide_000399__ul204248118335"><li id="admin_guide_000399__li10424131173314">LocalDir</li><li id="admin_guide_000399__li54741917193312">LocalHDFS</li><li id="admin_guide_000399__li159451432133310">RemoteHDFS</li><li id="admin_guide_000399__li1376237133310">NFS</li><li id="admin_guide_000399__li79620399334">CIFS</li><li id="admin_guide_000399__li13803114234">SFTP</li><li id="admin_guide_000399__li19126171011288">OBS</li></ul>
</td>
</tr>
</tbody>
</table>
</div>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000399__en-us_topic_0046736760_table51479525" frame="border" border="1" rules="all"><caption><b>Table 2 </b>Component metadata or other data to be backed up</caption><thead align="left"><tr id="admin_guide_000399__en-us_topic_0046736760_row59925403"><th align="left" class="cellrowborder" valign="top" width="14.78%" id="mcps1.3.1.9.2.4.1.1"><p id="admin_guide_000399__en-us_topic_0046736760_p22119473">Backup Type</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="55.910000000000004%" id="mcps1.3.1.9.2.4.1.2"><p id="admin_guide_000399__en-us_topic_0046736760_p46846859">Backup Content</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="29.310000000000002%" id="mcps1.3.1.9.2.4.1.3"><p id="admin_guide_000399__p59305013344">Backup Directory Type</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000399__en-us_topic_0046736760_row36499217"><td class="cellrowborder" valign="top" width="14.78%" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p3646637">DBService</p>
</td>
<td class="cellrowborder" valign="top" width="55.910000000000004%" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p26942148">Metadata of the components (including Loader, Hive, Spark, Oozie, and Hue) managed by DBService. For a cluster with multiple services installed, back up the metadata of multiple Hive and Spark service instances.</p>
</td>
<td class="cellrowborder" valign="top" width="29.310000000000002%" headers="mcps1.3.1.9.2.4.1.3 "><ul id="admin_guide_000399__ul964911614340"><li id="admin_guide_000399__li166491763344">LocalDir</li><li id="admin_guide_000399__li16649106123419">LocalHDFS</li><li id="admin_guide_000399__li2649761346">RemoteHDFS</li><li id="admin_guide_000399__li156492618341">NFS</li><li id="admin_guide_000399__li86498623419">CIFS</li><li id="admin_guide_000399__li18351221242">SFTP</li><li id="admin_guide_000399__li121698162912">OBS</li></ul>
</td>
</tr>
<tr id="admin_guide_000399__row13991633143717"><td class="cellrowborder" valign="top" width="14.78%" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__p13497520173624">Kafka</p>
</td>
<td class="cellrowborder" valign="top" width="55.910000000000004%" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__p28195434173624">Kafka metadata.</p>
</td>
<td class="cellrowborder" valign="top" width="29.310000000000002%" headers="mcps1.3.1.9.2.4.1.3 "><ul id="admin_guide_000399__ul193491838545"><li id="admin_guide_000399__li163492381346">LocalDir</li><li id="admin_guide_000399__li18349173810418">LocalHDFS</li><li id="admin_guide_000399__li183491381346">RemoteHDFS</li><li id="admin_guide_000399__li1534915388417">NFS</li><li id="admin_guide_000399__li53499386411">CIFS</li><li id="admin_guide_000399__li156101014192913">OBS</li></ul>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row41152746"><td class="cellrowborder" valign="top" width="14.78%" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p45038152">NameNode</p>
</td>
<td class="cellrowborder" valign="top" width="55.910000000000004%" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p24211670">HDFS metadata. After multiple NameServices are added, backup and restoration are supported for all of them and the operations are consistent with those of the default hacluster instance.</p>
</td>
<td class="cellrowborder" rowspan="3" valign="top" width="29.310000000000002%" headers="mcps1.3.1.9.2.4.1.3 "><ul id="admin_guide_000399__ul59025463348"><li id="admin_guide_000399__li129021646183412">LocalDir</li><li id="admin_guide_000399__li790215467344">RemoteHDFS</li><li id="admin_guide_000399__li89021446103414">NFS</li><li id="admin_guide_000399__li129021446113420">CIFS</li><li id="admin_guide_000399__li1084252655">SFTP</li><li id="admin_guide_000399__li18898122512295">OBS</li></ul>
</td>
</tr>
<tr id="admin_guide_000399__row2084313015715"><td class="cellrowborder" valign="top" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__p184316300710">Yarn</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__p98431630379">Information about the Yarn service resource pool.</p>
</td>
</tr>
<tr id="admin_guide_000399__row13192102153010"><td class="cellrowborder" valign="top" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__p1519302153017">HBase</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__p17193152143013"><strong id="admin_guide_000399__b134129143911381">tableinfo</strong> files and data files of HBase system tables.</p>
</td>
</tr>
<tr id="admin_guide_000399__row1430873316312"><td class="cellrowborder" valign="top" width="14.78%" headers="mcps1.3.1.9.2.4.1.1 "><p id="admin_guide_000399__p4309173323116">ClickHouse</p>
</td>
<td class="cellrowborder" valign="top" width="55.910000000000004%" headers="mcps1.3.1.9.2.4.1.2 "><p id="admin_guide_000399__p1730953363119">ClickHouse metadata.</p>
</td>
<td class="cellrowborder" valign="top" width="29.310000000000002%" headers="mcps1.3.1.9.2.4.1.3 "><ul id="admin_guide_000399__ul33762883211"><li id="admin_guide_000399__li203767135323">LocalDir</li><li id="admin_guide_000399__li5376713163211">RemoteHDFS</li></ul>
</td>
</tr>
</tbody>
</table>
</div>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000399__en-us_topic_0046736760_table35267730" frame="border" border="1" rules="all"><caption><b>Table 3 </b>Service data of specific components to be backed up</caption><thead align="left"><tr id="admin_guide_000399__en-us_topic_0046736760_row17465515"><th align="left" class="cellrowborder" valign="top" width="12.65%" id="mcps1.3.1.10.2.4.1.1"><p id="admin_guide_000399__en-us_topic_0046736760_p5420574">Backup Type</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="54.02%" id="mcps1.3.1.10.2.4.1.2"><p id="admin_guide_000399__en-us_topic_0046736760_p36413387">Backup Content</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="33.33%" id="mcps1.3.1.10.2.4.1.3"><p id="admin_guide_000399__p1390434515354">Backup Directory Type</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000399__en-us_topic_0046736760_row63803224"><td class="cellrowborder" valign="top" width="12.65%" headers="mcps1.3.1.10.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p678637">HBase</p>
</td>
<td class="cellrowborder" valign="top" width="54.02%" headers="mcps1.3.1.10.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p54969651">Table-level user data. For a cluster with multiple services installed, backup and restoration are supported for multiple HBase service instances and the backup and restoration operations are consistent with those of a single HBase service instance.</p>
</td>
<td class="cellrowborder" rowspan="3" valign="top" width="33.33%" headers="mcps1.3.1.10.2.4.1.3 "><ul id="admin_guide_000399__ul1656717301367"><li id="admin_guide_000399__li95675300369">RemoteHDFS</li><li id="admin_guide_000399__li5567230193619">NFS</li><li id="admin_guide_000399__li256753043617">CIFS</li><li id="admin_guide_000399__li727163411015">SFTP</li><li id="admin_guide_000399__li24368327245">OBS</li></ul>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row24964816"><td class="cellrowborder" valign="top" headers="mcps1.3.1.10.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p8884193">HDFS</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.1.10.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p48531064">Directories or files of user services.</p>
<div class="note" id="admin_guide_000399__note1690815599556"><span class="notetitle"> NOTE: </span><div class="notebody"><p id="admin_guide_000399__p1290915915553">Encrypted directories cannot be backed up or restored.</p>
</div></div>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row34126392"><td class="cellrowborder" valign="top" headers="mcps1.3.1.10.2.4.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p12774352">Hive</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.1.10.2.4.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p28089568">Table-level user data. For a cluster with multiple services installed, backup and restoration are supported for multiple Hive service instances and the backup and restoration operations are consistent with those of a single Hive service instance.</p>
</td>
</tr>
<tr id="admin_guide_000399__row134603399320"><td class="cellrowborder" valign="top" width="12.65%" headers="mcps1.3.1.10.2.4.1.1 "><p id="admin_guide_000399__p134601039143215">ClickHouse</p>
</td>
<td class="cellrowborder" valign="top" width="54.02%" headers="mcps1.3.1.10.2.4.1.2 "><p id="admin_guide_000399__p3460103913326">Table-level user data.</p>
</td>
<td class="cellrowborder" valign="top" width="33.33%" headers="mcps1.3.1.10.2.4.1.3 "><ul id="admin_guide_000399__ul1875135025120"><li id="admin_guide_000399__li9753507511">RemoteHDFS</li><li id="admin_guide_000399__li106022031604">OBS</li></ul>
</td>
</tr>
<tr id="admin_guide_000399__row198781117136"><td class="cellrowborder" valign="top" width="12.65%" headers="mcps1.3.1.10.2.4.1.1 "><p id="admin_guide_000399__p1390615920014">Doris</p>
</td>
<td class="cellrowborder" valign="top" width="54.02%" headers="mcps1.3.1.10.2.4.1.2 "><p id="admin_guide_000399__p196921436703">Doris data. This function is available for MRS 3.3.1 or later.</p>
</td>
<td class="cellrowborder" valign="top" width="33.33%" headers="mcps1.3.1.10.2.4.1.3 "><ul id="admin_guide_000399__ul14428419919"><li id="admin_guide_000399__li0316133016117">RemoteHDFS</li><li id="admin_guide_000399__li631614304110">OBS</li></ul>
</td>
</tr>
</tbody>
</table>
</div>
<p id="admin_guide_000399__en-us_topic_0046736760_p16578438">Note that some components do not provide data backup or restoration:</p>
<ul id="admin_guide_000399__en-us_topic_0046736760_ul14988220"><li id="admin_guide_000399__en-us_topic_0046736760_li676256">Kafka supports replicas and allows multiple replicas to be specified when a topic is created.</li><li id="admin_guide_000399__en-us_topic_0046736760_li23228850">MapReduce and Yarn data is stored in HDFS. Therefore, they rely on the backup and restoration provided by HDFS.</li><li id="admin_guide_000399__li1055910483368">Backup and restoration of service data in ZooKeeper are performed by their own upper-layer components.</li></ul>
</div>
<div class="section" id="admin_guide_000399__s05983d4475bd44e2991eda1b3db0c379"><h4 class="sectiontitle">Principles</h4><p id="admin_guide_000399__en-us_topic_0046736760_p2488722"><strong id="admin_guide_000399__en-us_topic_0046736760_b22398504">Task</strong></p>
<p id="admin_guide_000399__en-us_topic_0046736760_p259947">Before backup or restoration, you need to create a backup or restoration task and set task parameters, such as the task name, backup data source, and type of the directory for storing backup files. Then you can execute the tasks to back up or restore data. When Manager is used to restore the data of HDFS, HBase, Hive, and NameNode, the cluster cannot be accessed.</p>
<p id="admin_guide_000399__p966241312540">Each backup task can back up data of different data sources and generate an independent backup file for each data source. All the backup files generated in a backup task form a backup file set, which can be used in restoration tasks. Backup data can be stored on Linux local disks, local cluster HDFS, and standby cluster HDFS.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p2339524">Backup tasks support full backup and incremental backup policies. Cloud data backup tasks do not support incremental backup. If the backup directory type is NFS or CIFS, incremental backup is not recommended. When incremental backup is used for NFS or CIFS backup, the latest full backup data is updated each time the incremental backup is performed. Therefore, no new recovery point is generated.</p>
<div class="note" id="admin_guide_000399__en-us_topic_0046736760_note21055716"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p class="text" id="admin_guide_000399__en-us_topic_0046736760_p55283716">Task execution rules:</p>
<ul id="admin_guide_000399__en-us_topic_0046736760_ul27791397"><li id="admin_guide_000399__en-us_topic_0046736760_li48795984">If a task is being executed, the task cannot be executed repeatedly and other tasks cannot be started at the same time.</li><li id="admin_guide_000399__en-us_topic_0046736760_li36510679">The interval at which a periodic task is automatically executed must be greater than 120s. Otherwise, the task is postponed and will be executed in the next period. Manual tasks can be executed at any interval.</li><li id="admin_guide_000399__en-us_topic_0046736760_li60160657">When a periodic task is to be automatically executed, the current time cannot be 120s later than the task start time. Otherwise, the task is postponed and executed in the next period.</li><li id="admin_guide_000399__en-us_topic_0046736760_li4575009">When a periodic task is locked, it cannot be automatically executed and needs to be manually unlocked.</li><li id="admin_guide_000399__en-us_topic_0046736760_li41175087">Before an OMS, DBService, Kafka, or NameNode backup task starts, ensure that the LocalBackup partition on the active management node has not less than 20 GB of available space. Otherwise, the backup task cannot be started.</li></ul>
</div></div>
<p id="admin_guide_000399__en-us_topic_0046736760_p35031468">When planning backup and restoration tasks, select the data to be backed up or restored strictly based on the service logic, data store structure, and database or table association. By default, the system creates periodic backup tasks <strong id="admin_guide_000399__b114245518711381">default-oms</strong> and <strong id="admin_guide_000399__b169340638511381">default-</strong><strong id="admin_guide_000399__b53201054911381"><em id="admin_guide_000399__i165379693411381">cluster ID</em></strong> at an interval of one hour. OMS metadata and cluster metadata, such as DBService and NameNode, can be fully backed up to local disks.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p46847761"><strong id="admin_guide_000399__en-us_topic_0046736760_b18976672">Snapshot</strong></p>
<p id="admin_guide_000399__en-us_topic_0046736760_p36572320">The system uses the snapshot technology to quickly back up data. Snapshots include HBase and HDFS snapshots.</p>
<ul id="admin_guide_000399__en-us_topic_0046736760_ul60715427"><li id="admin_guide_000399__en-us_topic_0046736760_li9567939">HBase snapshots<p id="admin_guide_000399__en-us_topic_0046736760_p19002592"><a name="admin_guide_000399__en-us_topic_0046736760_li9567939"></a><a name="en-us_topic_0046736760_li9567939"></a>An HBase snapshot is a backup file of HBase tables at a specified time point. This backup file does not replicate service data or affect the RegionServer. The HBase snapshot replicates table metadata, including table descriptor, region info, and HFile reference information. The metadata can be used to restore data before the snapshot creation time.</p>
</li><li id="admin_guide_000399__en-us_topic_0046736760_li36805603">HDFS snapshots<p id="admin_guide_000399__en-us_topic_0046736760_p62814973"><a name="admin_guide_000399__en-us_topic_0046736760_li36805603"></a><a name="en-us_topic_0046736760_li36805603"></a>An HDFS snapshot is a read-only backup of HDFS at a specified time point. The snapshot is used in data backup, misoperation protection, and disaster recovery scenarios.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p28463847">The snapshot function can be enabled for any HDFS directory to create the related snapshot file. Before creating a snapshot for a directory, the system automatically enables the snapshot function for the directory. Creating a snapshot does not affect any HDFS operation. A maximum of 65,536 snapshots can be created for each HDFS directory.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p54848034">When a snapshot is being created for an HDFS directory, the directory cannot be deleted or modified before the snapshot is created. Snapshots cannot be created for the upper-layer directories or subdirectories of the directory.</p>
</li></ul>
<p id="admin_guide_000399__en-us_topic_0046736760_p23870264"><strong id="admin_guide_000399__en-us_topic_0046736760_b13505784">DistCp</strong></p>
<p id="admin_guide_000399__en-us_topic_0046736760_p54443197">Distributed copy (DistCp) is a tool used to replicate a large amount of data in HDFS in a cluster or between the HDFSs of different clusters. In a backup or restoration task of HBase, HDFS, or Hive, if you back up the data to HDFS of the standby cluster, the system invokes DistCp to perform the operation. Install the <span id="admin_guide_000399__text42371546163">MRS</span> software of the same version for the active and standby clusters and install the cluster.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p20226726">DistCp uses MapReduce to implement data distribution, troubleshooting, restoration, and report. DistCp specifies different Map jobs for various source files and directories in the specified list. Each Map job copies the data in the partition that corresponds to the specified file in the list.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p47822814">If you use DistCp to replicate data between HDFSs of two clusters, configure the cross-cluster mutual trust (mutual trust does not need to be configured for clusters managed by the same <span id="admin_guide_000399__text1096165841314">MRS</span> Manager) and cross-cluster replication for both clusters. When backing up the cluster data to HDFS in another cluster, you need to install the Yarn component. Otherwise, the backup fails.</p>
<p id="admin_guide_000399__en-us_topic_0046736760_p27752149"><strong id="admin_guide_000399__en-us_topic_0046736760_b48442757">Local rapid restoration</strong></p>
<p id="admin_guide_000399__en-us_topic_0046736760_p33331629">After using DistCp to back up the HBase, HDFS, and Hive data of the local cluster to the HDFS of the standby cluster, the HDFS of the local cluster retains the backup data snapshots. You can create local rapid restoration tasks to restore data by using the snapshot files in the HDFS of the local cluster.</p>
<p id="admin_guide_000399__p1478542814519"><strong id="admin_guide_000399__b117870288451">NAS</strong></p>
<p id="admin_guide_000399__p12787028204516">Network Attached Storage (NAS) is a dedicated data storage server which includes the storage components and embedded system software. It provides the cross-platform file sharing function. By using NFS (supporting NFSv3 and NFSv4) and CIFS (supporting SMBv2 and SMBv3), you can connect the service plane of <span id="admin_guide_000399__text14683175616166">MRS</span> to the NAS server to back up data to the NAS or restore data from the NAS.</p>
<div class="note" id="admin_guide_000399__note1555331743216"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="admin_guide_000399__ul184841961513"><li id="admin_guide_000399__li193192111515">Before data is backed up to the NAS, the system automatically mounts the NAS shared address to a local partition of the backup task execution node. After the backup is complete, the system unmounts the NAS shared partition from the backup task execution node.</li><li id="admin_guide_000399__li1828358142518">To prevent backup and restoration failures, do not access the shared address where the NAS server has been mounted to, for example, <strong id="admin_guide_000399__b11919183411381">/srv/BigData/LocalBackup/nas</strong>, during data backup and restoration.</li><li id="admin_guide_000399__li1248319121515">When service data is backed up to the NAS, DistCp is used.</li></ul>
</div></div>
</div>
<div class="section" id="admin_guide_000399__sb82fae94b80f4043824476e32a82995f"><h4 class="sectiontitle">Specifications</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000399__en-us_topic_0046736760_table15507413" frame="border" border="1" rules="all"><caption><b>Table 4 </b>Specifications of the backup and restoration feature</caption><thead align="left"><tr id="admin_guide_000399__en-us_topic_0046736760_row38246463"><th align="left" class="cellrowborder" valign="top" width="51.54%" id="mcps1.3.3.2.2.3.1.1"><p id="admin_guide_000399__en-us_topic_0046736760_p10955833">Item</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="48.46%" id="mcps1.3.3.2.2.3.1.2"><p id="admin_guide_000399__en-us_topic_0046736760_p15007308">Specification</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000399__en-us_topic_0046736760_row7632415"><td class="cellrowborder" valign="top" width="51.54%" headers="mcps1.3.3.2.2.3.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p14245850">Maximum number of backup or restoration tasks</p>
</td>
<td class="cellrowborder" valign="top" width="48.46%" headers="mcps1.3.3.2.2.3.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p13063206">100</p>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row50459994"><td class="cellrowborder" valign="top" width="51.54%" headers="mcps1.3.3.2.2.3.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p60727681">Number of concurrent tasks in a cluster</p>
</td>
<td class="cellrowborder" valign="top" width="48.46%" headers="mcps1.3.3.2.2.3.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p19995149">1</p>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row45738621"><td class="cellrowborder" valign="top" width="51.54%" headers="mcps1.3.3.2.2.3.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p13840795">Maximum number of waiting tasks</p>
</td>
<td class="cellrowborder" valign="top" width="48.46%" headers="mcps1.3.3.2.2.3.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p47362577">199</p>
</td>
</tr>
<tr id="admin_guide_000399__en-us_topic_0046736760_row23610013"><td class="cellrowborder" valign="top" width="51.54%" headers="mcps1.3.3.2.2.3.1.1 "><p id="admin_guide_000399__en-us_topic_0046736760_p33362891">Maximum size (GB) of backup files on a Linux local disk</p>
</td>
<td class="cellrowborder" valign="top" width="48.46%" headers="mcps1.3.3.2.2.3.1.2 "><p id="admin_guide_000399__en-us_topic_0046736760_p18039654">600</p>
</td>
</tr>
</tbody>
</table>
</div>
<div class="note" id="admin_guide_000399__note1750125019392"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><p id="admin_guide_000399__p15511350123918">If service data is stored in the ZooKeeper upper-layer components, ensure that the number of znodes in a single backup or restoration task is not too large. Otherwise, the task will fail, and the ZooKeeper service performance will be affected. To check the number of znodes in a single backup or restoration task, perform the following operations:</p>
<ul id="admin_guide_000399__ul19500418473"><li id="admin_guide_000399__li159509412472">Ensure that the number of znodes in a single backup or restoration task is smaller than the upper limit of OS file handles. Specifically:<ol id="admin_guide_000399__ol104914175017"><li id="admin_guide_000399__li14495417508">To check the upper limit at the system level, run the <strong id="admin_guide_000399__b388714412214">cat /proc/sys/fs/file-max</strong> command.</li><li id="admin_guide_000399__li171891737205111">To check the upper limit at the user level, run the <strong id="admin_guide_000399__b15481146229">ulimit -n</strong> command.</li></ol>
</li><li id="admin_guide_000399__li125706734915">If the number of znodes in the parent directory exceeds the upper limit, back up and restore data in its sub-directories in batches. To check the number of znodes using ZooKeeper client scripts, perform the following operations:<ol id="admin_guide_000399__ol18646124618540"><li id="admin_guide_000399__li17646174613546">On <span id="admin_guide_000399__text11703135921312">MRS</span> Manager, choose <strong id="admin_guide_000399__b199753946311381">Cluster</strong>, click the name of the desired cluster, choose <strong id="admin_guide_000399__b350112085311">Services</strong> &gt; <strong id="admin_guide_000399__b46111428205315">ZooKeeper</strong> &gt; <strong id="admin_guide_000399__b1028533412535">Instance</strong>, and view the management IP address of each ZooKeeper role.</li><li id="admin_guide_000399__li1221510234411">Log in to the node where the client is located and run the following command:<p id="admin_guide_000399__p15950102817346"><a name="admin_guide_000399__li1221510234411"></a><a name="li1221510234411"></a><strong id="admin_guide_000399__b20543043211381">zkCli.sh -server </strong><em id="admin_guide_000399__i94939623911381">ip</em><strong id="admin_guide_000399__b207679169411381">:</strong><em id="admin_guide_000399__i27337372511381">port</em>, where, <em id="admin_guide_000399__i175270225011381">ip</em> can be any management IP address, and the default port number is 2181.</p>
</li><li id="admin_guide_000399__li40171717343">If the following information is displayed, login to the ZooKeeper server is successful:<pre class="screen" id="admin_guide_000399__screen73716331419">WatchedEvent state:SyncConnected type:None path:null
[zk: ip:port(CONNECIED) 0]</pre>
</li><li id="admin_guide_000399__li132622461401">Run the <strong id="admin_guide_000399__b175766775611381">getusage</strong> command to check the number of znodes in the directory to be backed up.<p id="admin_guide_000399__p4416171018448">For example, <strong id="admin_guide_000399__b98320747811381">getusage /hbase/region</strong>. In the command output, <strong id="admin_guide_000399__b38006521711381">Node count=xxxxxx</strong> indicates the number of znodes stored in the <strong id="admin_guide_000399__b1633312534554">region</strong> directory.</p>
</li></ol>
</li></ul>
</div></div>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000399__table1298520243201" frame="border" border="1" rules="all"><caption><b>Table 5 </b>Specifications of the default task</caption><thead align="left"><tr id="admin_guide_000399__row8986132419207"><th align="left" class="cellrowborder" valign="top" width="19.82%" id="mcps1.3.3.4.2.7.1.1"><p id="admin_guide_000399__p298622412018">Item</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="12.629999999999999%" id="mcps1.3.3.4.2.7.1.2"><p id="admin_guide_000399__p5986224172017">OMS</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="12.31%" id="mcps1.3.3.4.2.7.1.3"><p id="admin_guide_000399__p13986142412015">HBase</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="14.44%" id="mcps1.3.3.4.2.7.1.4"><p id="admin_guide_000399__p169862240202">Kafka</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="17.27%" id="mcps1.3.3.4.2.7.1.5"><p id="admin_guide_000399__p18986132410205">DBService</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="23.53%" id="mcps1.3.3.4.2.7.1.6"><p id="admin_guide_000399__p2098632416209">NameNode</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000399__row10986324172010"><td class="cellrowborder" valign="top" headers="mcps1.3.3.4.2.7.1.1 "><p id="admin_guide_000399__p5986152418203">Backup period</p>
</td>
<td class="cellrowborder" colspan="5" valign="top" headers="mcps1.3.3.4.2.7.1.2 mcps1.3.3.4.2.7.1.3 mcps1.3.3.4.2.7.1.4 mcps1.3.3.4.2.7.1.5 mcps1.3.3.4.2.7.1.6 "><p id="admin_guide_000399__p3986024192012">1 hour</p>
</td>
</tr>
<tr id="admin_guide_000399__row16986172416203"><td class="cellrowborder" valign="top" headers="mcps1.3.3.4.2.7.1.1 "><p id="admin_guide_000399__p89861024152014">Maximum number of backups</p>
</td>
<td class="cellrowborder" colspan="4" valign="top" headers="mcps1.3.3.4.2.7.1.2 mcps1.3.3.4.2.7.1.3 mcps1.3.3.4.2.7.1.4 mcps1.3.3.4.2.7.1.5 "><p id="admin_guide_000399__p498614241204">168 (7-day historical data)</p>
</td>
<td class="cellrowborder" valign="top" headers="mcps1.3.3.4.2.7.1.6 "><p id="admin_guide_000399__p17986112418200">24 (one-day historical data)</p>
</td>
</tr>
<tr id="admin_guide_000399__row17986124192012"><td class="cellrowborder" valign="top" width="19.82%" headers="mcps1.3.3.4.2.7.1.1 "><p id="admin_guide_000399__p2987192432014">Maximum size of a backup file</p>
</td>
<td class="cellrowborder" valign="top" width="12.629999999999999%" headers="mcps1.3.3.4.2.7.1.2 "><p id="admin_guide_000399__p39878249206">10 MB</p>
</td>
<td class="cellrowborder" valign="top" width="12.31%" headers="mcps1.3.3.4.2.7.1.3 "><p id="admin_guide_000399__p498752418203">10 MB</p>
</td>
<td class="cellrowborder" valign="top" width="14.44%" headers="mcps1.3.3.4.2.7.1.4 "><p id="admin_guide_000399__p19987172413205">512 MB</p>
</td>
<td class="cellrowborder" valign="top" width="17.27%" headers="mcps1.3.3.4.2.7.1.5 "><p id="admin_guide_000399__p7987124112013">100 MB</p>
</td>
<td class="cellrowborder" valign="top" width="23.53%" headers="mcps1.3.3.4.2.7.1.6 "><p id="admin_guide_000399__p15987192442018">20 GB</p>
</td>
</tr>
<tr id="admin_guide_000399__row9987132410205"><td class="cellrowborder" valign="top" width="19.82%" headers="mcps1.3.3.4.2.7.1.1 "><p id="admin_guide_000399__p129871624162015">Maximum size of disk space used</p>
</td>
<td class="cellrowborder" valign="top" width="12.629999999999999%" headers="mcps1.3.3.4.2.7.1.2 "><p id="admin_guide_000399__p1398714249206">1.64 GB</p>
</td>
<td class="cellrowborder" valign="top" width="12.31%" headers="mcps1.3.3.4.2.7.1.3 "><p id="admin_guide_000399__p1998792416207">1.64 GB</p>
</td>
<td class="cellrowborder" valign="top" width="14.44%" headers="mcps1.3.3.4.2.7.1.4 "><p id="admin_guide_000399__p7987122472012">84 GB</p>
</td>
<td class="cellrowborder" valign="top" width="17.27%" headers="mcps1.3.3.4.2.7.1.5 "><p id="admin_guide_000399__p1998762413209">16.41 GB</p>
</td>
<td class="cellrowborder" valign="top" width="23.53%" headers="mcps1.3.3.4.2.7.1.6 "><p id="admin_guide_000399__p17987192414209">480 GB</p>
</td>
</tr>
<tr id="admin_guide_000399__row8987162411204"><td class="cellrowborder" valign="top" headers="mcps1.3.3.4.2.7.1.1 "><p id="admin_guide_000399__p1198718242208">Storage path of backup data</p>
</td>
<td class="cellrowborder" colspan="5" valign="top" headers="mcps1.3.3.4.2.7.1.2 mcps1.3.3.4.2.7.1.3 mcps1.3.3.4.2.7.1.4 mcps1.3.3.4.2.7.1.5 mcps1.3.3.4.2.7.1.6 "><p id="admin_guide_000399__p1398792492019"><em id="admin_guide_000399__i18206128115919">Data storage path</em><strong id="admin_guide_000399__b1207172845915">/LocalBackup/</strong> of the active and standby management nodes</p>
</td>
</tr>
</tbody>
</table>
</div>
<div class="note" id="admin_guide_000399__en-us_topic_0046736760_note34244056"><img src="public_sys-resources/note_3.0-en-us.png"><span class="notetitle"> </span><div class="notebody"><ul id="admin_guide_000399__en-us_topic_0046736760_ul39761052"><li id="admin_guide_000399__en-us_topic_0046736760_li22305155">The backup data of the default backup task must be periodically transferred and saved outside the cluster based on the enterprise O&amp;M requirements.</li><li id="admin_guide_000399__en-us_topic_0046736760_li66528668">Administrators can create DistCp backup tasks to save OMS, DBService, and NameNode data to external clusters.</li><li id="admin_guide_000399__li178651381486">The execution time of a cluster data backup task can be calculated using the following formula: Task execution time = Volume of data to be backed up/Network bandwidth between the cluster and the backup device. In practice, you are advised to multiply the calculated time by 1.5 to get the reference value of the task execution time.</li><li id="admin_guide_000399__li193954018">Executing a data backup task affects the maximum I/O performance of the cluster. Therefore, you are advised to execute a backup task during off-peak hours.</li></ul>
</div></div>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="admin_guide_000198.html">Backup and Recovery Management</a></div>
</div>
</div>