Files
doc-exports/docs/mrs/umn/admin_guide_000424.html
yangtong c285e88a17 MRS UMN 20250806 version
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: yangtong <yangtong2@huawei.com>
Co-committed-by: yangtong <yangtong2@huawei.com>
2025-09-02 10:43:57 +00:00

53 lines
18 KiB
HTML

<a name="admin_guide_000424"></a><a name="admin_guide_000424"></a>
<h1 class="topictitle1">Restoring Doris Service Data</h1>
<div id="body0000002413397197"><div class="section" id="admin_guide_000424__en-us_topic_0000001866255229_section2852132524811"><h4 class="sectiontitle">Scenario</h4><p id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_p60353288">Doris data needs to be recovered in the following scenarios: data is modified or deleted unexpectedly and needs to be restored. After an administrator performs critical data adjustment in the Doris, an exception occurs or the operation has not achieved the expected result. All modules are faulty and become unavailable. Data is migrated to a new cluster.</p>
<p id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_p6308685">System administrators can create a recovery task in MRS Manager to recover Doris data. Only manual restoration tasks are supported.</p>
<p id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_p56778172">When executing backup and restoration tasks, you need to manage unified restoration points based on service scenarios to ensure proper service running.</p>
<div class="notice" id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_note35629183"><span class="noticetitle"><img src="public_sys-resources/notice_3.0-en-us.png"> </span><div class="noticebody"><ul id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_ul52227199"><li id="admin_guide_000424__li291561045914">This topic is available for clusters of MRS <span id="admin_guide_000424__ph1132514017222">3.5.0</span> and later only.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1477269716310">Data can be restored only when the system version during data backup is the same as the current system version.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_li282748">To restore data when services are normal, manually back up the latest management data before restoring data. Otherwise, the Doris data generated after data backup and before data restoration will be lost.</li></ul>
</div></div>
</div>
<div class="section" id="admin_guide_000424__en-us_topic_0000001866255229_s618fe949ea1640cdb33e6226d3911c36"><h4 class="sectiontitle">Impact on the System</h4><p id="admin_guide_000424__en-us_topic_0000001866255229_p1561413917169">After data is restored, the data generated after data backup and before data restoration is lost.</p>
</div>
<div class="section" id="admin_guide_000424__en-us_topic_0000001866255229_section1531641154814"><h4 class="sectiontitle">Prerequisites</h4><ul id="admin_guide_000424__en-us_topic_0000001866255229_ul1222213533213"><li id="admin_guide_000424__en-us_topic_0000001866255229_li1134811244713">To restore data from a remote HDFS, the following conditions must be met:<ul id="admin_guide_000424__en-us_topic_0000001866255229_ul195198496480"><li id="admin_guide_000424__en-us_topic_0000001866255229_l6b2ccbf0a54a4d7994c8b888340d4fc3">A standby cluster for restoring data has been created, and data in this cluster has been backed up. For details, see <a href="admin_guide_000423.html">Backing Up Doris Data</a>. If the active cluster is deployed in security mode and the active and standby clusters are not managed by the same MRS Manager, mutual trust has been configured. For details, see <a href="admin_guide_000177.html">Configuring Cross-Manager Mutual Trust Between Clusters</a>. If the active cluster is deployed in normal mode, you do not need to configure mutual trust.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1093064204918">At least one DBroker instance of the Doris service has been deployed in the active cluster.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1136815366503">The time on the active and standby clusters must be the same, and the NTP service on the active and standby clusters uses the same time source.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li5865946205014">The value of <strong id="admin_guide_000424__en-us_topic_0000001866255229_b152167309565">hadoop.rpc.protection</strong> of Doris must be the same as that of <strong id="admin_guide_000424__en-us_topic_0000001866255229_b1721612306566">hadoop.rpc.protection</strong> of HDFS in both active and standby clusters.</li></ul>
</li></ul>
<ul id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_ul38613850"><li id="admin_guide_000424__en-us_topic_0000001866255229_li107784143346">If you want to restore data from OBS, you have connected the Doris cluster to OBS and have the permission to access OBS.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_li32436794">The database for storing restored data tables, the location for storing the data tables in HDFS, and the list of users who can access the restored data have been planned.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_li23495697">Check the path for storing Doris backup files.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_en-us_topic_0046736775_li10134686">Stop the upper-layer Doris applications.</li></ul>
</div>
<div class="section" id="admin_guide_000424__en-us_topic_0000001866255229_section113917518484"><h4 class="sectiontitle">Restoring Doris Service Data</h4><ol id="admin_guide_000424__en-us_topic_0000001866255229_ol1323744816521"><li id="admin_guide_000424__en-us_topic_0000001866255229_li2113658016521"><span>On MRS Manager, choose <strong id="admin_guide_000424__en-us_topic_0000001866255229_b10202146641449">O&amp;M</strong> &gt; <strong id="admin_guide_000424__en-us_topic_0000001866255229_b184016803441449">Backup and Restoration</strong> &gt; <strong id="admin_guide_000424__en-us_topic_0000001866255229_b189985234441449">Backup Management</strong>.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li6526329016521"><a name="admin_guide_000424__en-us_topic_0000001866255229_li6526329016521"></a><a name="en-us_topic_0000001866255229_li6526329016521"></a><span>In the <strong id="admin_guide_000424__en-us_topic_0000001866255229_b132758850141449">Operation</strong> column of a specified task in the task list, click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b149378139741449">More</strong> and select <strong id="admin_guide_000424__en-us_topic_0000001866255229_b94446350541449">View History</strong> to view historical execution records of backup tasks.</span><p><p class="litext" id="admin_guide_000424__en-us_topic_0000001866255229_p3434142316521">In the window that is displayed, select a success record and click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b44487983541449">View</strong> in the <strong id="admin_guide_000424__en-us_topic_0000001866255229_b119004182141449">Backup Path</strong> column to view its backup path information and find the following information:</p>
<ul class="subitemlist" id="admin_guide_000424__en-us_topic_0000001866255229_ul2962109816521"><li id="admin_guide_000424__en-us_topic_0000001866255229_li4063735216521"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b207141705741449">Backup Object</strong>: indicates the backup data source.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li329123316521"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b148176083841449">Backup Path</strong>: indicates the full path for storing backup files.<p id="admin_guide_000424__en-us_topic_0000001866255229_p3019185416521">Select the correct path and copy the full path of backup files in <strong id="admin_guide_000424__en-us_topic_0000001866255229_b33015843041449">Backup Path</strong>.</p>
</li></ul>
</p></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li5183509916521"><span>Choose <strong id="admin_guide_000424__en-us_topic_0000001866255229_b44251021256">Restoration Management</strong> and click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b1086565520415">Create</strong>.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li6386271016521"><span>Set <strong id="admin_guide_000424__en-us_topic_0000001866255229_b62870203541449">Task Name</strong> to the name of the restoration task.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1696835325016"><span>Select the desired cluster from <strong id="admin_guide_000424__en-us_topic_0000001866255229_b110714666741449">Recovery Object</strong>.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li3789348516521"><span>In <strong id="admin_guide_000424__en-us_topic_0000001866255229_b59319411930">Restoration Configuration</strong>, select <strong id="admin_guide_000424__en-us_topic_0000001866255229_b18932041835">Doris</strong> under <strong id="admin_guide_000424__en-us_topic_0000001866255229_b129494119316">Service data</strong>.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li4457996415256"><span>Set <strong id="admin_guide_000424__en-us_topic_0000001866255229_b167222513316">Path Type</strong> of <strong id="admin_guide_000424__en-us_topic_0000001866255229_b972218511833">Doris</strong> to a restoration directory type.</span><p>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="admin_guide_000424__en-us_topic_0000001866255229_table194030385228" frame="border" border="1" rules="all"><caption><b>Table 1 </b>Path for data restoration</caption><thead align="left"><tr id="admin_guide_000424__en-us_topic_0000001866255229_row134045389225"><th align="left" class="cellrowborder" valign="top" width="30%" id="mcps1.3.4.2.7.2.1.2.3.1.1"><p id="admin_guide_000424__en-us_topic_0000001866255229_p11404438122212">Directory Type</p>
</th>
<th align="left" class="cellrowborder" valign="top" width="70%" id="mcps1.3.4.2.7.2.1.2.3.1.2"><p id="admin_guide_000424__en-us_topic_0000001866255229_p24040386226">Description</p>
</th>
</tr>
</thead>
<tbody><tr id="admin_guide_000424__en-us_topic_0000001866255229_row1940493812221"><td class="cellrowborder" valign="top" width="30%" headers="mcps1.3.4.2.7.2.1.2.3.1.1 "><p id="admin_guide_000424__en-us_topic_0000001866255229_p18404173862215">RemoteHDFS</p>
</td>
<td class="cellrowborder" valign="top" width="70%" headers="mcps1.3.4.2.7.2.1.2.3.1.2 "><div class="p" id="admin_guide_000424__en-us_topic_0000001866255229_p73521019161712">The backup files are stored in the HDFS directory of the standby cluster. If you select this option, you also need to configure the following parameters:<ul id="admin_guide_000424__en-us_topic_0000001866255229_ul10645143314376"><li id="admin_guide_000424__en-us_topic_0000001866255229_li464583311375"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b213680649041449">Source NameService Name</strong>: <span id="admin_guide_000424__en-us_topic_0000001866255229_text72744777541449">indicates the NameService name of the backup data cluster, for example, <strong id="admin_guide_000424__en-us_topic_0000001866255229_b55421474741449">hacluster</strong>.</span> You can obtain it from the <strong id="admin_guide_000424__en-us_topic_0000001866255229_b194295817141449">NameService Management</strong> page of HDFS of the standby cluster.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li19645113363713"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b214243672041449">IP Mode</strong>: indicates the mode of the target IP address. The system automatically selects an IP address mode based on the cluster network type, for example, <strong id="admin_guide_000424__en-us_topic_0000001866255229_b206874975841449">IPv4</strong> or <strong id="admin_guide_000424__en-us_topic_0000001866255229_b150507238441449">IPv6</strong>.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li11646103363711"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b161811568517">Source NameNode IP Address</strong>: indicates the service plane IP address of the NameNode in the standby cluster.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li18646113319373"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b12367920461">Source NameNode RPC Port</strong>: indicates the value of <strong id="admin_guide_000424__en-us_topic_0000001866255229_b1836862016617">dfs.namenode.rpc.port</strong> in the HDFS configuration of the standby cluster.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li5792104211550"><strong id="admin_guide_000424__en-us_topic_0000001866255229_b2809104219613">DBroker IP</strong>: indicates the IP address of a service plane where the DBroker role in the cluster is deployed. The DBroker is used to transmit data during restoration.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li864623363718"><strong id="admin_guide_000424__b46868544136">Source Path</strong>: indicates the full path of the HDFS directory for storing backup data of the standby cluster. For details, see <strong id="admin_guide_000424__b13686175418137">Backup Path</strong> obtained in <a href="#admin_guide_000424__en-us_topic_0000001866255229_li6526329016521">2</a>. for example, <em id="admin_guide_000424__en-us_topic_0000001866255229_i8807105310715">Backup path/Backup task name_Data source_Task creation time/</em>.</li></ul>
</div>
</td>
</tr>
<tr id="admin_guide_000424__en-us_topic_0000001866255229_row18404738182219"><td class="cellrowborder" valign="top" width="30%" headers="mcps1.3.4.2.7.2.1.2.3.1.1 "><p id="admin_guide_000424__en-us_topic_0000001866255229_p1740423816225">OBS</p>
</td>
<td class="cellrowborder" valign="top" width="70%" headers="mcps1.3.4.2.7.2.1.2.3.1.2 "><p id="admin_guide_000424__en-us_topic_0000001866255229_p11265172672713">Data is restored from OBS. If you select this option, you also need to configure the following parameters:</p>
<p id="admin_guide_000424__en-us_topic_0000001866255229_p66591842961"><strong id="admin_guide_000424__b1182218841411">Source Path</strong>: indicates the full OBS directory of the backup files. Specify this path by referring to <a href="#admin_guide_000424__en-us_topic_0000001866255229_li6526329016521">2</a>, for example, <em id="admin_guide_000424__en-us_topic_0000001866255229_i203115221641449">Backup path/Backup task name_Data source_Task creation time/</em>.</p>
</td>
</tr>
</tbody>
</table>
</div>
</p></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li2745011616521"><span>Click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b1978535975712">Refresh</strong> and select a Doris backup file set that has been backed up.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1820462716521"><span>In the <strong id="admin_guide_000424__en-us_topic_0000001866255229_b10887143791014">Data Configuration</strong> area, select one or more pieces of backup data for <strong id="admin_guide_000424__en-us_topic_0000001866255229_b17887203719104">Select Data</strong> based on service requirements.</span><p><p class="litext" id="admin_guide_000424__en-us_topic_0000001866255229_p5303142117817">Configuration restrictions are as follows:</p>
<ul class="subitemlist" id="admin_guide_000424__en-us_topic_0000001866255229_ul93030212812"><li id="admin_guide_000424__en-us_topic_0000001866255229_li13641202319172">There is a database with the same name as the original database of the selected backup data in the Doris of the cluster.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1330318214812">The backup data is restored to the backup table with the same name as the original table in the database.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li16053614238">If there is a table with the same name in Doris, ensure that the structures of the two tables are the same, including table names, columns, partitions, and materialized views.</li></ul>
</p></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li150719371749"><span>Set <strong id="admin_guide_000424__en-us_topic_0000001866255229_b440414335110">Original Configurations</strong> to <strong id="admin_guide_000424__en-us_topic_0000001866255229_b19792944115116">true</strong>, indicating that the configuration of the backup data, such as the number of copies, will be used. If this parameter is set to <strong id="admin_guide_000424__en-us_topic_0000001866255229_b1086783612195">false</strong>, the default configuration is used to create a table.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li99583496207"><span>Click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b42114199041449">OK</strong>.</span></li><li id="admin_guide_000424__en-us_topic_0000001866255229_li147082716521"><span>In the restoration task list, locate a created task and click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b76118874841449">Start</strong> in the <strong id="admin_guide_000424__en-us_topic_0000001866255229_b43786301741449">Operation</strong> column to execute the restoration task.</span><p><ul id="admin_guide_000424__en-us_topic_0000001866255229_ul5981574816521"><li id="admin_guide_000424__en-us_topic_0000001866255229_li886695516521">After the restoration is successful, the progress bar is in green.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li1269373116521">After the restoration is successful, the restoration task cannot be re-executed.</li><li id="admin_guide_000424__en-us_topic_0000001866255229_li2155927516521">If the restoration task fails during the first execution, rectify the fault and click <strong id="admin_guide_000424__en-us_topic_0000001866255229_b103991677941449">Retry</strong> to re-execute the task.</li></ul>
</p></li></ol>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="admin_guide_000215.html">Recovering Data</a></div>
</div>
</div>