This section describes how to configure a default item. You can perform the operations in this section only if you have the permissions of DARTS Administrator or Tenant Administrator.
If multiple jobs use the same parameter, you can set it as a default configuration item. In this way, you do not need to set the parameter for each job.

| Configuration Item | Affected Module | Main Usage |
|---|---|---|
| | Job scheduling | |
| | Job scheduling | Policy for executing nodes with multiple IF conditions |
| | Script/Job development | Policy for grabbing the lock of a job or script |
| | Script development | Format definition of script variables. Two formats are available: ${} and ${dlf.}. |
| | Script/Job development | Policy for downloading or dumping the SQL execution result |
| | Job development | When a node in a DataArts Studio job is associated with a script or a job of another service, the node name does not change accordingly. |
| | Job development | A simple variable set provides a series of custom variables that automatically replace parameters during job scheduling. |
| | O&M and scheduling | Notification type for jobs whose status is failure ignored |
| | Job execution | Whether a node will be re-executed if it fails upon timeout |
| | Job execution | If you select Yes, the waiting time before an instance starts running is excluded from the instance timeout duration. |
| | Job development | Rules for splitting string parameters (parameters enclosed by "") in the JAR packages of MRS MapReduce and MRS Spark operators |
| | O&M and scheduling | Whether a waiting instance synchronizes the latest job version when it runs |
| | Script/Job development | |
| | Job development | If you select Yes, parameter MRS Resource Queue is mandatory. |
| PatchData Job Priority | O&M – PatchData | Priority of a PatchData job. If system resources are insufficient, computing resources are preferentially allocated to jobs with higher priorities. A larger value indicates a higher priority. Priorities can be set only for DLI SQL operators. |
| | O&M and scheduling | Days to wait before job instances are canceled. If the wait time of a job instance exceeds the value of this parameter, the instance will be canceled. The minimum value is 2, that is, a job instance can be canceled only after two days. The default value is 60 days. |
| | O&M and scheduling | Days in which alarms can be reported for job instances. The default value is 7, that is, alarms can be reported for the job instances created within the last seven days, but not for those created before that. |
| | O&M and scheduling | Default policy for retrying a failed job operator |
| | O&M and scheduling | If you select All jobs, Real-time jobs, or Batch jobs, an alarm is generated each time a job fails to be retried. If you select Disable, an alarm is generated only when the maximum number of retries has been reached for the job. |
| | Job development (job execution) | If this function is enabled, the statement set mapreduce.job.name=Script name is automatically passed to MRS for each Hive SQL script during job execution in the current workspace. |
| | Job scheduling | Jobs can be depended on by jobs in other workspaces (requires the permission to query the job list in the workspace). All default roles in the workspace have this permission. Custom roles must have the job query permission in DataArts Factory. |
| | Script/Job development | Which script execution results are displayed |
| | Job development (job test) | Identity for testing jobs. |
| | Spark SQL script/job development | Whether any parameters can be set for Spark SQL jobs and scripts |
| | Hive SQL script/job development | Whether any parameters can be set for Hive SQL jobs and scripts |
| Job/Script Change Management | Job/Script import and export | Whether to enable job/script change management for the workspace |
| | Viewing logs | Whether to open logs on a new tab or in a pop-up window |

Three options are available. The default value is Terminate.
To configure the policy for executing nodes with multiple IF conditions, perform the following operations:
The following two options are available:
For details, see Configuring the Policy for Executing a Node with Multiple IF Statements.
The policy determines how you can grab the lock of a job or script. If you use a soft lock, you can grab the lock of a job or script regardless of whether you have the lock. If you use a hard lock, you can only unlock or grab the lock of a job or script for which you have the lock. Operations such as publish, execution, and scheduling are not restricted by locks.
You can configure the hard/soft policy based on your needs.
The default policy is Soft Lock.
Variables of an SQL script can be in ${} or ${dlf.} format. You can configure either type as needed. The configured variable format applies to SQL scripts, SQL statements in jobs, single-node jobs, and environment variables.
The default variable format is ${}.
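The difference between the two formats can be illustrated with a small sketch. It is not the service's own parser: the function name render, the regular expressions, and the assumption that a variable in the second format is written as ${dlf.name} are all hypothetical and serve only to show how one configured format is replaced while the other is left untouched.

```python
import re

# Hypothetical patterns, one per configurable variable format.
PATTERNS = {
    "${}": re.compile(r"\$\{(?!dlf\.)(\w+)\}"),
    "${dlf.}": re.compile(r"\$\{dlf\.(\w+)\}"),
}


def render(sql: str, variables: dict, variable_format: str = "${}") -> str:
    """Replace variables written in the configured format.

    Names that are not defined in `variables` are left as they are.
    """
    pattern = PATTERNS[variable_format]
    return pattern.sub(
        lambda match: str(variables.get(match.group(1), match.group(0))), sql
    )
```

For example, render("select * from t where dt = ${day}", {"day": "2024-01-01"}) replaces ${day}, whereas the same call leaves a ${dlf.day} reference unchanged unless variable_format is set to "${dlf.}".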
By default, all users can download and dump the execution results of SQL scripts. If you do not want all users to have this permission, perform the following steps to configure a data export policy:
The default data export policy is All User Can.
On the Develop Job page, when you select a script for a node or associate a node with the function of another cloud service, the node name will be automatically changed to the script name or function name. You can disable this function.
The simple variable set provides a series of customized variables to dynamically replace parameters during task scheduling.
To configure the notification type for jobs whose status is failure ignored, perform the following steps:
Successful (default)
You can set this parameter to specify whether a node will be re-executed if it fails upon timeout.
You can specify whether to exclude waiting time from instance timeout duration.
Yes: The waiting time before an instance starts running is excluded from the instance timeout duration.
No: The waiting time before an instance starts running is included in the instance timeout duration.
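The effect of the two options can be sketched as follows. The function name and its parameters are hypothetical; the sketch only assumes that the timeout is compared against either the running time alone or the waiting time plus the running time.

```python
def is_timed_out(
    waiting_seconds: float,
    running_seconds: float,
    timeout_seconds: float,
    exclude_waiting_time: bool,
) -> bool:
    """Return True if the instance has exceeded its timeout duration."""
    if exclude_waiting_time:
        # Yes: only the time since the instance started running is counted.
        counted = running_seconds
    else:
        # No: the time spent waiting before the run is counted as well.
        counted = waiting_seconds + running_seconds
    return counted > timeout_seconds
```

With a 10-minute timeout, an instance that waited 10 minutes and has run for 5 minutes is timed out only if the waiting time is included.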
You can set the rule for splitting the string parameters (enclosed by "") in the JAR package parameters of MRS MapReduce and MRS Spark operators.
Split String Arguments by Space: For example, "select * from table" is split into four parameters by space: select, *, from, and table.
Do not split string arguments: For example, "select * from table" is regarded as one parameter and is not split.
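The two rules can be illustrated with a minimal sketch. The function name split_jar_arguments is hypothetical, and the sketch assumes that the enclosing double quotes are dropped before the parameter is passed on.

```python
from typing import List


def split_jar_arguments(argument: str, split_by_space: bool) -> List[str]:
    """Apply one of the two splitting rules to a quoted string parameter."""
    text = argument.strip('"')  # drop the enclosing double quotes
    if split_by_space:
        return text.split()  # one parameter per space-separated token
    return [text]  # the whole string stays one parameter
```

Calling split_jar_arguments('"select * from table"', True) yields four parameters, while passing False yields a single parameter containing the whole statement.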
You can specify whether a waiting instance can synchronize the latest job version.
Yes: The waiting instance uses the latest job version.
No: The waiting instance still uses the existing job version.
When Hive SQL and Spark SQL statements are executed, DataArts Studio can place SQL statements in OBS or in the request body.
In OBS: Hive SQL and Spark SQL statements are put in OBS, and the OBS path is returned to MRS.
In the request message body: Hive SQL and Spark SQL statements are put in the request message body, and the script content is returned to MRS.
This configuration supports Hive SQL and Spark SQL scripts, and pipeline and single-task jobs.
You can set whether the MRS resource queue is mandatory for configuring an MRS-related job.
Yes: The MRS Resource Queue parameter is mandatory.
No: The MRS Resource Queue parameter is not mandatory.

You can set the priority of a PatchData job. When system resources are insufficient, computing resources are preferentially allocated to jobs with higher priorities. A larger number indicates a higher priority. Currently, only the priorities of DLI SQL operators can be set.
The mapping between the PatchData Job Priority and spark.sql.dli.job.priority of DLI is as follows:
If PatchData Job Priority is set to 1, spark.sql.dli.job.priority of DLI is 1.
If PatchData Job Priority is set to 2, spark.sql.dli.job.priority of DLI is 3.
If PatchData Job Priority is set to 3, spark.sql.dli.job.priority of DLI is 5.
If PatchData Job Priority is set to 4, spark.sql.dli.job.priority of DLI is 8.
If PatchData Job Priority is set to 5, spark.sql.dli.job.priority of DLI is 10.
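The mapping above can be written as a lookup table. The function name is hypothetical; the values are the ones listed above.

```python
# PatchData Job Priority -> spark.sql.dli.job.priority of DLI
PATCHDATA_TO_DLI_PRIORITY = {1: 1, 2: 3, 3: 5, 4: 8, 5: 10}


def dli_job_priority(patchdata_priority: int) -> int:
    """Return the DLI priority that corresponds to a PatchData job priority."""
    try:
        return PATCHDATA_TO_DLI_PRIORITY[patchdata_priority]
    except KeyError:
        raise ValueError(
            f"PatchData Job Priority must be 1 to 5, got {patchdata_priority}"
        ) from None
```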
You can set the number of retention days for waiting job instances. If the waiting time of a job instance exceeds the configured retention days, the job instance is canceled. The minimum number of retention days is 2, that is, a job instance which is not executed can be canceled after at least two days. The default number of retention days is 60.
Send Alarm Upon Instance Cancellation: If you select Yes for this parameter and configure a cancellation notification for a job, an alarm notification will be sent when a historical job instance is canceled upon timeout. If you select No, no alarm notification will be sent.
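The cancellation rule and the alarm condition can be sketched as follows. The function names are hypothetical, and the sketch assumes that "exceeds" means strictly more than the configured number of days.

```python
MIN_RETENTION_DAYS = 2
DEFAULT_RETENTION_DAYS = 60


def should_cancel(waiting_days: float, retention_days: int = DEFAULT_RETENTION_DAYS) -> bool:
    """Return True if a waiting job instance has exceeded the retention days."""
    if retention_days < MIN_RETENTION_DAYS:
        raise ValueError(f"retention days must be at least {MIN_RETENTION_DAYS}")
    return waiting_days > retention_days


def should_send_cancellation_alarm(send_alarm: bool, notification_configured: bool) -> bool:
    """An alarm is sent only if the parameter is Yes and the job has a cancellation notification."""
    return send_alarm and notification_configured
```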
You can set the number of days during which alarms can be generated for monitored job instances. The default value is seven days. Alarms cannot be sent for job instances beyond the seven-day period.
For example, if you set the value of this parameter to 2, alarms can be generated for the job instances of yesterday and today, but cannot be generated for the job instances of the day before yesterday and of an earlier time even if the triggering conditions are met.
The default value is 7. Set a value from 1 to 270.
After you set this parameter, alarms are generated only for the job instances which are created after this parameter is set and not for historical instances.
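The day window can be sketched as follows. The function name is hypothetical, and the sketch counts today as the first day of the window, which matches the example with a value of 2. It does not model the rule that only instances created after the parameter is set are covered.

```python
from datetime import date, timedelta


def is_alarm_eligible(instance_date: date, today: date, alarm_days: int = 7) -> bool:
    """Return True if an alarm can be generated for an instance created on instance_date."""
    if not 1 <= alarm_days <= 270:
        raise ValueError("alarm_days must be from 1 to 270")
    earliest = today - timedelta(days=alarm_days - 1)  # today counts as day 1
    return earliest <= instance_date <= today
```

With alarm_days set to 2, instances of today and yesterday are eligible, and instances of the day before yesterday are not.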
This policy takes effect only for new job operators in the current workspace. The default policy for the operators in historical jobs is not affected. The default value is No.
If this parameter is set to Yes, new job operators can be retried once, and the retry interval is 120 seconds by default.
If you enable this function, an alarm is generated each time a job fails to be retried.
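How the retry policy and the alarm setting interact can be sketched as follows. This is an illustration only, not the scheduler's implementation: the function run_with_retry, its parameters, and the alarm messages are hypothetical. The defaults of one retry and a 120-second interval are taken from the description above.

```python
import time
from typing import Callable, List


def run_with_retry(
    operator: Callable[[], None],
    max_retries: int = 1,
    interval_seconds: float = 120,
    alarm_on_each_failure: bool = False,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Run an operator with retries and return the alarms that would be generated."""
    alarms: List[str] = []
    for attempt in range(max_retries + 1):
        try:
            operator()
            return alarms
        except Exception as error:
            is_last_attempt = attempt == max_retries
            # Enabled: alarm on every failure. Disabled: alarm only when
            # the maximum number of retries has been reached.
            if alarm_on_each_failure or is_last_attempt:
                alarms.append(f"attempt {attempt + 1} failed: {error}")
            if not is_last_attempt:
                sleep(interval_seconds)
    return alarms
```

The sleep parameter exists only so that the sketch can be exercised without actually waiting for the retry interval.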
If this function is enabled, the statement set mapreduce.job.name="Script name" is automatically passed to MRS for each Hive SQL script during job execution in the current workspace.
This function takes effect only if the preceding parameter value has not been set for the script. If the parameter value has been set for the script, the value set is preferentially read and transferred to MRS. This function is unavailable for MRS clusters in security mode. To enable this function for such clusters, set them to non-security mode.
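The precedence rule can be sketched as follows. How the service actually passes the setting to MRS is not described here, so prepending a statement to the script text is an assumption made for illustration, as are the function name with_job_name and the regular expression.

```python
import re

# Matches an existing "set mapreduce.job.name=..." statement in the script.
JOB_NAME_SETTING = re.compile(
    r"^\s*set\s+mapreduce\.job\.name\s*=", re.IGNORECASE | re.MULTILINE
)


def with_job_name(script_name: str, hive_sql: str, security_mode: bool = False) -> str:
    """Add the job name setting unless the script already sets it."""
    # A value set in the script takes precedence, and the function is
    # unavailable for MRS clusters in security mode.
    if security_mode or JOB_NAME_SETTING.search(hive_sql):
        return hive_sql
    return f'set mapreduce.job.name="{script_name}";\n{hive_sql}'
```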
Jobs can be depended on by jobs in other workspaces (requires the permission to query the job list in the workspace). All default roles in the workspace have this permission. Custom roles must have the job query permission in DataArts Factory.
You can set this parameter to control the permissions to view the script execution history.
After configuring this parameter, you can specify the identity used to test jobs.
If no workspace agency or IAM account is available, a personal account is used for job tests.
If you are using a federated account, you must set this parameter to Public agency or IAM account.
You can set this parameter to determine whether any parameters can be set to overwrite the default parameters of the template.
In the MRS API connection mode, default parameters can be configured for Spark SQL scripts. For proxy connections, this function is not supported.
Then go to the basic information page of the Spark SQL job or the Spark SQL script page and click the icon in the upper right corner to view the configured default program parameters. The preset default parameters are unavailable and cannot be modified.
You can also customize program parameters, which can replace the template parameters during the execution of Spark SQL jobs or scripts.
You can set this parameter to determine whether parameters can be set to overwrite the default parameters of the template.
In the MRS API connection mode, default parameters can be configured for Hive SQL scripts. For proxy connections, this function is not supported.
Then go to the basic information page of the Hive SQL job or the Hive SQL script page and click the icon in the upper right corner to view the configured default program parameters. The preset default parameters are unavailable and cannot be modified.
You can also customize program parameters, which can replace the template parameters during the execution of Hive SQL jobs or scripts.
If you enable this function, you can export job/script changes (addition, modification, and deletion) in a workspace to a .zip file, and import the file to another workspace.
You can export and import jobs and scripts in the workspace only if you have set Job/Script Change Management to Yes.
You can configure whether to open logs on a new tab or in a pop-up window.