This section describes how to develop and configure a job.
For details about how to develop a real-time processing Flink SQL job in single-task mode, see sections Developing an SQL Script, Configuring Job Parameters, Saving a Job, and Templates.
Parameter |
Description |
|---|---|
Owner |
An owner configured during job creation is automatically matched. This parameter value can be modified. |
Executor |
This parameter is available when Scheduling Identities is set to Yes. User that executes the job. When you enter an executor, the job is executed by the executor. If the executor is left unspecified, the job is executed by the user who submitted the job for startup. NOTE:
You can configure execution users only after you apply for the whitelist membership. To enable it, contact customer service or technical support. |
Job Agency |
This parameter is available when Scheduling Identities is set to Yes. After an agency is configured, the job interacts with other services as an agency during job execution. |
Priority |
Priority configured during job creation is automatically matched. This parameter value can be modified. |
Execution Timeout |
Timeout of the job instance. If this parameter is set to 0 or is not set, this parameter does not take effect. If the notification function is enabled for the job and the execution time of the job instance exceeds the preset value, the system sends a specified notification, and the job keeps running. |
Exclude Waiting Time from Instance Timeout Duration |
Whether to exclude the wait time from the instance execution timeout duration If you select this option, the time to wait before an instance starts running is excluded from the timeout duration. You can modify this setting in Default Configuration > Exclude Waiting Time from Instance Timeout Duration. If you do not select this option, the time to wait before an instance starts running is included in the timeout duration. |
Custom Parameter |
Set the name and value of the parameter. |
Job Tag |
Configure job tags to manage jobs by category. Click Add to add a tag to the job. You can also select a tag configured in Managing Job Tags. |
Job Description |
Description of the job |
Property |
Description |
|---|---|
Flink SQL properties |
|
Flink Job Name |
Enter the Flink job name. The name is automatically generated in Workspace-Job name format. NOTE:
It can contain only letters, digits, hyphens (-), and underscores. A maximum of 64 characters are allowed, and Chinese characters are not allowed. |
MRS Cluster |
Select an MRS cluster. NOTE:
Currently, jobs with a single Flink SQL node support MRS 3.2.0-LTS.1 and later versions. |
Program Parameter |
Set the job running parameters. This parameter is displayed only after an MRS cluster is selected. (Optional) Configure optimization parameters such as threads, memory, and vCPUs for the job to optimize resource usage and improve job execution performance. CAUTION:
You can query historical checkpoints and select a specified checkpoint to start a real-time Flink SQL job. To make a Flink checkpoint take effect, configure the following two parameters:
Figure 1 Configuring program parameters
![]() NOTE:
This parameter is mandatory if the cluster version is MRS 1.8.7 or later than MRS 2.0.1. Click Select Template and select a parameter template. You can also select multiple templates. For details about how to create templates, see Configuring a Template. For details about the parameters of MRS Flink jobs, see Managing an Existing Cluster > Job Management > Running a Flink Job in MapReduce Service (MRS) User Guide. |
Flink Job Parameter |
Set the parameters for the Flink job. Variables required for executing the Flink job. These variables are specified by the functions in the Hive script. Multiple parameters are separated by spaces. |
MRS Resource Queue |
Select a created MRS resource queue. This parameter is mandatory if Whether MRS Resource Queue Is Mandatory is set to Yes. Select a queue you configured in the queue permissions of DataArts Security. If you set multiple resource queues for this node, the resource queue you select here has the highest priority. |
Rerun Policy |
|
Input Data Path |
Set the input data path. You can select an HDFS or OBS path. |
Output Data Path |
Set the output data path. You can select an HDFS or OBS path. |
Parameter |
Mandatory |
Description |
|---|---|---|
Job Status Polling Interval (s) |
Yes |
Set the interval at which the system checks whether the job is complete. The interval can range from 30s to 60s, or 120s, 180s, 240s, or 300s. During job execution, the system checks the job status at the configured interval. |
Maximum Wait Time |
Yes |
Set the timeout interval for the job. If the job is not complete within the timeout interval and retry is enabled, the job will be executed again. NOTE:
If the job is in starting state and fails to start, it will fail upon timeout. |
Retry upon Failure |
Yes |
Whether to re-execute the job if it fails
|
select 1; select * from a where b="dsfa\;"; --example 1\;example 2.
Enter script parameters in the SQL statement and click Parameter Setup in the right pane of the editor and then click Update from Script. You can also directly configure parameters and constants for the job script.
In the following script example, str1 indicates the parameter name. It can contain only letters, digits, hyphens (-), underscores (_), greater-than signs (>), and less-than signs (<), and can contain a maximum of 16 characters. The parameter name must be unique.
select ${str1} from data;
Click Parameters on the right of the editor and set the parameters described in Table 4.
Function |
Description |
|---|---|
Variables |
|
Add |
Click Add and enter the variable parameter name and parameter value in the text boxes.
After the parameter is configured, it is referenced in the format of ${parameter name} in the job. |
Edit Parameter Expression |
Click |
Modify |
Change the parameter name or value in the corresponding text boxes. |
Mask |
If the parameter value is a key, click |
Delete |
Click |
Constant Parameter |
|
Add |
Click Add and enter the constant parameter name and parameter value in the text boxes.
After the parameter is configured, it is referenced in the format of ${parameter name} in the job. |
Edit Parameter Expression |
Click |
Modify |
Modify the parameter name and parameter value in text boxes and save the modifications. |
Delete |
Click |
Workspace Environment Variables |
|
View the variables and constants that have been configured in the workspace. |
|
Click the Parameter Preview tab and configure the parameters listed in Table 5.
Function |
Description |
|---|---|
Current Time |
This parameter is displayed only when Scheduling Type is set to Run once. The default value is the current time. |
Event Triggering Time |
This parameter is displayed only when Scheduling Type is set to Event-based. The default value is the time when an event is triggered. |
Scheduling Period |
This parameter is displayed only when Scheduling Type is set to Run periodically. The default value is the scheduling period. |
Start Time |
This parameter is displayed only when Scheduling Type is set to Run periodically. The value is the configured job execution time. |
Start Time |
This parameter is displayed only when Scheduling Type is set to Run periodically. The value is the time when the periodic job scheduling starts. |
Subsequent Instances |
Number of job instances scheduled.
|
In Parameter Preview, if a job parameter has a syntax error, the system displays a message.
If a parameter depends on the data generated during job execution, such data cannot be simulated and displayed in Parameter Preview.
After configuring the job, perform the following operations:
A maximum of 1,000 records can be displayed in the execution result. The size of the execution result cannot exceed 3 MB. If the size exceeds 3 MB, the result will be truncated.
to save the job configuration.After the job is saved, a version is automatically generated and displayed in Versions. The version can be rolled back. If you save a job multiple times within a minute, only one version is recorded. If the intermediate data is important, you can click Save new version to save and add a version.
When developing a real-time processing, single-task Flink SQL job, you can use a public script template. For details about how to create a template, see Configuring a Template. For details about how to use a script template, see Using Script Templates and Parameter Templates.