In daily big data analysis work, it is important to allocate and manage compute resources properly to provide a good job execution environment.
You can allocate resources and adjust task execution order based on the job's compute needs and data scale, and schedule different elastic resource pools or queues to adapt to different workloads. To ensure normal job execution, the CUs required for the submitted job should be less than or equal to the remaining available CUs in the elastic resource pool.
This section describes how to view the usage of compute resources in an elastic resource pool and the required CUs for a job.
Locate the target resource pool in the list and check its Actual CUs and Used CUs.
To ensure normal job execution, the CUs required for the submitted job should be less than or equal to the remaining available CUs in the elastic resource pool.
For details about the number of CUs required by different types of jobs, see Checking the Required CUs for a Job.
Use the monitoring dashboard provided by Cloud Eye to check the number of running and submitted jobs, and use the job count to determine the overall resource usage of SQL jobs.
You can set the number of CUs on the job editing page using the following formula: CUs = Job Manager CUs + (Parallelism/Slots per TM) x CUs per TM.
Check the compute resource specifications configured for the job.
The formula is as follows:
Number of CUs of a Spark job = Number of CUs used by executors + Number of CUs used by the driver
Number of CUs used by executors = max {[(Executors x Executor Memory)/4], (Executors x Executor Cores)} x 1
Number of CUs used by the driver = max [(Driver Memory/4), Driver Cores] x 1