Hi, I noticed an in-efficiency in Batch task scheduling. Suppose I have 14 tasks and each machine runs 4 tasks, Batch would only schedule 12 tasks to run in parallel and would only run the remaining 2 tasks if those 12 tasks have completed.
So in total it would take 2*per_task_time to finish instead of just 1*per_task_time (if it's able to schedule all 14 tasks to run at the same time).
Can the team help me take a look on how to resolve this? Thanks!