Robuea
Enthusiast

Thanks FDenneman01, that all makes sense. I have a background in AIX/Solaris fair-share scheduling, so I understand most of what you're saying. Scheduling is all about fairness, not about making sure everything will work - if it's oversubscribed, it will still break regardless. As you say, the art is in understanding and predicting behavior under contention and its effect on all workloads.

If I understand correctly, then, the 'worst case' values are calculated against cluster-wide capacity, but the actual running values depend on the relative values of the competing workloads within a DRS group of compute. All we are really doing within a resource pool is weighting the siblings against each other in the scheduler (which is why the hierarchy is so important).
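
To check my own understanding of the sibling weighting, here is a rough Python sketch. It is only a toy model of proportional shares, not how DRS or the ESXi scheduler actually computes entitlements: it ignores reservations, limits and real demand, and assumes every workload wants everything it can get. The function name, pool names, share values and the 10000 MHz capacity are all made up for illustration.

```python
def divide_by_shares(capacity, children):
    """Split capacity among siblings in proportion to their shares.

    children maps a name to (shares, subtree); subtree is None for a VM
    or another dict of the same shape for a nested resource pool.
    Hypothetical toy model: no reservations, limits or demand.
    """
    total_shares = sum(shares for shares, _ in children.values())
    result = {}
    for name, (shares, subtree) in children.items():
        # A sibling's portion depends only on its weight relative to its siblings.
        portion = capacity * shares / total_shares
        if subtree:
            # A pool passes its own portion down to its children.
            nested = divide_by_shares(portion, subtree)
            for child_name, child_portion in nested.items():
                result[f"{name}/{child_name}"] = child_portion
        else:
            result[name] = portion
    return result


# Made-up hierarchy: two pools under the cluster root, VMs inside each.
pool = {
    "prod": (8000, {"vm1": (2000, None), "vm2": (1000, None)}),
    "test": (2000, {"vm3": (1000, None)}),
}

entitlements = divide_by_shares(10000, pool)  # 10000 MHz is an assumed capacity
for path, mhz in entitlements.items():
    print(f"{path}: {mhz:.0f} MHz")
```

In this toy example vm2 and vm3 both have 1000 shares, but they end up with different worst-case amounts because they compete with different siblings under different parents, which is the hierarchy point above.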

I may well get this cluster deep dive book if it covers all this content.