Yes, keep your clusters separated by GPU model. The profiles applied are specific to that GPU.
You cannot control which host the instant clones get provisioned on and DRS/vMotion would also cause issues.
Keep in mind that vMotion is not supported until vSphere 6.7u2 - it is an awesome feature for maintenance purposes.