@jmacdaddy - we have the same issue as you, also seeing one VM on 99%. All the users on the same GPU get a black screen. Although so far, resetting the problem VM doesn't allow the other users to reconnect - and they won't vmotion off, we have to restart them. The hosts have 3 graphics cards, and we have to vmotion the users off the other two cards - then reboot the host to get it functional again. Very disruptive all in all!
We do have a case logged with Nvidia - and they tell us its a known issue, which is currently with engineering as top priority. Although they can't give info on what exactly causes the issue, or if there is a way to mitigate or minimise it. Nor an estimation of when there may be a fix. Everytime there is a failure we are collecting the logs and adding them to the case.
Also we moved from 1B to 2B profiles for everyone about a month ago. Unfortunately we are still experiencing the 99% issue - although possibly not quite so regularly. We don't actually do anything GPU intensive, no tools like AutoDesk - and are considering reverting to software based graphics until this issue is actually resolved.