adc_1997
Contributor
Contributor

@jmacdaddy - we have the same issue as you, also seeing one VM on 99%.  All the users on the same GPU get a black screen.  Although so far, resetting the problem VM doesn't allow the other users to reconnect - and they won't vmotion off, we have to restart them.  The hosts have 3 graphics cards, and we have to vmotion the users off the other two cards - then reboot the host to get it functional again.  Very disruptive all in all!

We do have a case logged with Nvidia - and they tell us its a known issue, which is currently with engineering as top priority.  Although they can't give info on what exactly causes the issue, or if there is a way to mitigate or minimise it.   Nor an estimation of when there may be a fix.  Everytime there is a failure we are collecting the logs and adding them to the case.

Also we moved from 1B to 2B profiles for everyone about a month ago.  Unfortunately we are still experiencing the 99% issue - although possibly not quite so regularly.  We don't actually do anything GPU intensive, no tools like AutoDesk - and are considering reverting to software based graphics until this issue is actually resolved.

Reply
0 Kudos