I have a Server 2008R2 Hyper-V cluster with 2 nodes. They use a CSV on a SAN. I use SCVMM to manage them. We recently had several crashes that caused a failover, making virtual machines die and start up on the other node. For the most part, this worked fine. At one point during a power failure, both nodes were unable to access the SAN for a moment, to the CSV went offline. Bringing it online in Failover Cluster Manager worked, and most of the virtual machines started just fine.
One virtual machine however will not start.
Besides the error above, there don't seem to be any recent relevant logs in the failover cluster or windows event logs on either node. There are some in Critical events I can see in failover cluster manager from when the failures happened last week:
I've looked at files on the SAN. All of them appear to be intact. The XML configuration file seems to be valid (some research showed this could happen if the XML file got corrupted).
Edit: I have also run the cluster validation report. Besides the failed resource and some expected errors that it couldn't test the disks while they are online, everything looks fine.
How do I go about getting this virtual machine running again?
Despite not knowing exactly what caused the problem, it was pretty easy to get the VM running again:
When I stopped the cluster service, the VM was immediately taken over by one of the remaining nodes and started up automatically.
User contributions licensed under CC BY-SA 3.0