Hi All,
Been running a selection of our VMs with High Availability/Fault Tolerance enabled for a couple of months now, without issue.
We backup using Veeam, and so, nightly, before the backup starts, a scheduled task kicks in which removes FT from the protected VMs, so that the Veeam backup job can run successfully. At the end of the job, the post-job processing runs a script which then enables FT again.
Since last week, just one of the VMs has been kicking out an error following the Veeam backup - When we come in in the morning, we are flooded with vSphere alarm emails stating:
Alarm Definition:
([Event alarm expression: vSphere HA cannot reset VM; Status = Red])
The weird thing is, FT is actually re-enabled just fine on the VM, and all appears ok. Yet the emails keep coming through, reporting this status (I know that I can set this behaviour in vSphere, but I want to solve the issue), and the only way to stop them is to disable FT on the VM, reboot the VM and then re-enable FT again.
I can't seem to find any more detailed information on the error.
Anyone got any ideas?
Thanks in advance!
Dave