I have no time atm to open a SR and send logs etc my priority was getting production VMs back online. I post this here just for information and maybe save some time to people like me.
I'm using HP blades with latest SPP and vmware 5.1 with latest patches and veeam-one v9. I don't know if this can happen on other versions.
I think it's a very RARE bug, I have no idea what could have caused this. Please refer to my post (sharantyr3) here : https://communities.vmware.com/message/2577347#2577346
The vmotion of veeam-one VM to another host allowed ESXi to remove the faulty locks on the VM files (validated).
So just to tell you, if by any chance you get struck VMs and see entries like this in vmkwarning.log :
and entries like this in vmkernel.log :cpu16:8252)WARNING: Swap: vm 9930238: 4780: Failed to unlink /vmfs/volumes/55533920-9ee23892-fbf5-2c768aae2b42/maria-db-01/vmx-maria-db-01-794194787-1.vswp: Maximum kernel-level retries exceeded
Simply try to vmotion your veeam-one server to another ESXi. After the vmotion, I could successfully remove the old VM file and got this in the ESXi logs :2016-02-23T12:53:51.161Z cpu4:9937646)DLX: 4230: vol 'BJ-ISN-06', lock at 43160576: [Req mode: 1] Not free:
2016-02-23T12:53:51.161Z cpu4:9937646)[type 10c00005 offset 43160576 v 78794, hb offset 3624960
gen 1087, mode 1, owner 56b5cf5d-f208f93a-4755-3c4a926c279c mtime 171709
num 0 gblnum 0 gblgen 0 gblbrk 0]
2016-02-23T12:53:51.161Z cpu4:9937646)Res3: 5732: Rank violation threshold reached: cid 0xc1d00002, resType 4, cnum 1
2016-02-23T12:53:53.760Z cpu8:9937646)DLX: 3706: vol 'BJ-ISN-06', lock at 43160576: [Req mode 1] Checking liveness:
2016-02-23T12:53:53.760Z cpu8:9937646)[type 10c00005 offset 43160576 v 78794, hb offset 3624960
gen 1087, mode 1, owner 56b5cf5d-f208f93a-4755-3c4a926c279c mtime 171709
num 0 gblnum 0 gblgen 0 gblbrk 0]
2016-02-23T13:09:43.259Z cpu26:5686842)DLX: 3706: vol 'BJ-ISN-06', lock at 43160576: [Req mode 1] Checking liveness:
2016-02-23T13:09:43.259Z cpu26:5686842)[type 10c00005 offset 43160576 v 78794, hb offset 3624960
gen 1087, mode 1, owner 56b5cf5d-f208f93a-4755-3c4a926c279c mtime 171709
num 0 gblnum 0 gblgen 0 gblbrk 0]
2016-02-23T13:09:43.260Z cpu26:5686842)DLX: 3321: Clearing wrong owner for lock at 43160576 with [HB state abcdef01 offset 3624960 gen 1088 stampUS 1480954430581 uuid 00000000-00000000-0000-000000000000 jrnl <FB 0> drv 14.58]