We had an incident a week ago where Veeam deleted 5 IBM SVC production datastores with 126 VMs running instead of their snapshot volumes. I don't know exactly how SVC storage snaps work but I know that it is different than NetApp. In any case I would never expect this to happen. I remember that there was a similar case right after the V11 release (we are on latest V11 CU).
This description frightens me a lot! Maybe I don't understand this correct but it sounds like there are commands executed on SVC side but no real check about the result or status is happening (no real error handling). The described solution in V12 also doesn't sound that more error handling was added, just the way a ssh connection is used. There was maintenance on storage side that weekend which might have been the reason for the timeouts, but still I'd expect that Veeam is able to take care of the production datastores. This all leaves me wondering how risky this storage integration really is.It looks like there was a timeout for one of our ssh commands sent to the storage device for a reason that is still under investigation. This led to retry of the command that got a response for a previous request. Since that moment each next ssh request we did got reply for a previous one effectively creating a queue of responses. This led to incorrect IDs being used in the requests they do not belong.