Background:
I have a Production datacentre operated through ISP 1 and a smaller DR datacentre operated through ISP 2. I also have a large Storage server from ISP 1, in a separate location from my Production datacentre, that I'm using for long-term GFS archives. My only true downtime for the Production datacentre is across the weekend (we run 24x5), so I need to avoid impacting it during the week any more than is necessary. We have around 15-20 VM Guests in Production, of which around 10 are replication candidates.
In the unlikely event that ISP 1 disappears completely we can cope with losing our archives. It's not great, but it's bearable. I don't have any off-line storage and providing some is not a viable option in our situation. My protection is that the Storage server is read-only for everyone/everything except Veeam B&R and our administrative ssh logins (it's based on Proxmox and Linux, rather than vSphere and Windows), and the Veeam archives themselves have periodic snapshots taken from within the target filesystem.
Requirement:
My target replication frequency from Production to DR is every two hours. I have been advised that to avoid hitting our production site with multiple I/O requests for the same sets of Guest changes I could take an off-site backup every two hours and use that to generate the replicas.
Disk space permitting, I'd like to keep at least five days of backups in my DR centre, but since I have replicas I don't think I need more than one or two backups every day. However, I don't see how I can achieve this while using these backups to update replicas every two hours.
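To put rough numbers on that mismatch, here is a minimal sketch in plain Python. It is nothing Veeam-specific, and `restore_points` is just a name made up for illustration. It assumes backups run around the clock for the whole five-day window and counts restore points per VM only, not disk space.

```python
def restore_points(days: int, interval_hours: int) -> int:
    """Restore points kept per VM for a retention window (illustrative helper)."""
    return days * 24 // interval_hours

# What the two-hourly schedule would keep over five days.
two_hourly = restore_points(5, 2)    # 60

# What I think I actually need: one or two backups a day over five days.
twice_daily = restore_points(5, 12)  # 10
once_daily = restore_points(5, 24)   # 5

print(two_hourly, twice_daily, once_daily)
```

So a single two-hourly chain kept for five days would hold roughly six to twelve times as many restore points per VM as I actually want.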
Questions:
- Is it better to run one repository containing five days of backups at two-hourly intervals, or two repositories on the same server, one holding a day of two-hourly backups and the other holding four days of 12-hourly backups?
- If I generated a few backups on-site, stored on a Veeam Proxy, would it be better (more efficient) to copy those off-site, or just to connect directly to the Production VMware from the DR datacentre as if the on-site backups didn't exist? Both approaches would require two I/O hits per VM guest disk, wouldn't they?
- What is the difference between Veeam Proxy running on Windows and the Veeam "data mover" utility that runs on demand on a Linux-based storage server?
- I understand that good practice is to group multiple VMs per backup/replica job. What recommendations are there with regard to choosing which VMs to group together in a single job?
Chris