Our environment consists of two sites, each running ESX4.0u1 (Enterprise), each attached to a FC SAN (separated by a small WAN link), and each site running production VMs. In addition, each site also has an off-SAN Linux backup JBOD server as a backup target.
Code: Select all
SiteA
VM-A-1 (prod)
VM-A-2 (prod)
VM-A-3
SiteB
VM-B-1 (prod)
VM-B-2
Goal: Backup each facility "locally" to allow for "fast" restoration of data; also replicate "prod" VMs between facilities to allow for failover in case of complete site failure. For example (using the above-mentioned Sites & VMs): if VM-A-1 became corrupt (or a file were inadvertently removed), we would restore the given VM/file from site-local backups (and not incur any WAN-overhead on retrieving the missing data). On the other hand, suppose SiteB were to completely fail: we'd recover to SiteA by failing-over the replicant VM's. In that case, we should see:
Code: Select all
SiteA
VM-A-1 (prod)
VM-A-2 (prod)
VM-A-3
VM-B-1 (prod-replica)
Conversely, if SiteA were to fail completely, we would have:
Code: Select all
SiteB
VM-B-1 (prod)
VM-B-2
VM-A-1 (prod-replica)
VM-A-2 (prod-replica)
In short, our goals are to:
- a) leverage vStorage API for backups, to help decrease backup window & overall load
b) provide site-local backup storage to allow for local restoration due to corruption/human error
c) provide site-to-site redundancy via replication
Could you please comment on the following approaches and let me know whether I understand the Veeam product correctly:
- 1) Single Veeam B&R server (located at SiteA - the "big" site) this server would:
- a) Do "local" backups via SAN mode (or Virtual Appliance)
b) Perform replication to SiteB
c) Perform backup of SiteB, leveraging network agents within ESX service console and Linux backup target server.
d) requires local SQL database to be replicated regularly to SiteB
e) SiteA failure requires buildout of single Veeam B&R in SiteB, followed by DB restore (and subsequent VM recovery)
2) Two Veeam B&R servers, one at each Site
- a) Do "local" backups via SAN mode (or Virtual Appliance)
b) Perform replication to target Site ( B goes to A, A goes to B )
c) replicate local SQL databases for each Veeam B&R to each site (bi-directional replication for each Veeam B&R)
d) SiteA failure follows option 1 above
e) SiteB failure follows option 1 above
f) Either site failure results in two Veeam B&R servers in a single site
I suspect that either approach would work (please do help me understand if I'm wrong in thinking this); I'm really trying to design a sensible approach to our environment with this product, and want to know the benefits and drawbacks to each approach. Using the above two options (surely there are more - can you comment?), I would suspect that option 1 is preferable from a "simplistic" standpoint, but option 2 is preferred from a performance standpoint (since it doesn't require network agents).