
I have 3 esxi hosts in a cluster with about 40 VMs to backup.
The storage is located on a nimble array.
i am using veeam backup and rep 9.5 update 4.
the backup machine runs in its own VM on its own separate ESXI host.
All backups are set to "virtual appliance" mode on the backup proxy which is on the same veeam machine.
The backups are saved to a buffalo terrastation that seems to not be the bottleneck, but i am not sure.
All bottlenecks are 80-99% source.
an example job, file server backup. Processed 7.4 TB, read 48.7GB, transfered 13.1GB (3.7x), Processing Rate 41 MB/s, Duration 4:16:49, Speed 144.3 MB/S, Busy: Source 73% > Proxy 43% > Network 36% > Target 19% (Previously before i started changing things, this job was taking 2 hours, which i thought was bad before.). I seem to recall like 6 months ago, these jobs would all have no problem firing off in less than 30 minutes.
Things I have done this week to try and make times better.
- increased CPU available to veeam from 4 to 8 cores.
- increased ram available to veeam from 40gb to 50gb
- Adjusted backup proxy to be able to do 8 concurrent tasks
- adjusted backup repository concurrent tasks from 4 to 2
- played around with the timings of backups. They were all starting around the same time, staggered by an hour. I thought maybe they were stacking ontop of eachother and not being efficient, so i changed the timings so that different jobs would run every few hours all night.
When i look at the throughput graph, the backups seem to run fine, taking about 30 minutes for the example fileserver job above. But the runtime on the job is 4 hours... so what the heck is it doing for all that time? when i look at the details, under "Action" i see that its spending 1h 44m at the "required backup infrastructure resources have been assigned" step. and 47:19 at the "index guest file system" step. All other steps are not taking very long at all. But those numbers don't add up to 4h16m anyways...
its very confusing to me. Does anyone have any idea what i am doing wrong, or what directions i should be looking in? i originally thought it was my backup arrays performance, but now i am not so sure.