-
- Enthusiast
- Posts: 84
- Liked: 8 times
- Joined: Jul 04, 2012 6:32 am
- Full Name: Tobias Elfstrom
- Contact:
parallel processing
I might as well ask this here too
Are there any options related to the new parallel processing or is that just default on if a new job is created in v7 ?
BR Tobias
Are there any options related to the new parallel processing or is that just default on if a new job is created in v7 ?
BR Tobias
-
- Product Manager
- Posts: 20413
- Liked: 2302 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: parallel processing
By default (after the upgrade), the parallel processing is turned off. So, in order to enable it go to the menu -> options -> advanced -> enable parallel VM and virtual disk processing.
Thanks.
Thanks.
-
- Enthusiast
- Posts: 84
- Liked: 8 times
- Joined: Jul 04, 2012 6:32 am
- Full Name: Tobias Elfstrom
- Contact:
Re: parallel processing
So there is no per job options for this but rather for one whole instance of Veeam Backup & Replication server then?
-
- Veeam Software
- Posts: 21139
- Liked: 2141 times
- Joined: Jul 11, 2011 10:22 am
- Full Name: Alexander Fogelson
- Contact:
Re: parallel processing
Yes, this setting will take effect for all backup and replication jobs on the Veeam B&R server.
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
I could not see any value of making it disableable on a job level.
-
- Novice
- Posts: 4
- Liked: never
- Joined: Jan 18, 2013 2:30 pm
- Full Name: Woody
- Contact:
Re: parallel processing
I'm surprised there's no options with this too so can only assume there's loads of intelligence built in. Is there a deep dive explanation anywhere?
Parallel disk backup sounds like a big win and will help with backup related IO issues on heavy VMs. However, backing up multiple heavy VMs in parallel will have the opposite affect. How is B&R managing this decision making?
If there was an option to enable parallel disk processing but disable parallel VM processing I'd be ticking it but there may be way more going on than I know.
Parallel disk backup sounds like a big win and will help with backup related IO issues on heavy VMs. However, backing up multiple heavy VMs in parallel will have the opposite affect. How is B&R managing this decision making?
If there was an option to enable parallel disk processing but disable parallel VM processing I'd be ticking it but there may be way more going on than I know.
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
I/O issues can be controlled by the number of concurrent tasks on backup repositories, this capability has been in the product since version 6.0. The whole issue is not particularly new, as you could also run into these issues with concurrent jobs processing multiple heavy VMs at the same time.
Parallel VM processing is only leveraged when you have enough task slots to process all virtual disks of multiple VMs in parallel, so you can easily control the load using standard means. For example, if you set the max task slots on your backup repository to 2, and most of your VMs have two virtual disks, this will effectively disable parallel VM processing completely (except if the job gets hits two single-disk VMs), but 2 virtual disks of the same VM will still be processed in parallel.
Parallel VM processing is only leveraged when you have enough task slots to process all virtual disks of multiple VMs in parallel, so you can easily control the load using standard means. For example, if you set the max task slots on your backup repository to 2, and most of your VMs have two virtual disks, this will effectively disable parallel VM processing completely (except if the job gets hits two single-disk VMs), but 2 virtual disks of the same VM will still be processed in parallel.
-
- Service Provider
- Posts: 77
- Liked: 15 times
- Joined: Jun 03, 2009 7:45 am
- Full Name: Lars O Helle
- Contact:
Re: parallel processing
I would like to suggest the possibility to limit max parallel tasks per job. 8 parallel tasks would not be a problem on a SSD-SAN, but it could really hurt VM performance on a small SATA-HDD SAN/array. Remember that the same proxy could be used by many jobs, so then it could be necessary to create a separate proxy for the lowend storage and limit tasks there.
A very nice feature would be to be able to limit parallel tasks on a per datastore-cluster/folder basis in the proxy-configuration.
A very nice feature would be to be able to limit parallel tasks on a per datastore-cluster/folder basis in the proxy-configuration.
-
- Service Provider
- Posts: 77
- Liked: 15 times
- Joined: Jun 03, 2009 7:45 am
- Full Name: Lars O Helle
- Contact:
Re: parallel processing
Some feedback from a customer I upgraded to v7.
Optimal compression.
Backup:
Job HP - 36 VMs - 3 TB data
v6.5 - 3 hours
v7 - 45 minutes
Job Exchange - 20 VMs - 1,5 TB data
v6.5 - 100 min
v7 - 25 minutes
Replication:
Job Replica HP - 36 VMs - 3 TB data
v6.5 - 4 hours
v7 - 70 minutes
Splitting into more jobs on v6.5 would help, but it would also mean more work/monitoring.
The backup-times would most likely improve even more if data was backed up to a local repository. The backups are sent off-site and the replicas are sent to a local spare SAN.
Optimal compression.
Backup:
Job HP - 36 VMs - 3 TB data
v6.5 - 3 hours
v7 - 45 minutes
Job Exchange - 20 VMs - 1,5 TB data
v6.5 - 100 min
v7 - 25 minutes
Replication:
Job Replica HP - 36 VMs - 3 TB data
v6.5 - 4 hours
v7 - 70 minutes
Splitting into more jobs on v6.5 would help, but it would also mean more work/monitoring.
The backup-times would most likely improve even more if data was backed up to a local repository. The backups are sent off-site and the replicas are sent to a local spare SAN.
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
The max parallel tasks per job is implicitly controlled by max parallel tasks setting on the backup repository used by the given job (and real backup repository throughput).lohelle wrote:I would like to suggest the possibility to limit max parallel tasks per job.
I don't expect your backup repository to be more powerful than your production storage, so if backup storage can keep up with set amount of concurrent tasks, the production storage certainly will too and if backup repository cannot keep up, the engine will be just waiting for it most of the time to write the data, not reading anything from the production storage.
I do have some ideas around controlling load on production and backup storage more intelligently for future versions though.
-
- Service Provider
- Posts: 77
- Liked: 15 times
- Joined: Jun 03, 2009 7:45 am
- Full Name: Lars O Helle
- Contact:
Re: parallel processing
The backup repository might usually be slower, but live VMs usually do not run from this storage (even if YOU made it possible!).
Maybe a "number of parallel tasks" selection in the job setup where you select source proxy would be possible in a coming release..?
VERY VERY nice job on v7 btw. Some more vendors supported with SAN integration would be nice. We have mostly "small" customers, so we could like Starwind integration.
Maybe a "number of parallel tasks" selection in the job setup where you select source proxy would be possible in a coming release..?
VERY VERY nice job on v7 btw. Some more vendors supported with SAN integration would be nice. We have mostly "small" customers, so we could like Starwind integration.
-
- VP, Product Management
- Posts: 7081
- Liked: 1511 times
- Joined: May 04, 2011 8:36 am
- Full Name: Andreas Neufert
- Location: Germany
- Contact:
Re: parallel processing
You can do this today by using only proxies with the needed proxy-slot count for the jobs with this heavy load.lohelle wrote:The backup repository might usually be slower, but live VMs usually do not run from this storage (even if YOU made it possible!).
Maybe a "number of parallel tasks" selection in the job setup where you select source proxy would be possible in a coming release..?
-
- Lurker
- Posts: 1
- Liked: never
- Joined: Sep 09, 2011 4:29 pm
- Full Name: Eddie Ortega
- Contact:
Re: parallel processing
Per JOB would be nice. Currently I had a situation where it processed both of my Exchange Mailbox Servers at the same time causing them to kind of freak out on who should Mount the databases.
What would be the best way to stop this from happening besides creating 2 different jobs.
What would be the best way to stop this from happening besides creating 2 different jobs.
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
I would just reorder them within the job to spread out. This is possible even when you configure job with containers.
Say, you currently have your job configured by adding Container 1 (containing both Exchange servers).
In that case, just add both Exchange VMs explicitly as well, but order them like this in the list:
Exchange1
Container1
Exchange2
This should work.
Say, you currently have your job configured by adding Container 1 (containing both Exchange servers).
In that case, just add both Exchange VMs explicitly as well, but order them like this in the list:
Exchange1
Container1
Exchange2
This should work.
-
- VP, Product Management
- Posts: 7081
- Liked: 1511 times
- Joined: May 04, 2011 8:36 am
- Full Name: Andreas Neufert
- Location: Germany
- Contact:
Re: parallel processing
Scary parallel processing. 2 DAG member because of high Dedup (and both because of Surebackup) in same job.eortega wrote:Per JOB would be nice. Currently I had a situation where it processed both of my Exchange Mailbox Servers at the same time causing them to kind of freak out on who should Mount the databases.
What would be the best way to stop this from happening besides creating 2 different jobs.
-
- Product Manager
- Posts: 20413
- Liked: 2302 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: parallel processing
Also, if you want to process VMs within certain job sequentially, it might be worth creating a dedicated repository (it might even a new folder on already existing repository) for this job and set 1 as the number of backup tasks this repository can handle. This way, any job that uses this repository will have its concurrency limited.eortega wrote:Per JOB would be nice.
Thanks.
-
- Service Provider
- Posts: 81
- Liked: 14 times
- Joined: Mar 10, 2010 7:50 pm
- Full Name: Mark Hodges
- Contact:
Re: parallel processing
I just hit this same thing without realizing it...both CAS Servers and a mailbox server in a job....
they really need a way to disable this per job...so I don't have to create separate jobs just to ensure both sides of an exchange node don't try to backup at the same time.
they really need a way to disable this per job...so I don't have to create separate jobs just to ensure both sides of an exchange node don't try to backup at the same time.
-
- Product Manager
- Posts: 20413
- Liked: 2302 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: parallel processing
As mentioned above, for now the best option will be to limit the number of concurrent tasks the job proxy/repository can handle. It should guarantee that VMs will be processed sequentially.
Anyway, thanks for the feedback; much appreciated.
Anyway, thanks for the feedback; much appreciated.
-
- Veeam ProPartner
- Posts: 566
- Liked: 103 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: parallel processing
Hi, I've been using for some time the new parallel processing option for different customers, and my feeling is that with modern hardware on the backup server the backup process can slow down too much Virtual Machines / datastores / Hosts
I'm asking for a new option that we can call "intelligent parallel processing" with a behavior similar to this:
- check how many task can run at the same time on the proxy server. For instance: 2
- start processing the first VM. If the VM has a single disk, start processing the next VM with a disk NOT located on the same datastore as the first VM. If the backup mode is NBD start processing the next VM with a disk NOT located on the same datastore as the first VM AND not located on the same host as the first VM
- if the first VM has two disk, start processing only the first disk AND start processing another VM with previous logic.
The main logic behind that is that Veeam should backup only one disk per datastore and only one VM per host if the backup mode is NBD
This logic will process the VM slightly out of order but seems to me a better tradeoff than actual parallel processing
Comments are welcome
Marco
I'm asking for a new option that we can call "intelligent parallel processing" with a behavior similar to this:
- check how many task can run at the same time on the proxy server. For instance: 2
- start processing the first VM. If the VM has a single disk, start processing the next VM with a disk NOT located on the same datastore as the first VM. If the backup mode is NBD start processing the next VM with a disk NOT located on the same datastore as the first VM AND not located on the same host as the first VM
- if the first VM has two disk, start processing only the first disk AND start processing another VM with previous logic.
The main logic behind that is that Veeam should backup only one disk per datastore and only one VM per host if the backup mode is NBD
This logic will process the VM slightly out of order but seems to me a better tradeoff than actual parallel processing
Comments are welcome
Marco
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
Hi Marco, the main reason why customers asked for parallel processing is that backing up a single disk per datastore almost never saturates typical storage. It was reported that for most arrays require at least 3-4 threads to do this. So, your proposed logic will harm the performance.
I also disagree with processing only one disk at a time for the VM with two disks is a good idea - in fact, the most positive v7 feedback comes from parallel processing of multiple disks of the same VM in parallel (which dramatically reduces time VM runs off the snapshot).
The only times I see backup process slowing down the infrastructure too much is when customer oversubscribe their infrastructure by creating too many powerful backup proxies, which enables tons of VMs to be processed in parallel. In most cases, after upgrading to v7 customers can easily remove half of their proxy server slots, and still get the same or better performance than with previous versions.
And there is a really easy fix. Do not oversubscribe
Now, I am not saying here that the current version is already super smart. There is always room for improvement, and we will certainly have further improvement in the "intelligent" part of our automated load balancing, but they will not be inline with what you are proposing here.
Thanks!
I also disagree with processing only one disk at a time for the VM with two disks is a good idea - in fact, the most positive v7 feedback comes from parallel processing of multiple disks of the same VM in parallel (which dramatically reduces time VM runs off the snapshot).
The only times I see backup process slowing down the infrastructure too much is when customer oversubscribe their infrastructure by creating too many powerful backup proxies, which enables tons of VMs to be processed in parallel. In most cases, after upgrading to v7 customers can easily remove half of their proxy server slots, and still get the same or better performance than with previous versions.
And there is a really easy fix. Do not oversubscribe
Now, I am not saying here that the current version is already super smart. There is always room for improvement, and we will certainly have further improvement in the "intelligent" part of our automated load balancing, but they will not be inline with what you are proposing here.
Thanks!
-
- Veeam ProPartner
- Posts: 566
- Liked: 103 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: parallel processing
mmm so probably we should differentiate "parallel processing for Enterprise" from "parallel processing for SMB"
Unfortunately I work mostly with singles VMware servers or little infrastructures with 2 - 3 hosts and a shared iSCSI SAN Dell PowerVault MD3xxx series and modern Veeam Backup server with local proxy can really slow down quite a bit those SMB infrastructures
Marco
Unfortunately I work mostly with singles VMware servers or little infrastructures with 2 - 3 hosts and a shared iSCSI SAN Dell PowerVault MD3xxx series and modern Veeam Backup server with local proxy can really slow down quite a bit those SMB infrastructures
Marco
-
- Chief Product Officer
- Posts: 31814
- Liked: 7302 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: parallel processing
Limit Veeam proxy to two concurrent tasks, and put ingest rate throttling on the backup repository if backup is still too fast (this is actually a good problem to have!), and the backup will not slow the infrastructure down. While those few dozens of VMs from 2-3 hosts with a still backup in no time.
Who is online
Users browsing this forum: Google [Bot], Semrush [Bot] and 57 guests