Concurrency in a job?

#1 VM Backup : Modern Data Protection for VMware vSphere and Microsoft Hyper-V

Concurrency in a job?

Postby Oletho » Fri Sep 17, 2010 4:54 am

Is there a way to have several VMs backed up at the same time without creating multiple jobs?

If dedup is not global but only covering a backup file I would like to have as many servers in a job as possible. But it seems that a job is strictly sequential?

Sorry if this is a dumb question, but I am just starting to test the product and would like to optimize both concurrency and dedup.

Ole Thomsen
Oletho
Enthusiast
 
Posts: 38
Liked: never
Joined: Fri Sep 17, 2010 4:37 am
Full Name: Ole Thomsen

Re: Concurrency in a job?

Postby Alexey D. » Fri Sep 17, 2010 8:02 am

Hello Ole,

For VMs belonging to same job concurrent processing is not possible. And deduplication works only for VMs in bounds of one job.
But if you want parallel processing there might be a balance point though. For example, create one job and place your Windows VMs inside,
and another job, let's say, for Ubuntu Linux VMs, etc. Dedupe is only effective between similar VMs anyway.

It's up to you which scenario to implement, but the general idea is described above. Hope this helps!
Alexey D.
 

Re: Concurrency in a job?

Postby Oletho » Fri Sep 17, 2010 5:42 pm

Thanks Alexey.

Yes, this helps me to confirm that my understanding of the limitations is correct. That's a good start :-)

But the fact that dedup is most efficient across similar servers does not always cope with the real world needs. Example: we have a system that involves Windows servers (2000, 2003 and 2008), Redhat Linux, SQL and Oracle databases. The goal is to have as small a backup window as possible, and with other products like VDR and esXpress i can run all simultaneously in one job to the same dedup store.

Well, guess I just have to think different.

Thanks again,
Ole Thomsen
Oletho
Enthusiast
 
Posts: 38
Liked: never
Joined: Fri Sep 17, 2010 4:37 am
Full Name: Ole Thomsen

Re: Concurrency in a job?

Postby Gostev » Fri Sep 17, 2010 6:35 pm

Hi Ole, I wanted to add that we have a good whitepaper called "Deduplication and VMware Backup Sprawl" going into details of our dedupe approach (per job), and comparing it to a single dedupe store. I recommend that you check it out, it is more technical than marketing (written by our system engineers).

You can download it from here:
http://www.veeam.com/whitepapers.html

By the way, VDR is almost no different from us in terms of end results: while it provides ability to write to a single dedupe store, the dedupe store itself is limited to 2TB in size - so you are going to have to use multiple ones.

Thanks!
Gostev
Veeam Software
 
Posts: 12911
Liked: 309 times
Joined: Sun Jan 01, 2006 1:01 am
Full Name: Anton Gostev

Re: Concurrency in a job?

Postby Oletho » Sat Sep 18, 2010 4:49 am

I'll take a look at the whitepaper, Gostev.

I tested the latest VDR for a couple of months, and while the store limit is 2TB the supported size is only 1TB (500GB for CIFS). Even then it gets unstable the closer one gets to fill the store, and the missing features and weaknesses makes it unusable for all but the smallest environments in my opinion.

Still I wish Veeam had concurrency in each job, this is one thing I really like about esXpress. Is it a design decision or because of the single NTFS file that holds the job data?

Ole Thomsen
Oletho
Enthusiast
 
Posts: 38
Liked: never
Joined: Fri Sep 17, 2010 4:37 am
Full Name: Ole Thomsen

Re: Concurrency in a job?

Postby Gostev » Sat Sep 18, 2010 10:24 am

Hi Ole, it is more of historical design/feature decision.

More detailed answer to the same question:
Why does Veeam do serial VM backups instead of parallel?
Gostev
Veeam Software
 
Posts: 12911
Liked: 309 times
Joined: Sun Jan 01, 2006 1:01 am
Full Name: Anton Gostev

Re: Concurrency in a job?

Postby Bunce » Sat Sep 18, 2010 11:08 am

Ditto here. We have no issue 'flooding' any pipe when running multiple jobs.

As mentioned frequently, having to run multiple jobs removes the dedupe advantage, and increases management and monitoring requirements.

Not being able to order the backups within a job then adds further complication when trying to backup and replicate the same VM in different jobs (to prevent being backup up at the same time) - while still trying to maintain a small backup window.

If its a technical limitation of the backup engine, perhaps Veem could consider a custom backup 'appliance' target that multiple jobs could write to concurrently? (also of the ilk of esxpress). This may also provide the possibility to provide useful reporting statistics from the target appliance (dedup, compression, VM stats, indexing and searching, accurate transfer rates etc - all the good stuff that we can provide in managerial reports)

Just a thought.

Cheers,
A
Bunce
Expert
 
Posts: 221
Liked: 2 times
Joined: Fri Sep 18, 2009 9:56 am
Location: Adelaide, Australia
Full Name: Andrew

Re: Concurrency in a job?

Postby Gostev » Sat Sep 18, 2010 1:10 pm

No, this is not a technical limitation of the backup engine. This is specifically explained in the post referenced in my previous reply.
Gostev
Veeam Software
 
Posts: 12911
Liked: 309 times
Joined: Sun Jan 01, 2006 1:01 am
Full Name: Anton Gostev

Re: Concurrency in a job?

Postby Oletho » Sun Sep 19, 2010 7:56 am

Gostev wrote:Hi Ole, it is more of historical design/feature decision.

More detailed answer to the same question:
Why does Veeam do serial VM backups instead of parallel?


I understand the decision based on those findings, but that is not my experience so far.

We use 10Gbit NFS as primary VM storage (FC for backup store) and I chose to run a 4 vCPU virtual appliance with 4GB RAM on Windows Server 2008 R2.

Each HOTADD backup job uses ~25% CPU (equals 1 vCPU) and gives a processing rate of ~30 MB/s for full backup, higher for CBT.

So I see 2 full jobs using 50% CPU and total processing rate ~60MB/s, 4 full jobs doubles both numbers. I did not add additional vCPUs to see when that flattens :-)

Ole Thomsen
Oletho
Enthusiast
 
Posts: 38
Liked: never
Joined: Fri Sep 17, 2010 4:37 am
Full Name: Ole Thomsen

Re: Concurrency in a job?

Postby tsightler » Sun Sep 19, 2010 3:45 pm

Oletho wrote:
Gostev wrote:Hi Ole, it is more of historical design/feature decision.
More detailed answer to the same question:
Why does Veeam do serial VM backups instead of parallel?

I understand the decision based on those findings, but that is not my experience so far.


It's not my findings either. I've always seen better performance from multiple jobs than from a single job. Even if a single job saturates the SAN or the CPU that doesn't account for all the "down time" in the job between VM's while snapshots are taken and removed. This can be significant on busy systems (our Exchange server regularly takes 45-60 minutes to remove the snapshot, all the while that Veeam job is basically doing nothing). We see significantly better performance from multiple jobs, even on our relatively old servers.

It also feels like the idea of a single job saturating the SAN or CPU is based on running fulls, but we rarely run a full backup. For CBT based incrementals, a single job rarely uses more than 25% of CPU in our environment, and the SAN in nowhere near saturated. In that scenario the overhead of creation and removal of snapshots is almost half of the entire backup time. Having multiple jobs can cut the overall job time nearly in half for these circumstances. We've found that 4 jobs is generally the best setting for performance without completely saturating the SAN.
tsightler
Veeam MVP
 
Posts: 2400
Liked: 402 times
Joined: Fri Jun 05, 2009 12:57 pm
Full Name: Tom Sightler

Re: Concurrency in a job?

Postby MartinSvec » Tue Apr 17, 2012 5:31 pm

Hello, I'd like to reopen this question.

I'm evaluating if Veeam v6 is a suitable backup solution for our vSphere 5 cloud environment and the sequential job processing seems to me like a potential source of troubles. And if I'm right, there is still no option to enable concurrent intra-job processing in v6? Having high number of small VMs with minimal daily CBT changes, I see a lot of time wasted by e.g. waiting for snapshot creations, snapshot removals etc. In this time, storage, CPU and network resources are clearly underutilized and the overall backup window unreasonably increases. Maintaining multiple identical jobs and manually distributing VMs between them just to workaround this behavior doesn't look like an user-friendly idea.

I'm also asking myself if all the great scalability features like backup proxies, storage repositories, automatic load balancing etc. are so great when the level of backup paralellism is primarily limited by the number of (manually created) jobs. With modern 10GE SANs and multi-core backup servers with plenty of RAM and HDDs, I think this deficiency will become more and more obvious.

So, is it true that the job processing is still purely sequential or is there any hidden option that I overlooked? And if it's still not possible, do you have any plans to implement it?

Thank you.

Martin Svec
MartinSvec
Lurker
 
Posts: 2
Liked: 1 time
Joined: Tue Apr 17, 2012 3:10 pm

Re: Concurrency in a job?

Postby Vitaliy S. » Tue Apr 17, 2012 8:00 pm

Hi Martin, there is more recent topic discussing all the pros and cons of this approach, please look it through and continue posting there: Concurrent Tasks
Vitaliy S.
Product Manager
 
Posts: 8132
Liked: 189 times
Joined: Mon Mar 30, 2009 9:13 am
Full Name: Vitaliy Safarov

replication job vms not concurrent

Postby c_manboy » Mon Jul 16, 2012 4:15 am

[merged]

I created a replication job with 7 vms. The source proxy and target proxy are different (yet both are local and the destination is local) and configured with 3 "Max concurrent tasks". So my question is, should the vms be processing concurrently within the same job?
c_manboy
Novice
 
Posts: 4
Liked: never
Joined: Mon Jul 16, 2012 3:36 am

Re: Concurrency in a job?

Postby dellock6 » Mon Jul 16, 2012 1:47 pm

no, all VMs in a single job are processed sequentially. Cuncurrency takes place when you have more than 1 job running at the same time.

Luca.
Luca Dell'Oca
http://www.virtualtothecore.com
@dellock6
vExpert 2011-2012
dellock6
Veeam MVP
 
Posts: 1154
Liked: 175 times
Joined: Sun Jul 26, 2009 3:39 pm
Location: Varese, Italy
Full Name: Luca Dell'Oca


Return to Veeam Backup & Replication



Who is online

Users browsing this forum: Bing [Bot], Exabot [Bot], Google [Bot] and 9 guests