-
- Influencer
- Posts: 23
- Liked: 6 times
- Joined: Mar 22, 2021 11:18 pm
- Contact:
Need a method for timing out a backup copy job or killing it in the console
We need a method for having backup copy jobs timeout (especially WAN accelerator jobs) and also a method for killing stuck backup copy jobs from within the console. If you have a hung job that is just sitting there waiting for resources, setting it to disable will accomplish nothing. The job will now say stopping, but will never stop.
The main issue with this is that if this is a WAN accelerator job, then other jobs back up behind it. So in this scenario, it would be great to have a timeout period for a job that is waiting for something. In addition, we need a method for killing the job from within the console. I am aware of going to the relevant repository servers and killing off processes within task manager, but this is a kludge.
I am also aware that just killing a copy job might result in the file it was working on being corrupted. However a better method to detect this corruption and allow it to be copied again is the correct solution. I would rather have one corrupted incremental that needs me to intervene on, rather than 100's of servers in other jobs not replicating offsite due to them being backed up behind a stuck WAN accelerator job.
The main issue with this is that if this is a WAN accelerator job, then other jobs back up behind it. So in this scenario, it would be great to have a timeout period for a job that is waiting for something. In addition, we need a method for killing the job from within the console. I am aware of going to the relevant repository servers and killing off processes within task manager, but this is a kludge.
I am also aware that just killing a copy job might result in the file it was working on being corrupted. However a better method to detect this corruption and allow it to be copied again is the correct solution. I would rather have one corrupted incremental that needs me to intervene on, rather than 100's of servers in other jobs not replicating offsite due to them being backed up behind a stuck WAN accelerator job.
-
- Veeam Software
- Posts: 3626
- Liked: 608 times
- Joined: Aug 28, 2013 8:23 am
- Full Name: Petr Makarov
- Location: Prague, Czech Republic
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
Hello,
I would try to address the issue with a job which is waiting for resources for some unknown reason, have you had a chance to open a support case?
Thanks!
I would try to address the issue with a job which is waiting for resources for some unknown reason, have you had a chance to open a support case?
Thanks!
-
- Product Manager
- Posts: 2581
- Liked: 708 times
- Joined: Jun 14, 2013 9:30 am
- Full Name: Egor Yakovlev
- Location: Prague, Czech Republic
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
Hi Taroking1,
- if Jobs hang \ stuck in stopping - it's abnormal and needs investigation with support
- if "Disable" a job does not disables it - it's abnormal and needs to be investigated with support
I would strongly advise to avoid task manager massacre on processes and troubleshoot your case in detail with our support engineers.
/Thanks!
- if Jobs hang \ stuck in stopping - it's abnormal and needs investigation with support
- if "Disable" a job does not disables it - it's abnormal and needs to be investigated with support
I would strongly advise to avoid task manager massacre on processes and troubleshoot your case in detail with our support engineers.
/Thanks!
-
- Influencer
- Posts: 23
- Liked: 6 times
- Joined: Mar 22, 2021 11:18 pm
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
I have opened a ticket. More to the point, it looks like the patch for V11 has fixed the issue where the job is just sitting there doing nothing. It is now failing the particular machine it was stuck on and progressing to the next item.
-
- Veeam Software
- Posts: 3626
- Liked: 608 times
- Joined: Aug 28, 2013 8:23 am
- Full Name: Petr Makarov
- Location: Prague, Czech Republic
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
Looks like the only remaining question is why job does not process this specific item which will be failed later. It makes sense to clarify it with our engineers as well.
Thanks!
Thanks!
-
- Influencer
- Posts: 23
- Liked: 6 times
- Joined: Mar 22, 2021 11:18 pm
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
That is a good question Petr. A couple of weeks into troubleshooting with an open ticket and no resolution yet.
Most of the functions within Veeam are excellent, but I would say that my experience with the backup copy has been abysmal. One issue after the next and really lets the product down.
Direct copy is the only method that reliably works but will result in me pushing 100's of GB over our WAN network from remote sites with slow links. This is not actually a viable solution for us.
The WAN accelerator appears to be a hot mess of issues. If I use the low bandwidth, the job will start running nicely and then at random times within the backup copy I get "Global cache synchronization completed" which takes 1.5-1hrs and then away it goes again, but then after another 4 or 5 machines it does it again. Overall backup copy job is taking 18-23 hours. I can do direct copy in about 8 hours and that is even with Network traffic rules limiting my bandwidth.
Use high bandwidth and then you run into the job pausing to Create Fingerprints for all of the machines. Even though there has been no changes to the job from yesterday.
Most of the functions within Veeam are excellent, but I would say that my experience with the backup copy has been abysmal. One issue after the next and really lets the product down.
Direct copy is the only method that reliably works but will result in me pushing 100's of GB over our WAN network from remote sites with slow links. This is not actually a viable solution for us.
The WAN accelerator appears to be a hot mess of issues. If I use the low bandwidth, the job will start running nicely and then at random times within the backup copy I get "Global cache synchronization completed" which takes 1.5-1hrs and then away it goes again, but then after another 4 or 5 machines it does it again. Overall backup copy job is taking 18-23 hours. I can do direct copy in about 8 hours and that is even with Network traffic rules limiting my bandwidth.
Use high bandwidth and then you run into the job pausing to Create Fingerprints for all of the machines. Even though there has been no changes to the job from yesterday.
-
- Veeam Software
- Posts: 3626
- Liked: 608 times
- Joined: Aug 28, 2013 8:23 am
- Full Name: Petr Makarov
- Location: Prague, Czech Republic
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
Hello,
Actually, the main problem is that we know just symptoms but we have no idea what exactly triggers these issues. I would continue working with our support team in order to get an explanation. Looks like the issue is not trivial and requires deep examination of debug logs and performing of various tests. Could you please share a support case ID for our reference?
Thanks!
Actually, the main problem is that we know just symptoms but we have no idea what exactly triggers these issues. I would continue working with our support team in order to get an explanation. Looks like the issue is not trivial and requires deep examination of debug logs and performing of various tests. Could you please share a support case ID for our reference?
Thanks!
-
- Influencer
- Posts: 23
- Liked: 6 times
- Joined: Mar 22, 2021 11:18 pm
- Contact:
-
- Influencer
- Posts: 23
- Liked: 6 times
- Joined: Mar 22, 2021 11:18 pm
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
In case anyone is reading. We got around to a resolution. We were still having issues with WAN accelerator in Low Bandwidth mode so I am not using it until some more V11 patches come out.
What I have done
Put the WAN accelerators into high bandwidth mode
Set the Network Traffic rules to limit my bandwidth to about 50% of the available per WAN link
The job is now running consistently every night with no issues.
The amount of WAN traffic I am generating is more than Low bandwidth mode, but it is still much better than direct copy and it is now rock solid.
What I have done
Put the WAN accelerators into high bandwidth mode
Set the Network Traffic rules to limit my bandwidth to about 50% of the available per WAN link
The job is now running consistently every night with no issues.
The amount of WAN traffic I am generating is more than Low bandwidth mode, but it is still much better than direct copy and it is now rock solid.
-
- Veeam Software
- Posts: 3626
- Liked: 608 times
- Joined: Aug 28, 2013 8:23 am
- Full Name: Petr Makarov
- Location: Prague, Czech Republic
- Contact:
Re: Need a method for timing out a backup copy job or killing it in the console
Hello,
Yes, we read! Your willingness to share the workaround with our Community is highly appreciated!
Thanks!
Yes, we read! Your willingness to share the workaround with our Community is highly appreciated!
Thanks!
Who is online
Users browsing this forum: Semrush [Bot] and 74 guests