-
- Novice
- Posts: 6
- Liked: 1 time
- Joined: Dec 16, 2021 8:21 am
- Full Name: Iok
- Contact:
Checkpoint removal process info
Since I upgraded to Veeam B&R 12.2.0.334, I have noticed that an additional background retention process starts every night. It is called "Checkpoint Removal". It starts at 3:00am and runs continuously for 24 hours.
It seems related to object storage repositories but I could not find info about it in the official documentation.
Can someone explain how it works, what its purpose is and point me to the documentation?
Thanks
-
- Veeam Software
- Posts: 2174
- Liked: 521 times
- Joined: Jun 28, 2016 12:12 pm
- Contact:
Re: Checkpoint removal process info
Hi ifi-deda,
Sure, please check our "What's New" document here for 12.2: https://www.veeam.com/kb4600
Basically, as it says, checkpoint removal on Object Storage is now detached from the jobs themselves, as this removal can be fairly busy work for the S3 storage. Checkpoints are an internal mechanism that Object Storage Repositories use to manage data blocks; checkpoint removal deletes the data that is no longer needed per retention. Previously, the removal might occur while other backup operations were using the Object Storage repository, which could slow down all operations and often resulted in jobs "hanging" at 99% when in reality they were busy with checkpoint removal. Now the jobs run without having to compete for resources with checkpoint removal.
From the document:
"Background checkpoint removal: checkpoint removal operation has been decoupled from the backup job and will now run as a system session after backup jobs and offload processes finish and there are no conflicting tasks (checkpoint removal has the lowest priority for the task scheduler). In addition, this process is triggered daily at 3 a.m. to perform the same activity. This change will remove additional load from object storage during the time when it is already extremely busy accepting incoming backups, and also prevent backup jobs from appearing to “hang” at the end while processing checkpoints removal."
David Domask | Product Management: Principal Analyst
-
- Service Provider
- Posts: 34
- Liked: 3 times
- Joined: Apr 16, 2023 6:16 pm
- Contact:
Re: Checkpoint removal process info
Is there a reg key to change the start time? Most of our copy jobs to object storage run at this time.
-
- Veeam Software
- Posts: 2174
- Liked: 521 times
- Joined: Jun 28, 2016 12:12 pm
- Contact:
Re: Checkpoint removal process info
Hi ReKe,
The registry values are:
CheckpointRemovalJobStartTimeHours
CheckpointRemovalJobStartTimeMins
Location: HKEY_LOCAL_MACHINE\SOFTWARE\Veeam\Veeam Backup and Replication
Type: DWORD
Values: Integer (default 3 and 0 respectively for Hours/Mins)
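As a rough illustration of how the two values above could be applied, here is a minimal Python sketch that only builds the `reg add` command lines (it does not touch the registry). The 0-23 and 0-59 range checks and the helper name `build_reg_commands` are assumptions for illustration; the thread does not state the accepted ranges or whether a service restart is needed afterwards.

```python
# Sketch only: build `reg add` command lines for the two values named above.
# Nothing is written to the registry; the output is meant to be reviewed
# and then run in an elevated prompt on the backup server.

KEY_PATH = r"HKLM\SOFTWARE\Veeam\Veeam Backup and Replication"


def build_reg_commands(hours: int, minutes: int) -> list[str]:
    """Return the two `reg add` commands for the given start time."""
    # Assumed ranges: a 24-hour clock. The thread only gives the defaults (3 and 0).
    if not 0 <= hours <= 23:
        raise ValueError("hours must be in 0..23")
    if not 0 <= minutes <= 59:
        raise ValueError("minutes must be in 0..59")

    values = {
        "CheckpointRemovalJobStartTimeHours": hours,
        "CheckpointRemovalJobStartTimeMins": minutes,
    }
    return [
        f'reg add "{KEY_PATH}" /v {name} /t REG_DWORD /d {data} /f'
        for name, data in values.items()
    ]


if __name__ == "__main__":
    for command in build_reg_commands(22, 30):
        print(command)
```

Generating the commands rather than writing the values directly keeps the sketch runnable anywhere and leaves the actual change as a deliberate manual step.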
David Domask | Product Management: Principal Analyst
-
- Service Provider
- Posts: 34
- Liked: 3 times
- Joined: Apr 16, 2023 6:16 pm
- Contact:
Re: Checkpoint removal process info
Hi David,
Thanks for the reply. Will try it.
Greetings, Rene
-
- Influencer
- Posts: 11
- Liked: never
- Joined: Mar 27, 2024 12:37 pm
- Full Name: Martin McDonnell
- Contact:
Re: Checkpoint removal process info
We upgraded to v12.2 over the weekend and I just received an email. Subject: Failed to remove a checkpoint during a background job on <vbr server name>.
What actions need to be taken?
-
- Veeam Software
- Posts: 304
- Liked: 146 times
- Joined: Jul 24, 2018 8:38 pm
- Full Name: Stephen Firmes
- Contact:
Re: Checkpoint removal process info
mpm@aramark, you should contact Veeam Support and open a support case so that they can help you troubleshoot the issue.
Steve Firmes | Senior Solutions Architect, Product Management - Alliances @ Veeam Software
-
- Lurker
- Posts: 2
- Liked: never
- Joined: Aug 30, 2021 1:17 pm
- Full Name: Dominic Shoemaker
- Contact:
Re: Checkpoint removal process info
We are experiencing a similar trouble. I have opened a support case with Veeam.
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
david.domask wrote: ↑Sep 12, 2024 8:09 am
In addition, this process is triggered daily at 3 a.m. to perform the same activity.
BTW some text got lost here during PDF conversion, this was meant to say "to perform the same activity for backups which are no longer associated with any jobs, as well as VeeamZIP and exported backups".
-
- Veeam Software
- Posts: 304
- Liked: 146 times
- Joined: Jul 24, 2018 8:38 pm
- Full Name: Stephen Firmes
- Contact:
Re: Checkpoint removal process info
DShoemaker wrote: ↑Sep 18, 2024 12:38 pm
We are experiencing a similar trouble. I have opened a support case with Veeam.
Please post or PM me your support case and I will look into it.
Thanks
Steve Firmes | Senior Solutions Architect, Product Management - Alliances @ Veeam Software
-
- Enthusiast
- Posts: 67
- Liked: 6 times
- Joined: Aug 07, 2015 8:45 pm
- Full Name: David Engler
- Contact:
Re: Checkpoint removal process info
Is there a reg key to stop checkpoint removal from starting at all, or to limit how many hours it runs?
I need to temporarily stop it and/or limit the hours it runs. When offloads and checkpoint removal run at the same time, both processes suffer severe slowness. If each is allowed to run by itself, it processes a lot faster, since the S3 appliance is not trying to manage both random reads and random writes at the same time.
Dave
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
If you were to limit the run time or disable this completely, your storage appliance would eventually overfill. So instead it's better to change the start time for after the offload ends.
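To make "start after the offload ends" concrete, here is a small Python sketch that turns an estimated offload end time into the hours and minutes one might put into the CheckpointRemovalJobStartTimeHours and CheckpointRemovalJobStartTimeMins values mentioned earlier in this topic. The 30-minute safety buffer and the function name are arbitrary assumptions, not Veeam guidance.

```python
from datetime import datetime, timedelta


def removal_start_after_offload(offload_end: str, buffer_minutes: int = 30) -> tuple[int, int]:
    """Return (hours, minutes) for a start time shortly after the offload usually ends.

    offload_end is a 24-hour "HH:MM" string, for example taken from job history.
    buffer_minutes is an assumed safety margin, not a documented recommendation.
    """
    end = datetime.strptime(offload_end, "%H:%M")
    start = end + timedelta(minutes=buffer_minutes)
    # Only the time of day matters, so a roll-over past midnight is fine.
    return start.hour, start.minute


if __name__ == "__main__":
    hours, minutes = removal_start_after_offload("06:15")
    print(f"Hours={hours}, Mins={minutes}")
```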
-
- Service Provider
- Posts: 11
- Liked: 1 time
- Joined: Feb 22, 2024 1:37 pm
- Contact:
Re: Checkpoint removal process info
Thanks for the registry key!
We have two clients with the same errors. One fails every day, the other more randomly.
Case #02833866 ongoing.
Support asked me to gather logs from Object Storage side (Wasabi)
I am waiting for the error to appear again.
-
- Enthusiast
- Posts: 67
- Liked: 6 times
- Joined: Aug 07, 2015 8:45 pm
- Full Name: David Engler
- Contact:
Re: Checkpoint removal process info
I agree that disabling or limiting the hours is not a good way to manage the backups. I would not intend to leave the system in this configuration. In my current situation I've been told when both offloads and deletes occur at the same time both suffer severe performance issues and if I could let them run on their own both would run significantly faster. I'm not convinced that would be the case but I'm seeing that my offloads are falling further and further behind each day. In some severe cases data is expiring faster than I can get the next offload completed for large VMs.
My short-term thought process would be to allow offloads to run while disabling deletions and monitor space on the appliance. When the space gets to a point where I have to run deletes, I would temporarily stop offloads to let them process. It's not a solution I would accept long term; I'm trying to find a way, any way, to get the system caught up. I'm the equivalent of 6+ days of backups behind, with some systems fully caught up and others without a single offload on the capacity tier.
Dave
-
- Service Provider
- Posts: 101
- Liked: 10 times
- Joined: Aug 09, 2017 7:31 am
- Full Name: Tomer Schwaitzer
- Contact:
Re: Checkpoint removal process info
Hi,
I am in the same position. After upgrading to the latest hotfix, the checkpoint removal job keeps running 24/7. The load on the object storage is huge: the request rate reaches 2000 to 3000 for the objectPutRetention requests alone, which are the checkpoint removal as far as I understand. This keeps the backups from running; everything has been hanging for a week now.
I have an open case, number 07430504, but I don't think the support guys understand the urgency of the issue; they are calling it "performance degradation" instead of a full service outage.
Nothing works. We don't have backups. We have something like 500 servers waiting for a week now. There is nothing we can do from our side.
We are checking other vendors at the moment as a backup; maybe we will just switch everything if we do not have answers in the next hours.
We are Veeam Cloud Connect providers and our service is degraded because of that. It is sad that we can't get the service we need from your support in this bad situation.
Last edited by david.domask on Sep 24, 2024 8:10 am, edited 1 time in total.
Reason: Edited to correct case number
-
- Veeam Software
- Posts: 2174
- Liked: 521 times
- Joined: Jun 28, 2016 12:12 pm
- Contact:
Re: Checkpoint removal process info
Hi Tomer,
Thank you for your post and for sharing the case number, and sorry to hear about the challenges.
I checked the case with Support Management: it has been prioritized given the impact of the issue, and Support should be reaching out soon.
David Domask | Product Management: Principal Analyst
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
Please note that the vast majority of deletes should still happen around the same time as before, right after the backup jobs finish. As per my earlier comment above, the checkpoint removal process that starts at night (3am by default) is only there to take care of a few "special" backup types. So unless you're a heavy user of VeeamZIP or Export functionality, or have tons of abandoned backups (detached from their original jobs and just left to slowly expire), I would not expect much activity happening after 3am in the first place.
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
tomers@y-tech.net wrote: ↑Sep 24, 2024 4:48 am
The load on the object storage is huge, it reaches 2000 and 3000 request rate only for the objectPutRetention requests
Then this is not caused by the Checkpoint removal process discussed in this topic... putObjectRetention is from extending immutability on existing objects.
-
- Veeam Legend
- Posts: 411
- Liked: 232 times
- Joined: Apr 11, 2023 1:18 pm
- Full Name: Tyler Jurgens
- Contact:
Re: Checkpoint removal process info
Is PutObjectRetention also used to set the initial retention policy for immutable backups as well, not just extending immutability on existing objects? I've seen this heavily used for first time backups as well.
Tyler Jurgens
Veeam Legend x3 | vExpert ** | VMCE | VCP 2020 | Tanzu Vanguard | VUG Canada Leader | VMUG Calgary Leader
Blog: https://explosive.cloud
Twitter: @Tyler_Jurgens BlueSky: @explosive.cloud
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
Yes, of course. Immutability is set when a new unique object first appears, and it is extended when existing objects are being reused in a newly created restore point.
The checkpoint removal operation discussed here, by contrast, works with metadata that represents a restore point.
-
- Veeam Legend
- Posts: 411
- Liked: 232 times
- Joined: Apr 11, 2023 1:18 pm
- Full Name: Tyler Jurgens
- Contact:
Re: Checkpoint removal process info
@tomer - This may not help in the situation you are in now, but we've avoided most of those problems by increasing the block size on jobs targeting our S3. Larger blocks mean fewer objects to set the retention policy on when they land on our Object Storage arrays. The net benefit is shorter backup times, a shorter time frame for all the PutObjectRetention processing, and a shorter time frame for removing older restore points. However, it does come at the cost of larger incremental backups.
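A back-of-the-envelope Python sketch of why block size matters for object count. It assumes one object per block and ignores compression, deduplication and metadata objects, so the numbers are only illustrative; the block sizes listed are example values, not a statement about which sizes a given job offers.

```python
# Rough estimate: one object per block, ignoring compression and metadata.

MIB = 1024 ** 2
TIB = 1024 ** 4


def estimated_object_count(source_bytes: int, block_bytes: int) -> int:
    """Ceiling division: how many blocks are needed to cover the source data."""
    return -(-source_bytes // block_bytes)


if __name__ == "__main__":
    source = 1 * TIB  # hypothetical amount of data in one full backup
    for block_mib in (1, 4, 8):  # example block sizes in MiB
        count = estimated_object_count(source, block_mib * MIB)
        print(f"{block_mib} MiB blocks -> about {count:,} objects")
```

Every per-object request (setting or extending retention, deleting) scales with that count, which is the effect described above; the trade-off of larger incrementals is not modeled here.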
Checkpoint removal is a thing for us as well. One specific customer removed the backup copy jobs and replaced them with backup jobs, since he could not alter block size on backup copy jobs directly. This resulted in backups that are left behind to be cleaned up with the checkpoint removal process. The concern was not around backup size for him, but rather backups completing within the allotted time - larger objects helped immensely. Once those older backups finally age out we should be in a great spot moving forward.
Tyler Jurgens
Veeam Legend x3 | vExpert ** | VMCE | VCP 2020 | Tanzu Vanguard | VUG Canada Leader | VMUG Calgary Leader
Blog: https://explosive.cloud
Twitter: @Tyler_Jurgens BlueSky: @explosive.cloud
-
- Product Manager
- Posts: 20443
- Liked: 2310 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: Checkpoint removal process info
tyler.jurgens wrote:
Is PutObjectRetention also used to set the initial retention policy for immutable backups as well, not just extending immutability on existing objects? I've seen this heavily used for first time backups as well.
Gostev wrote:
Yes, of course. Immutability is set when a new unique object first appears, and it is extended when existing objects are being reused in a newly created restore point.
That is not entirely correct. When creating an object, we do not use the "putobjectretention" call. It is only used for extending immutability. In the case of object creation, we utilize the "putobject" request, where we can immediately set the desired lock period.
So Tyler's case looks a bit strange, and I believe our support team is already investigating it.
Thanks!
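For readers less familiar with the two S3 calls, here is a hedged Python sketch of the difference in request shape. It only builds the parameter dictionaries, using the parameter names of boto3's `put_object` and `put_object_retention`; it is not Veeam's implementation, and the COMPLIANCE mode, the helper names and the day counts are assumptions for illustration.

```python
from datetime import datetime, timedelta, timezone


def new_object_request(bucket, key, body, lock_days, now=None):
    """Parameters for creating an object with its lock period set in the same request."""
    now = now or datetime.now(timezone.utc)
    return {
        "Bucket": bucket,
        "Key": key,
        "Body": body,
        "ObjectLockMode": "COMPLIANCE",  # assumed mode for the example
        "ObjectLockRetainUntilDate": now + timedelta(days=lock_days),
    }


def extend_lock_request(bucket, key, lock_days, now=None):
    """Parameters for extending the lock on an object that already exists."""
    now = now or datetime.now(timezone.utc)
    return {
        "Bucket": bucket,
        "Key": key,
        "Retention": {
            "Mode": "COMPLIANCE",  # assumed mode for the example
            "RetainUntilDate": now + timedelta(days=lock_days),
        },
    }


# With boto3 installed and credentials configured, these would be used as:
#   s3 = boto3.client("s3")
#   s3.put_object(**new_object_request("my-bucket", "blocks/0001", b"...", 30))
#   s3.put_object_retention(**extend_lock_request("my-bucket", "blocks/0001", 30))
```

The point of the sketch is that a new object needs one request, while each reused object needs a separate retention request, which is why the latter can dominate request counts on repositories with many existing objects.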
-
- Veeam ProPartner
- Posts: 573
- Liked: 106 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: Checkpoint removal process info
I've also started to see many random “Failed to remove a checkpoint” errors from different customers after the 12.2 upgrade.
We use Azure as Object Repository.
Marco
-
- Product Manager
- Posts: 20443
- Liked: 2310 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: Checkpoint removal process info
Hi, Marco,
Do you happen to have a support ticket created with the debug logs attached, so we can ask the R&D team to take a look at it?
Thanks!
-
- Veeam ProPartner
- Posts: 573
- Liked: 106 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: Checkpoint removal process info
Sure, I need a couple of days, now I'm out of office
Marco
-
- Veeam ProPartner
- Posts: 573
- Liked: 106 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: Checkpoint removal process info
Case #07445010
I'm uploading logs from different customers
Marco
-
- Product Manager
- Posts: 20443
- Liked: 2310 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: Checkpoint removal process info
Thank you, Marco, for opening the case with us and providing the logs. I have passed the ticket on to the R&D team. If necessary, they will join the investigation process.
-
- Veeam ProPartner
- Posts: 573
- Liked: 106 times
- Joined: Dec 29, 2009 12:48 pm
- Full Name: Marco Novelli
- Location: Asti - Italy
- Contact:
Re: Checkpoint removal process info
Hi guys, the support engineer suggested that I choose a different time for background checkpoint removal, to avoid conflict with the backup copy to Object Storage.
My 2 cents
1) time schedule is only by reg key
2) I'm expecting from Veeam "It just works"
I'm raising a feature request to make "background checkpoint removal" more intelligent, so that it detects when the copy job runs and schedules itself accordingly, or to go back to the previous implementation in v12.1, or some more magic.
Thanks,
Marco
-
- Chief Product Officer
- Posts: 31840
- Liked: 7331 times
- Joined: Jan 01, 2006 1:01 am
- Location: Baar, Switzerland
- Contact:
Re: Checkpoint removal process info
At the time set by the reg key, only minimal activity should happen in any case, as only a handful of backups are processed then at most. The bulk of the work still happens after the backup jobs, so it is not much different from the "previous implementation in 12.1", with the exception of waiting for all backup jobs to finish running.
So I would first double-check whether you have many backups of the types I listed in my post (which is more likely since you're primarily dealing with SMB customers), and if not, question the support engineer's suggestion, as in principle there is nothing for Veeam to do at the time of the reg key.
@veremin this does raise the question: when did processing of retention for VeeamZIP/Exported/Abandoned backups start in 12.1? I assume it had to start at some specific time even in 12.1.
-
- Product Manager
- Posts: 20443
- Liked: 2310 times
- Joined: Oct 26, 2012 3:28 pm
- Full Name: Vladimir Eremin
- Contact:
Re: Checkpoint removal process info
Upon reviewing the provided debug logs, the R&D team identified errors related to an SSL certificate retrieval failure on the VeeamAgent side. There were no other errors or warnings in the log bundle. The issue does not appear to be linked to the background checkpoint removal job, but rather to potential network connectivity or infrastructure problems. Additionally, multiple HTTP retries were observed in the VeeamAgent communication with Azure, supporting this theory.
Therefore, it seems that the support engineer's suggestions regarding the root cause of the issue and potential remedial actions are incorrect.
We recommend sharing this information with the engineer and considering escalating the ticket if necessary.
I will double-check the information with the QA team and will get back later once I have additional details. Thanks!