Some of these vms contain daily database dumps on a dedicated drive/disk (uncompressed, unencrypted native full database backup/dump to - let's say - z:\dbbackup).
As long as the databases dont't have a lot of daily change, each db-dump is 99.9% identical to the last one (I verified this on Pure FlashArray with 10 daily dumps from two very similar db-systems - SAP prod and test - 25tb skinned down to ~300GB).
How would Object Storage handle this? Most parts of each daily incremental backup will be identical to an existing object/junk from the backup chain in the object store and thus be deduplicated, right?
Much more efficient than what is consumed on a SMB repo, correct? More in-line with a ReFS Repo with scheduled Dedupe and Compression, although not as good/efficient due to sample size being probably 1M/4M according to object/junk size vs few KB block size on ReFS...
Further:
If I am not wrong, two identical vms will not dedupe between/against each other in direct-to-object repos, identical to per-vm chains on traditional repos. Dedupe/compression only takes place inside their respective own chain (or specific object store/collection that represents their respective chain in the case of object storage repos).
Now, If I would collect all db-dumps from "mostly-same-database" vms on a central file-server and backup this vm via snap or in-guest agent (again, forever forward with 30 days chain), then all these db-dumps would end up in the same backup chain (or rather object store) and be deduped against each other inside the respective backup-chain in direct-to-object.
The result being that I would most probably achieve a very high dedupe rate, correct? Much better than on an SMB repo without any means of Host-dedupe/compression. Again, more in-line with a ReFS Repo with scheduled Dedupe and Compression...
Still under the assumption that a) the daily change-rate on the db-dumps is low and b) the collected source-dbs are mostly same-same (like recent sandbox copy of prod, recent q-control-copy of prod, etc...)
Correct?
Is there any difference between performance tier and capacity tier in this regard?
Thanks a lot for any insight
