Hi. So far I've always had a lot of trouble fitting our backups to our main repository. Our remote repository uses ReFS which works fine on there, but our main site still uses NTFS. We are a small company and budget isn't 'just' available. I now have about a 8TB NTFS repository with dedupe enabled, storing about 28TB of data. We dedupe everything older than 0 days, so we dedupe everything. Last week sh#t hit the fan for one VM, so I had to restore it. It was just a 30GB VM, but restoring took about 30 minutes. Bottleneck is the storage. The repository is on a 16 x 1TB 7200rpm SAS disk array, on its own dedicated 8Gb FibreChannel SAN (with that I mean it's not on the production SAN). The rest of the storage on that disk array is just archive storage. 99% of all IO on that SAN is coming from Veeams repository. Of course, this is somewhat to be expected. Dedupe overtime makes for a huge fragmentation of big files, so restoring a VM will take much more IO than when stored without dedupe.
I've had it with the crappy solutions, which takes me a lot of time which also tranlates into money. So next wednesday I'm installing a new disk shelf on the backup-SAN with 12x 2TB 7200rpm. That one will be solely used for Veeam. That means that space is not an issue anymore as long as I do some type of dedupe - be that actual NTFS dedupe or ReFS block-cloning (I know technically that's no dedupe but the result is more or less the same). Now I'm not sure what to format the repository with.
If I use NTFS with dedupe, and set it to dedupe only everything older than let's say a week, my restores should be quick. We do one active full a week, which would mean even in the last day of the backup chain, I'd still be able to restore a VM from non-deduped ie pretty much sequential data. If I need to restore something quickly, that mostly means I need to restore the last backed up status. If I need older data that's always been less urgent. So I think NTFS with dedupe > 7 days is a good option here.
If I use ReFS, I don't have that choice. I see the benefit of ReFS in it preventing redundant data in the first place, however, that means over time all full backup files will get so fragmented that restore times will get longer and longer again, just as my current issue is.
The main reason for finally ordering new storage is that restoring that 30GB VM took about 30 minutes which is unacceptable slow. I can't imagine how long it takes to restore a 2TB VM, even if we use instant recovery. So what do you think? should I go ReFS or NTFS?