k00laid wrote: Is the data anonymized in the process of doing the data mining? While I trust Veeam with my data, having VM and job names show up in a pastebin if something silly happens would be awkward.
I don't really understand your question. Perhaps you're thinking of some other type of report? Maybe you can give me an example of the kind of big data report output you have in mind that may require anonymization?
Because at least in our case, it's all about collecting numbers for statistical purposes: for example, the number of ESXi hosts of a certain version that we are considering dropping. And even if I imagine some nonsense report that actually ties to a VM name or job name, for the results to be actionable in the case of big data, it would have to be something like "What percent of customers have a VM named myCriticalVM", and the output of this report would still be a number (or percent) of deployments in the data set. So there's nothing further to anonymize here.
In other words, "anonymization" of big data into numbers is specifically what makes it usable and actionable.
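To make that concrete, here is a minimal sketch in Python of the two kinds of reports mentioned above. Everything in it is an assumption made for illustration: the sample data, the field names (esxi_versions, vm_names) and the function names are hypothetical and do not reflect how the data is actually collected or processed.

```python
from collections import Counter

# Hypothetical sample data: one entry per deployment in the data set.
deployments = [
    {"esxi_versions": ["6.5", "6.7"], "vm_names": ["myCriticalVM", "web01"]},
    {"esxi_versions": ["6.7"], "vm_names": ["db01"]},
    {"esxi_versions": ["5.5", "6.7"], "vm_names": ["myCriticalVM"]},
    {"esxi_versions": ["7.0"], "vm_names": ["app01", "app02"]},
]


def hosts_per_version(deployments):
    """Count ESXi hosts per version across all deployments."""
    counts = Counter()
    for deployment in deployments:
        counts.update(deployment["esxi_versions"])
    return dict(counts)


def percent_with_vm_named(deployments, name):
    """Percent of deployments that have at least one VM with the given name."""
    if not deployments:
        return 0.0
    matches = sum(1 for d in deployments if name in d["vm_names"])
    return 100.0 * matches / len(deployments)


print(hosts_per_version(deployments))
print(percent_with_vm_named(deployments, "myCriticalVM"))
```

Both functions return only counts or percentages. Even the report that filters on a VM name outputs a single number for the whole data set, with nothing that points back to an individual deployment.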