Vitaliy S. wrote:Chris,
No exactly, you should receive email notification on the breached threshold for other jobs too, however only one combined alarm will be visible in the backup console, it is by design. If you need to review all jobs that has exceeded their max duration, please double-click on the alarms history to review all jobs. Anyway, I will verify the behavior you describe with our QC team.
The major use case for this alarm is to track individual job duration. I believe you would agree that jobs duration can vary a lot depending on the number of VMs being processed, thus it's hard to set a single threshold for all jobs, that is why you should be using separate alarms for different jobs.
BTW - how many jobs you have in your environment?
Thanks!
Vitaliy,
I have seen the history of the job shows which VM's are triggering the alarm. But as before once the first job triggers the alarm no further jobs that exceed the specified threshold can trigger that alarm until you resolve it - therefore the history does not show which jobs have breached AFTER the first alarm is triggered.
I have 34 jobs configured in house (mostly indivual VM's, but some jobs with multiple). Generally none of the jobs run for much more than an hour in my case so I wanted to set a global threshold for this alarm of approx 2 - 4 hours so that I will be made aware of any issues that would impact on the performance of any of the backup/replication jobs.
If the jobs suddenly start taking more than 2 hours, I know I have an issue somewhere!
This week we had a problem with our core switches which had a massive impact on Veeam's performance. I was seeing very low throughput (approx. 100Kb/s instead of 100+Mb/s) but because the jobs weren't failing I wasn't aware for several days until I logged on to the Veeam B&R server by chance!
I am looking to be able to monitor this performance in house but also for some of our customers who already have Veeam B&R but aren't using Veeam One. Some of the customers who I will be monitoring this for will have many more backup jobs than I am currently testing with here.
Hopefully that will help you understand where I am coming from
Thanks
Chris