Help me understand datastore latency alarms

Post by **kevdpc** » Nov 07, 2024 4:23 pm

Lately I've been getting alarms for datastore read and write latency and I'm having trouble understanding what I'm seeing.

For example I have an alarm email at 8:50 PM with details stating: "Disk/Datastore: Datastore Read Latency" (102.0 Milliseconds) is above a defined threshold (100.0 Milliseconds)

Here are the 'Datastore read latency' alarm rules (pretty sure they're the default).

Here is the 'Datastore Read Latency' performance graph for 'Last day.' You can see that there is a spike up to 7 ms at about 8:50 PM.

So why does the email alarm say 102.0 ms but the performance graph only shows 7 ms?

Nov 08, 2024 10:44 am

Hello Kevin,

As far as I remember, the alarm checks the max_value metric, while the performance graphs are built from several aggregated current_value metrics.
Because of that, the alarm is the more precise of the two: you really did hit 102 ms around 8:50:00, whereas the graph took the values sampled at 8:49:10, 8:49:30...8:50:10 and plotted a single averaged value for that point.
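
To make the difference concrete, here is a minimal Python sketch of how a single spike can trip a max-based alarm while barely moving an averaged graph point. This is not the product's actual code; the sample values and the number of samples per graph point are assumptions chosen so that the result matches the 102 ms and 7 ms figures from this thread.

```python
from statistics import mean

# Hypothetical latency samples (ms) that get rolled up into ONE graph point.
# Only the 102 ms peak and the 100 ms threshold come from the alarm email;
# the baseline of 2 ms and the count of 20 samples are made up for illustration.
samples_ms = [2.0] * 19 + [102.0]
threshold_ms = 100.0

# The alarm looks at the maximum value seen in the period.
alarm_value = max(samples_ms)

# The graph plots one averaged value for the same set of samples.
graph_point = mean(samples_ms)

print(f"alarm sees {alarm_value} ms, graph shows {graph_point} ms")
print("alarm triggered:", alarm_value > threshold_ms)
```

With these assumed samples the maximum is above the threshold while the average stays in single digits, which is the same kind of gap as between the email and the 'Last day' graph.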

Thanks

Post by **kevdpc** » Nov 12, 2024 3:35 pm

Okay that's helpful, thank you.

If it's checking the max value then what's the purpose of the 15 minute time period?

Post by **kevdpc** » Nov 12, 2024 4:38 pm

Also if it's checking the max value then why is the field called Aggregation?

Post by **RomanK** » Nov 13, 2024 2:35 pm

Hello Kevin,

Aggregation is just a general label in the alarm rule. The rule can apply a min, max or avg function to the data set, because the same kind of rule may be used in multiple alarms with different customization.
So the alarm rule collects the performance data for the time period and selects the min, max or avg among all the values in that period. Then it checks the thresholds and starts again. In practice, 15 minutes is enough to prevent an alarm storm.
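
As a rough illustration of the logic described above, the following Python sketch aggregates one time period of samples with a selectable function and compares the result against a threshold. It is only a sketch under assumptions: `evaluate_rule`, `AGGREGATIONS` and the sample values are hypothetical names and data, not the product's real implementation.

```python
from statistics import mean

# Hypothetical mapping of the rule's "Aggregation" setting to a function.
AGGREGATIONS = {"min": min, "max": max, "avg": mean}


def evaluate_rule(samples_ms, aggregation, threshold_ms):
    """Aggregate one time period of samples and compare with the threshold.

    Returns a tuple: (aggregated value, True if the threshold is exceeded).
    """
    value = AGGREGATIONS[aggregation](samples_ms)
    return value, value > threshold_ms


# Made-up samples for one 15-minute period: a quiet baseline plus one spike.
period_samples_ms = [2.0] * 19 + [102.0]

for name in ("min", "max", "avg"):
    value, exceeded = evaluate_rule(period_samples_ms, name, threshold_ms=100.0)
    print(f"{name}: {value} ms, threshold exceeded: {exceeded}")
```

Because each period yields a single aggregated value, the rule is evaluated once per period rather than once per sample, which fits the point about avoiding an alarm storm.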

Thanks

Post by **kevdpc** » Nov 14, 2024 3:45 pm

Okay I understand now, thank you.
