Intermittent Hardware Status Change Unknown

Real-time performance monitoring and troubleshooting

Intermittent Hardware Status Change Unknown

Veeam Logoby barebones » Thu Jul 07, 2011 8:02 pm

Hi Guys
Started testing Veeam monitor and only using it to monitor hardware changes. It lists all the HP Proliant hardware in the hardware tab and when I do something like pull the power on a PSU the status changes and alerts us by email which is great. BUT every few days we get the following email stating that all hardware has entered an UNKNOWN state as if the monitor has lost connection with the ESX host. Any ideas why this would be?

Thanks in Advance

Code: Select all
Target: 192.168.20.11
Status: Warning (Yellow)
Alarm: Host hardware sensor status changed
Time: 6/30/2011 6:12:46 PM
Sensor "Memory" equal Unknown.
Sensor "Proc 1" equal Unknown.
Sensor "Proc 1 Level-1 Cache is 131072 B" equal Unknown.
Sensor "Proc 1 Level-2 Cache is 1048576 B" equal Unknown.
Sensor "Proc 1 Level-3 Cache is 8388608 B" equal Unknown.
Sensor "Proc 2" equal Unknown.
Sensor "Proc 2 Level-1 Cache is 131072 B" equal Unknown.
Sensor "Proc 2 Level-2 Cache is 1048576 B" equal Unknown.
Sensor "Proc 2 Level-3 Cache is 8388608 B" equal Unknown.
Sensor "HP Smart Array Controller HPSA1 Firmware 2.00 " equal Unknown.
Sensor "HP System BIOS P62 2009-10-01 00:00:00.000" equal Unknown.
Sensor "Hewlett-Packard BMC Firmware (node 0) 46:10000 1.78 " equal Unknown.
Sensor "VMware, Inc. VMware ESXi 4.0.0 build-360236 2011-02-07 00:00:00.000" equal Unknown.
Sensor "VMware, Inc. VMware ESXi Alternate Boot Bank 4.0.0 build-208167 " equal Unknown.
Sensor "bnx2 device firmware 4.6.4 " equal Unknown.
Sensor "bnx2 device firmware 4.6.4 NCSI 1.0.3 " equal Unknown.
Sensor "bnx2 driver 1.6.9 " equal Unknown.
Sensor "Battery on HPSA1" equal Unknown.
Sensor "Disk 1 on HPSA1 : Port 1I Box 1 Bay 1 : 136GB : Data Disk" equal Unknown.
Sensor "Disk 2 on HPSA1 : Port 1I Box 1 Bay 2 : 136GB : Data Disk" equal Unknown.
Sensor "Disk 3 on HPSA1 : Port 1I Box 1 Bay 3 : 136GB : Data Disk" equal Unknown.
Sensor "Disk 4 on HPSA1 : Port 1I Box 1 Bay 4 : 136GB : Data Disk" equal Unknown.
Sensor "Disk 5 on HPSA1 : Port 2I Box 1 Bay 5 : 136GB : Data Disk" equal Unknown.
Sensor "Disk 6 on HPSA1 : Port 2I Box 1 Bay 6 : 279GB : Unconfigured Disk" equal Unknown.
Sensor "Disk 7 on HPSA1 : Port 2I Box 1 Bay 7 : 279GB : Unconfigured Disk" equal Unknown.
Sensor "Disk 8 on HPSA1 : Port 2I Box 1 Bay 8 : 279GB : Unconfigured Disk" equal Unknown.
Sensor "HP Smart Array P410i Controller : HPSA1" equal Unknown.
Sensor "Logical Volume 1 on HPSA1 : RAID 5 : 546GB : Disk 1,2,3,4,5" equal Unknown.
Sensor "System Board 1 Fan 1 - Transition to Running" equal Unknown.
Sensor "System Board 2 Fan 2 - Transition to Running" equal Unknown.
Sensor "System Board 3 Fan 3 - Transition to Running" equal Unknown.
Sensor "System Board 4 Fan 4 - Transition to Running" equal Unknown.
Sensor "System Board 5 Fan 5 - Transition to Running" equal Unknown.
Sensor "System Board 6 Fan 6 - Transition to Running" equal Unknown.
Sensor "Power Supply 1 Power Supply 1: Failure status - Deassert" equal Unknown.
Sensor "Power Supply 1: Running/Full Power-Enabled" equal Unknown.
Sensor "System Board 10 Power Meter - Device enabled" equal Unknown.
Sensor "VMware Rollup Health State" equal Unknown.
Sensor "External Environment 1 Temp 1 - Normal" equal Unknown.
Sensor "Memory Module 1 Temp 4 - Normal" equal Unknown.
Sensor "Memory Module 2 Temp 5 - Normal" equal Unknown.
Sensor "Memory Module 3 Temp 6 - Normal" equal Unknown.
Sensor "Memory Module 4 Temp 7 - Normal" equal Unknown.
Sensor "Memory Module 5 Temp 24 - Normal" equal Unknown.
Sensor "Memory Module 6 Temp 25 - Normal" equal Unknown.
Sensor "Memory Module 7 Temp 26 - Normal" equal Unknown.
Sensor "Power Domain 1 Temp 8 - Normal" equal Unknown.
Sensor "Power Domain 2 Temp 9 - Normal" equal Unknown.
Sensor "Processor 1 Temp 2 - Normal" equal Unknown.
Sensor "Processor 2 Temp 3 - Normal" equal Unknown.
Sensor "Processor 3 Temp 19 - Normal" equal Unknown.
Sensor "Processor 4 Temp 20 - Normal" equal Unknown.
Sensor "Processor 5 Temp 21 - Normal" equal Unknown.
Sensor "Processor 6 Temp 22 - Normal" equal Unknown.
Sensor "System Board 8 Temp 29 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 1 Temp 10 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 10 Temp 23 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 11 Temp 27 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 13 Temp 30 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 2 Temp 11 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 3 Temp 12 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 4 Temp 13 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 5 Temp 14 - Normal" equal Unknown.
Sensor "System Internal Expansion Board 6 Temp 15 - Normal" equal Unknown.
Sensor "Power Supply 2 Power Supply 2: Failure status - Deassert" equal Unknown.
Sensor "Power Supply 2: Running/Full Power-Enabled" equal Unknown.
Sensor "System Board 7 Fans - unknown" equal Unknown.
Sensor "Power Supply 3 Power Supplies - unknown" equal Unknown
barebones
Novice
 
Posts: 3
Liked: never
Joined: Thu Jul 07, 2011 7:55 pm

Re: Intermittent Hardware Status Change Unknown

Veeam Logoby barebones » Thu Jul 07, 2011 8:48 pm

In addition. After a few minutes it rectifies itself.
barebones
Novice
 
Posts: 3
Liked: never
Joined: Thu Jul 07, 2011 7:55 pm

Re: Intermittent Hardware Status Change Unknown

Veeam Logoby Vitaliy S. » Thu Jul 07, 2011 8:49 pm

The only reason why it can happen is status change for the particular sensor in VMware Managed Object Browser. You can use the instructions from this thread to check up sensor's status with VMware. Sensor status should be the same, in all other cases please do contact our support team for further troubleshooting.
Vitaliy S.
Veeam Software
 
Posts: 19558
Liked: 1102 times
Joined: Mon Mar 30, 2009 9:13 am
Full Name: Vitaliy Safarov

Re: Intermittent Hardware Status Change Unknown

Veeam Logoby barebones » Fri Jul 08, 2011 5:53 am

Ok I have navigated to that section and it is as follows

[35] HostNumericSensorInfo Name Type Value
baseUnits string ""
currentReading long 0
dynamicProperty DynamicProperty[] Unset
dynamicType string Unset
healthState ElementDescription Name Type Value
dynamicProperty DynamicProperty[] Unset
dynamicType string Unset
key string "green"
label string "Green"
summary string "Sensor is operating under normal conditions"

name string "VMware Rollup Health State"
rateUnits string Unset
sensorType string "system"
unitModifier int 0


What am I supposed to be looking for or changing?
barebones
Novice
 
Posts: 3
Liked: never
Joined: Thu Jul 07, 2011 7:55 pm

Re: Intermittent Hardware Status Change Unknown

Veeam Logoby Vitaliy S. » Fri Jul 08, 2011 10:01 am

You should compare the "key string "green" value with one you have in Veeam Monitor for the corresponding hardware sensor. We use this status to raise alarms on sensors state, if you have a different picture in Monitor and MOB, reach our support team for help.
Vitaliy S.
Veeam Software
 
Posts: 19558
Liked: 1102 times
Joined: Mon Mar 30, 2009 9:13 am
Full Name: Vitaliy Safarov


Return to Monitoring



Who is online

Users browsing this forum: No registered users and 3 guests