Monitoring Dell Openmanage
Hi,
I'd like to use Serverscheck to monitor a few Dell servers disk arrays, and notify me if the array has an error.
Currently I use the Dell supplied 'Array Manager' but noticed you already support certain Dell SNMP monitors (Tempreature, Fans and Power...)
Is it possible to monitor the status of a Dell disk array?
I've tried to configure my own custom SNMP monitors using MIBs from the Dell site with no luck.
I'm not looking to monitor everything, just if a disk or something fails that would be excellent!
If you have any ideas, or settings you wish me to test on a non-critical server, then let me know.
Thank you
I'd like to use Serverscheck to monitor a few Dell servers disk arrays, and notify me if the array has an error.
Currently I use the Dell supplied 'Array Manager' but noticed you already support certain Dell SNMP monitors (Tempreature, Fans and Power...)
Is it possible to monitor the status of a Dell disk array?
I've tried to configure my own custom SNMP monitors using MIBs from the Dell site with no luck.
I'm not looking to monitor everything, just if a disk or something fails that would be excellent!
If you have any ideas, or settings you wish me to test on a non-critical server, then let me know.
Thank you
This discussion has been closed.
Comments
If you have an OID for a disk that returns a value (for example 0 when OK and 1 when not OK) then it could work.
UKMON if you wanna have a chat about it, and help each other out give me a shout.
Cheers.
Please let me know, which servertype you are using
When I try to put what I think is the correct OID in ServersCheck I get this message:
Status: DOWN?
Error returned: Couldn't retrieve value for OID
systemStageGlobalSystemStatus
.1.3.6.1.4.1.674.10892.1.200.10.1.2.1
you get an integer value back:
(1 = other, 2 = unknown, 3 = ok, 4 = nonCritical, 5 = critical, 6 = nonRecoverable)
You should also check if snmp service is running and the community name for reading the status.
So do I just add that to the number you listed?
you can try to put the number exactly like i wrote in a post before
On the next screen I put in the servers name, (I've also tried the IP address) the community string which is public, the port which is 161 and the OID number that you gave me. When I try and test those settings I get this message:
Status: DOWN?
Error returned: Couldn't retrieve value for OID
Reason: Error: Couldn't retrieve value for OID
I get a server down notification message when this happens. Have you experienced this?
I have got it working as well on my PE2900, but I am not sure what is checks. Does it checks failures on RAID, Processor(s), Memory etc ??
Is there any documentation on the systemStageGlobalSystemStatus OID ??
Thanks a lot for the good hint.