Friday, September 30th 2011, 2:02pm UTC+2

CRITICAL - Socket timeout after 10 seconds

karen

Beginner

Posts: 18

Number of Nagios server: 2

Nagios Versions: Nagios 3.0

Distributed monitoring: Yes

Redundant monitoring: No

Number of hosts: 86

Number of services: 646

OS: RHEL-5

Plugin Versions: Nagios plugin 1.4.11

1

Thursday, August 21st 2008, 11:11am

Hi experts,

I have noticed that I sometimes get the following kinds of responses from my servers and network devices:

"CRITICAL - Socket timeout after 10 seconds"

"SNMP problem - No data received from host"

"Status is a WARNING level - SNMP agent not responding"

What is the reason behind this? I kept pinging these particular hosts and did not see any packet loss for them. How can I fix this issue?

Thanks.

donni

Beginner

Posts: 26

Birthday: Dec 26th

Gender: male

Location: Hamburg

Occupation: System Administrator

Number of Nagios server: 5

Nagios Versions: 3.0.3, 2.3

Distributed monitoring: Yes

Redundant monitoring: Yes

Number of hosts: 170

Number of services: 1500

OS: Debian

Plugin Versions: 1.4.12

NagVis Version: 1.3

NDO Version: 1.4b7

Perfparse Version: 0.105.6

Other Addons: PNP, DokuWiki, NagTrap, NSCA

2

Thursday, August 21st 2008, 11:31am

Hi Karen,

check_nrpe returns "CRITICAL - Socket timeout after 10 seconds" if the check returned no result within 10 seconds (10 seconds is the default timeout of the check_nrpe plugin). In my experience this can happen for the following reasons:

1. The machine you want to check is overloaded for whatever reason.
Solution: check for the "whatever" reason and fix it :)

2. The execution of the plugin takes a while by design (e.g. the plugin counts the rows of a very big InnoDB table)
Solution: increase the timeout for the nrpe plugin (check_nrpe --help)
e.g. ./check_nrpe -H hosttocheck -c slow_check -t 30
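
To pick a sensible value for -t, it can help to measure how long the slow check really takes. The following is only a rough sketch: time_check is a hypothetical helper name (not part of Nagios or NRPE), and hosttocheck and slow_check are the placeholder names from the example above.

```shell
#!/bin/sh
# Hypothetical helper: run any command and report how many whole
# seconds it took, together with its exit status.
# Usage: time_check <command> [args...]
time_check() {
    start=$(date +%s)      # seconds since the epoch before the run
    "$@"                   # run the command exactly as given
    status=$?
    end=$(date +%s)        # seconds since the epoch after the run
    echo "elapsed=$((end - start))s exit=$status"
    return $status
}
```

It would be called as, for example, time_check ./check_nrpe -H hosttocheck -c slow_check -t 60, using a generous -t while measuring so the plugin itself does not cut the run short. Other timeouts may still apply, for instance command_timeout in nrpe.cfg on the monitored host.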

I haven't seen the other two error messages yet, so I cannot help with those, sorry.

Best, Alex

karen

3

Thursday, August 21st 2008, 1:46pm

Alex, thanks for the quick reply on the check_nrpe issue. :)

I get the SNMP-related messages for my switches and routers, but only very rarely, and each time only for a fraction of a second.