This may also be a problem with too many open file descriptors. I have run into similar problems with RedHat 7.2, but they were easily fixed by either rebooting (the neanderthal way) or by raising the limit on open file descriptors. This problem is pretty common on servers with really good uptimes. Consult the IRIX docs pertaining to open file descriptors.
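
As a rough illustration of what "increasing the open file descriptors" means at the process level, here is a minimal Python sketch using the standard `resource` module. The helper names `fd_limits` and `raise_soft_limit` are made up for this example, and it only changes the limit of the process that runs it (and its children); slapd itself would need the limit raised in whatever starts it, or system-wide as described in the IRIX docs.

```python
import resource


def fd_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_limit(target):
    """Raise the soft descriptor limit toward `target`, never above the hard limit.

    Returns the soft limit in effect afterwards. Hypothetical helper, not part
    of OpenLDAP or any system tool.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if hard != resource.RLIM_INFINITY:
        # An unprivileged process may not exceed the hard limit.
        target = min(target, hard)
    if soft != resource.RLIM_INFINITY and target > soft:
        resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
    return resource.getrlimit(resource.RLIMIT_NOFILE)[0]


if __name__ == "__main__":
    print("current (soft, hard):", fd_limits())
```

If the soft limit turns out to be small compared with the number of clients plus the files the backend keeps open, that would be consistent with the errno=24 failures, although it does not by itself rule out a leak.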
Terry Inzauro
-----Original Message-----
From: D.M.Lewney@sussex.ac.uk [mailto:D.M.Lewney@sussex.ac.uk]
Sent: Wednesday, June 18, 2003 2:41 AM
To: Olle.Westman@astrazeneca.com
Cc: OpenLDAP-software@OpenLDAP.org
Subject: Re: too many open files?
Olle.Westman@astrazeneca.com wrote:
> Hi,
>
> I run openldap-2.1.12 with ldbm backend on IRIX64 6.5.18f.
> Things have been running flawlessly for a few months now, but recently error
> messages about
> connection_read: no connection and "too many open files" on the ldap server
> host appeared in the log
> and causing clients to fail reading openldap.
> A restart of openldap fixes the problem for a few days, but soon returns.
>
> What can be the cause of this? Do I need more file descriptors or is there a
> file descriptor leak in openldap ?
> We have a little more than 100 clients here. The syslog follows below:
>
> Jun 17 08:52:45 7U:ldap01 last message repeated 5 times
> Jun 17 08:52:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 08:53:37 7U:ldap01 last message repeated 5 times
> Jun 17 08:53:38 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 08:53:39 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 08:54:45 7U:ldap01 last message repeated 7 times
> Jun 17 08:54:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 08:56:45 7U:ldap01 last message repeated 8 times
> Jun 17 08:56:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 08:58:45 7U:ldap01 last message repeated 13 times
> Jun 17 08:58:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:00:45 7U:ldap01 last message repeated 17 times
> Jun 17 09:00:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:01:47 7U:ldap01 last message repeated 8 times
> Jun 17 09:01:54 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:02:45 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:04:45 7U:ldap01 last message repeated 9 times
> Jun 17 09:04:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:05:13 7U:ldap01 last message repeated 3 times
> Jun 17 09:05:45 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:05:45 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:06:45 7U:ldap01 last message repeated 4 times
> Jun 17 09:06:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:08:45 7U:ldap01 last message repeated 3 times
> Jun 17 09:08:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:09:47 7U:ldap01 last message repeated 8 times
> Jun 17 09:09:47 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:10:13 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:10:37 7U:ldap01 last message repeated 6 times
> Jun 17 09:10:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:11:47 7U:ldap01 last message repeated 3 times
> Jun 17 09:12:24 5E:ldap01 login[1135]: root on /dev/ttyd1
> Jun 17 09:12:25 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:12:45 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:14:45 7U:ldap01 last message repeated 8 times
> Jun 17 09:14:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:16:45 7U:ldap01 last message repeated 8 times
> Jun 17 09:16:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:18:45 7U:ldap01 last message repeated 4 times
> Jun 17 09:18:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:20:45 7U:ldap01 last message repeated 14 times
> Jun 17 09:20:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:21:46 7U:ldap01 last message repeated 3 times
> Jun 17 09:22:19 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:22:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:24:45 7U:ldap01 last message repeated 8 times
> Jun 17 09:24:47 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:24:57 6E:ldap01 sshd[2922244]: Accepted rhosts for adboha from
> 157.96.192.233 port 886 ruser adboha
> Jun 17 09:25:13 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:25:46 7U:ldap01 last message repeated 7 times
> Jun 17 09:26:07 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:26:45 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:28:45 7U:ldap01 last message repeated 6 times
> Jun 17 09:28:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:29:47 7U:ldap01 last message repeated 8 times
> Jun 17 09:30:02 3D:ldap01 timeslave[236]: recvfrom(date read)=-1: No route
> to host
> Jun 17 09:30:13 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:30:36 7U:ldap01 last message repeated 7 times
> Jun 17 09:30:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:32:45 7U:ldap01 last message repeated 4 times
> Jun 17 09:32:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:34:02 7U:ldap01 last message repeated 6 times
> Jun 17 09:34:38 6E:ldap01 sshd[2920427]: Accepted rhosts for root from
> 157.96.192.233 port 888 ruser root
> Jun 17 09:34:45 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:34:45 7U:ldap01 last message repeated 2 times
> Jun 17 09:34:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:34:49 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
> (Too many open files)
> Jun 17 09:35:28 7U:ldap01 slapd[2922286]: slapd starting
> Jun 17 09:35:28 5E:ldap01 su[2921936]: succeeded: ttyq4 changing from root
> to ldap
> Jun 17 10:00:00 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 17 10:19:58 3D:ldap01 timeslave[236]: Time measurements from
> timelord.seml.astrazeneca.net are working again
> Jun 17 11:54:24 6E:ldap01 sshd[2924164]: Accepted rhosts for root from
> 157.96.192.248 port 679 ruser root
> Jun 17 11:55:59 5E:ldap01 su[2923753]: succeeded: ttyq0 changing from root
> to ldap
> Jun 17 13:30:54 6E:ldap01 sshd[2925120]: Accepted rhosts for root from
> 157.96.192.248 port 801 ruser root
> Jun 17 14:00:05 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 17 18:03:52 7U:ldap01 slapd[2922286]: connection_read(91): no
> connection!
> Jun 17 19:05:23 7U:ldap01 slapd[2922286]: connection_read(92): no
> connection!
> Jun 17 20:03:53 7U:ldap01 slapd[2922286]: connection_read(95): no
> connection!
> Jun 17 21:05:25 7U:ldap01 slapd[2922286]: connection_read(98): no
> connection!
> Jun 17 22:03:52 7U:ldap01 slapd[2922286]: connection_read(101): no
> connection!
> Jun 17 22:05:25 7U:ldap01 slapd[2922286]: connection_read(94): no
> connection!
> Jun 17 23:03:53 7U:ldap01 slapd[2922286]: connection_read(103): no
> connection!
> Jun 18 00:03:53 7U:ldap01 slapd[2922286]: connection_read(106): no
> connection!
> Jun 18 01:03:54 7U:ldap01 slapd[2922286]: connection_read(110): no
> connection!
> Jun 18 03:00:03 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 18 03:00:06 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 18 03:03:51 7U:ldap01 slapd[2922286]: connection_read(116): no
> connection!
> Jun 18 03:59:59 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 18 05:00:00 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 18 05:05:26 7U:ldap01 slapd[2922286]: connection_read(115): no
> connection!
> Jun 18 06:00:07 7U:ldap01 slapd[2922286]: send_search_entry: ber write
> failed
> Jun 18 06:05:26 7U:ldap01 slapd[2922286]: connection_read(125): no
> connection!
> Jun 18 07:00:06 7U:ldap01 slapd[2922286]: deferring operation
> Jun 18 07:00:06 7U:ldap01 slapd[2922286]: deferring operation
> Jun 18 07:03:52 7U:ldap01 slapd[2922286]: connection_read(128): no
> connection!
> Jun 18 08:03:54 7U:ldap01 slapd[2922286]: connection_read(133): no
> connection!
> Jun 18 08:54:59 7U:ldap01 slapd[2922286]: slapd shutdown: waiting for 0
> threads to terminate
> Jun 18 08:54:59 7U:ldap01 slapd[2922286]: slapd stopped.
> Jun 18 08:55:01 7U:ldap01 slapd[2939777]: slapd starting
> Jun 18 08:59:59 7U:ldap01 slapd[2939777]: send_search_entry: ber write
> failed
>
> -Olle Westman
Hi,
Try putting the following line in slapd.conf
idletimeout 20
Since doing this here the problem has not recurred.
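
The idletimeout value is in seconds; slapd closes client connections that have been idle for longer than that, which frees their descriptors. One way to check whether the setting helps is to count the errno=24 accept failures per hour in syslog before and after the change. The following is only a sketch: the function name `count_emfile_per_hour` is made up, the pattern is based on the log lines quoted above, and it ignores the "last message repeated N times" lines, so it undercounts.

```python
import re
from collections import Counter

# Matches lines such as:
#   Jun 17 08:52:46 7U:ldap01 slapd[2831920]: daemon: accept(6) failed errno=24
# and captures "Jun 17 08" (month, day, hour) as the bucket key.
ACCEPT_FAIL = re.compile(
    r"^(\w{3}\s+\d+ \d{2}):\d{2}:\d{2} "
    r".*slapd\[\d+\]: daemon: accept\(\d+\) failed errno=24"
)


def count_emfile_per_hour(lines):
    """Count slapd accept() failures with errno=24, grouped by hour."""
    counts = Counter()
    for line in lines:
        match = ACCEPT_FAIL.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage: python count_emfile.py < SYSLOG
    for hour, n in sorted(count_emfile_per_hour(sys.stdin).items()):
        print(hour, n)
```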
--
Dave
--
Dave Lewney
Principal Systems Programmer, Computing Service
University of Sussex, Brighton BN1 9QJ. Tel: 01273 678354 Fax: 01273 271956