[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: (ITS#5835) master slapd dying on lost writes




--On November 28, 2008 11:35:45 AM -0800 Quanah Gibson-Mount 
<quanah@zimbra.com> wrote:

>
>
> --On November 28, 2008 7:24:27 PM +0000 quanah@zimbra.com wrote:
>
>
>> As you can see, we lose a connection and then try to read from it (FD
>> 40).  This is where the log ends because the assert triggered.
>
> Printing the connection struct shows it is actually an issue with FD 26:

The previous connection on FD 26 shows:

Nov 29 00:27:02 new slapd[8930]: conn=335 fd=26 ACCEPT from 
IP=192.168.61.154:40988 (IP=192.168.58.179:389)
Nov 29 00:27:02 new slapd[8930]: conn=335 op=0 STARTTLS
Nov 29 00:27:02 new slapd[8930]: conn=335 op=0 RESULT oid= err=0 text=
Nov 29 00:27:02 new slapd[8930]: conn=335 fd=26 TLS established tls_ssf=256 
ssf=256
Nov 29 00:27:02 new slapd[8930]: conn=335 op=1 BIND 
dn="uid=zmreplica,cn=admins,cn=zimbra" method=128
Nov 29 00:27:02 new slapd[8930]: conn=335 op=1 BIND 
dn="uid=zmreplica,cn=admins,cn=zimbra" mech=SIMPLE ssf=0
Nov 29 00:27:02 new slapd[8930]: conn=335 op=1 RESULT tag=97 err=0 text=
Nov 29 00:27:02 new slapd[8930]: conn=335 op=2 SRCH base="cn=accesslog" 
scope=2 deref=0 filter="(&(objectClass=auditWriteObject)(reqResult=0))"
Nov 29 00:27:02 new slapd[8930]: conn=335 op=2 SRCH attr=reqDN reqType 
reqMod reqNewRDN reqDeleteOldRDN reqNewSuperior entryCSN

[then nothing for a long time until]:


Nov 29 00:29:41 new slapd[8930]: conn=335 fd=26 closed (connection lost)
Nov 29 00:29:41 new slapd[8930]: conn=496 op=0 BIND 
dn="uid=zimbra,cn=admins,cn=zimbra" method=128
Nov 29 00:29:41 new slapd[8930]: conn=496 op=0 BIND 
dn="uid=zimbra,cn=admins,cn=zimbra" mech=SIMPLE ssf=0
Nov 29 00:29:41 new slapd[8930]: conn=496 op=0 RESULT tag=97 err=0 text=
Nov 29 00:29:41 new slapd[8930]: conn=498 fd=26 ACCEPT from 
IP=192.168.58.231:45575 (IP=192.168.58.179:389)
Nov 29 00:29:41 new slapd[8930]: connection_read(40): no connection!
Nov 29 00:29:41 new slapd[8930]: conn=498 op=0 BIND 
dn="uid=zimbra,cn=admins,cn=zimbra" method=128
Nov 29 00:29:41 new slapd[8930]: conn=498 op=0 BIND 
dn="uid=zimbra,cn=admins,cn=zimbra" mech=SIMPLE ssf=0
Nov 29 00:29:41 new slapd[8930]: conn=498 op=0 RESULT tag=97 err=0 text=

[slapd crashes]

The consumer here in conn 335 is a refreshAndPersist consumer.  Not 
entirely clear why it's connection gets closed.

--Quanah

--

Quanah Gibson-Mount
Principal Software Engineer
Zimbra, Inc
--------------------
Zimbra ::  the leader in open source messaging and collaboration