[DRBD-user] Invalidate

Lars Ellenberg Lars.Ellenberg at linbit.com
Wed Aug 10 11:28:19 CEST 2005

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


/ 2005-08-04 17:25:51 +0200
\ Stephan Rattai:
> > (unless something else before that went terribly wrong, and some of our
> > threads died without that being noticed. or something corrupts memory.)
> >
> > but I may be wrong, of course...
> 
> Ok, opening my syslog again and... this wasn't the first oops... I will attach 
> the two proceding oopses...

> Aug  4 11:37:05 test2 kernel: EFLAGS: 00010282   (2.6.12.2)
> Aug  4 11:37:05 test2 kernel: EIP is at got_NegDReply+0x1a/0xe4 [drbd]

what did you do to get a NegDReply ??
any logs on the other box?
some context before that,
including the last successfull drbd state change?

feel free to send me those logs in private mail,
we don't need to spam the user list with call traces.

> Aug  4 11:37:05 test2 kernel: eax: f3fa32b8   ebx: 00000018   ecx: 00000002   edx: 00000011
> Aug  4 11:37:05 test2 kernel: esi: ffffffff   edi: f3fa3000   ebp: f3fa32b8   esp: f5bb7f70
> Aug  4 11:37:05 test2 kernel: ds: 007b   es: 007b   ss: 0068
> Aug  4 11:37:05 test2 kernel: Process drbd0_asender (pid: 8910, threadinfo=f5bb6000 task=f0f78020)

ok, the asender dies.
we don't notice.

> Aug  4 11:37:13 test2 kernel: Process drbd0_receiver (pid: 8900, threadinfo=f5b50000 task=f6894a20)
> Aug  4 11:37:13 test2 kernel: Stack: c0124548 00000001 00000001 f0d7f020 00000002 f3fa3448 f3fa3000 c0124e5e
> Aug  4 11:37:13 test2 kernel:        00000001 00000001 f0f78020 f8c71b57 00000001 f0f78020 c0485d40 00000008
> Aug  4 11:37:13 test2 kernel:        00000004 f3fa3400 f3fa3000 f3fa33d0 f3fa33d0 f8c6b355 f3fa3448 00000000
> Aug  4 11:37:13 test2 kernel: Call Trace:
> Aug  4 11:37:13 test2 kernel:  [force_sig_info+39/165] force_sig_info+0x27/0xa5
receiver tries to signal asender.
asender is no longer there.
boom.

so I have to try and figure why the asender died in the first place.
it should not.  but just saying "it might be related to that bio_clone
bug" seems a bit too simple. we need more input on this.

thanks,

-- 
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.



More information about the drbd-user mailing list