[Drbd-dev] DRBD8: drbd nodes deadlock in WFBitMapT
Montrose, Ernest
Ernest.Montrose at stratus.com
Sat Mar 31 00:01:01 CEST 2007
Hi all,
This is another hard to reproduce one but the proofs are in the logs
that the problem is alive and well.
I am hoping for at least some clues that may help reproduce
this...Essentially after one node in
Primary state is powered down(Not a graceful shutdown) both nodes ends
up In WFBitMapT for a drbd volume.
Here are some logs:
On one node========================
Drbd2: Writing metadata to superblock now.
......
....... This nodes is powered of and came back with:
Mar 23 13:16:53 jerry kernel: drbd2: rct = 0 in
/test_logs/builds/SuperNova/trunk/070323/platform/drbd/src/drbd/drbd_rec
eiver.c:1878
Mar 23 13:16:55 [ OK ]
Mar 23 13:16:54 jerry kernel: drbd2: drbd_sync_handshake:
Mar 23 13:16:54 jerry kernel: drbd2: self
F71E503A8179BC5D:0000000000000000:3F430D4E1D59C3EA:6B0B1DA5CB20689C
Mar 23 13:16:54 jerry kernel: drbd2: peer
F71E503A8179BC5C:0000000000000000:3F430D4E1D59C3EA:6B0B1DA5CB20689C
Mar 23 13:16:55 jerry kernel: drbd2: uuid_compare()=0 by rule 4
Mar 23 13:16:55 jerry kernel: drbd2: No resync, but bits in bitmap!
......
Mar 23 13:17:00 jerry kernel: drbd2: drbd_sync_handshake:
Mar 23 13:17:00 jerry kernel: drbd2: self
F71E503A8179BC5D:0000000000000000:3F430D4E1D59C3EA:6B0B1DA5CB20689C
Mar 23 13:17:00 jerry kernel: drbd2: peer
0000000000000000:0000000000000000:F71E503A8179BC5C:3F430D4E1D59C3EA
Mar 23 13:17:00 jerry kernel: drbd2: uuid_compare()=-2 by rule 6
Mar 23 13:17:00 jerry kernel: drbd2: Writing meta data super block now.
Mar 23 13:17:01 jerry kernel: drbd2: writing of bitmap took 11 jiffies
Mar 23 13:17:01 jerry kernel: drbd2: 12 GB marked out-of-sync by on disk
bit-map.
Mar 23 13:17:02 jerry kernel: drbd2: 13336132 KB now marked out-of-sync
by on disk bit-map.
Mar 23 13:17:02 jerry kernel: drbd2: Writing meta data super block now.
Mar 23 13:17:02 jerry kernel: drbd2: uuid[History_start] now
F71E503A8179BC5D
Mar 23 13:17:02 jerry kernel: drbd2: uuid[Current] now 0000000000000000
Mar 23 13:17:03 jerry kernel: drbd2: conn( Connected -> WFBitMapT )
Mar 23 13:17:03 jerry kernel: drbd2: Writing meta data super block now.
On theh other node=============================
Mar 23 13:16:48 ben kernel: drbd2: aftr_isp( 0 -> 1 )
Mar 23 13:16:48 ben kernel: drbd2: Handshake successful: DRBD Network
Protocol version 86
Mar 23 13:16:48 ben kernel: drbd2: peer( Unknown -> Secondary ) conn(
WFReportParams -> Connected ) pdsk( DUnknown -> UpToDate ) peer_isp( 0
-> 1 )
Mar 23 13:16:48 ben kernel: drbd2: Writing meta data super block now.
....
Mar 23 13:16:49 ben kernel: drbd2: rct = 2 in
/test_logs/builds/SuperNova/trunk/070323/platform/drbd/src/drbd/drbd_rec
eiver.c:1878
Mar 23 13:16:49 ben kernel: drbd2: drbd_sync_handshake:
Mar 23 13:16:49 ben kernel: drbd2: self
F71E503A8179BC5C:0000000000000000:3F430D4E1D59C3EA:6B0B1DA5CB20689C
Mar 23 13:16:49 ben kernel: drbd2: peer
F71E503A8179BC5D:0000000000000000:3F430D4E1D59C3EA:6B0B1DA5CB20689C
Mar 23 13:16:49 ben kernel: drbd2: uuid_compare()=-1 by rule 4
Mar 23 13:16:49 ben kernel: drbd2: uuid[History_start] now
F71E503A8179BC5C
Mar 23 13:16:49 ben kernel: drbd2: uuid[Current] now 0000000000000000
Mar 23 13:16:49 ben kernel: drbd2: conn( Connected -> WFBitMapT )
Mar 23 13:16:49 ben kernel: drbd2: Writing meta data super block now.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.linbit.com/pipermail/drbd-dev/attachments/20070330/588748b1/attachment.htm
More information about the drbd-dev
mailing list