[DRBD-user] System Hanging after drbd_bm_resize

Ben Timby btimby at gmail.com
Thu Sep 29 17:15:47 CEST 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


A new machine I am working with has started hanging, this has happened
twice in the past 24 hours. I am not sure what might be going on.

I am running 8.3.11 on CentOS with kernel 2.6.18-238.12.1.el5 on x86_64.

/var/log/messages:

Sep 29 10:06:05 sedri kernel: block drbd2: role( Primary -> Secondary )
Sep 29 10:06:05 sedri kernel: block drbd2: bitmap WRITE of 0 pages
took 0 jiffies
Sep 29 10:06:05 sedri kernel: block drbd2: 0 KB (0 bits) marked
out-of-sync by on disk bit-map.
Sep 29 10:06:11 sedri kernel: block drbd2: peer( Secondary -> Unknown
) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Sep 29 10:06:11 sedri kernel: block drbd2: asender terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating asender thread
Sep 29 10:06:11 sedri kernel: block drbd2: Connection closed
Sep 29 10:06:11 sedri kernel: block drbd2: conn( Disconnecting -> StandAlone )
Sep 29 10:06:11 sedri kernel: block drbd2: receiver terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating receiver thread
Sep 29 10:06:11 sedri kernel: block drbd2: disk( UpToDate -> Failed )
Sep 29 10:06:11 sedri kernel: block drbd2: disk( Failed -> Diskless )
Sep 29 10:06:11 sedri kernel: block drbd2: drbd_bm_resize called with
capacity == 0
Sep 29 10:06:11 sedri kernel: block drbd2: worker terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating worker thread
Sep 29 10:06:13 sedri kernel: block drbd2: Starting worker thread
(from cqueue/0 [290])
Sep 29 10:06:13 sedri kernel: block drbd2: disk( Diskless -> Attaching )
Sep 29 10:06:13 sedri kernel: block drbd2: Found 4 transactions (131
active extents) in activity log.
Sep 29 10:06:13 sedri kernel: block drbd2: Method to ensure write
ordering: barrier
Sep 29 10:06:13 sedri kernel: block drbd2: drbd_bm_resize called with
capacity == 15627638960
Sep 29 10:50:50 sedri syslogd 1.4.1: restart.

Both crashes are the same. Kdump logs nothing, the system is
unresponsive and I have to reboot it from the console. You can see in
the log that it took 44 minutes between the hang and when I could
arrive at the data center to do the deed. Any suggestions are greatly
appreciated.

Thanks.



More information about the drbd-user mailing list