A new machine I am working with has started hanging; this has happened twice in the past 24 hours, and I am not sure what might be going on. I am running DRBD 8.3.11 on CentOS with kernel 2.6.18-238.12.1.el5 on x86_64.

/var/log/messages:

Sep 29 10:06:05 sedri kernel: block drbd2: role( Primary -> Secondary )
Sep 29 10:06:05 sedri kernel: block drbd2: bitmap WRITE of 0 pages took 0 jiffies
Sep 29 10:06:05 sedri kernel: block drbd2: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Sep 29 10:06:11 sedri kernel: block drbd2: peer( Secondary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Sep 29 10:06:11 sedri kernel: block drbd2: asender terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating asender thread
Sep 29 10:06:11 sedri kernel: block drbd2: Connection closed
Sep 29 10:06:11 sedri kernel: block drbd2: conn( Disconnecting -> StandAlone )
Sep 29 10:06:11 sedri kernel: block drbd2: receiver terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating receiver thread
Sep 29 10:06:11 sedri kernel: block drbd2: disk( UpToDate -> Failed )
Sep 29 10:06:11 sedri kernel: block drbd2: disk( Failed -> Diskless )
Sep 29 10:06:11 sedri kernel: block drbd2: drbd_bm_resize called with capacity == 0
Sep 29 10:06:11 sedri kernel: block drbd2: worker terminated
Sep 29 10:06:11 sedri kernel: block drbd2: Terminating worker thread
Sep 29 10:06:13 sedri kernel: block drbd2: Starting worker thread (from cqueue/0 [290])
Sep 29 10:06:13 sedri kernel: block drbd2: disk( Diskless -> Attaching )
Sep 29 10:06:13 sedri kernel: block drbd2: Found 4 transactions (131 active extents) in activity log.
Sep 29 10:06:13 sedri kernel: block drbd2: Method to ensure write ordering: barrier
Sep 29 10:06:13 sedri kernel: block drbd2: drbd_bm_resize called with capacity == 15627638960
Sep 29 10:50:50 sedri syslogd 1.4.1: restart.

Both crashes look the same. Kdump logs nothing; the system is unresponsive and I have to reboot it from the console.
As the log shows, 44 minutes passed between the hang and the reboot, which is how long it took me to get to the data center. Any suggestions are greatly appreciated. Thanks.