[DRBD-user] Freezing DRBD and freezing disks...

Stefan Löfgren stlo at dontdownload.com
Mon May 19 15:56:54 CEST 2008

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi all!

Background:
I've only been using DRBD for a couple of weeks in test
and now for 1 week in "production"...

My testsystem was kernel 2.4.3.33 and DRBD 0.7.25 in
virtual machines and seems to be working just fine...

Moving on to production-machines (DELL PowerEdge R200) gave
me a lot of problems and had to go for Slackware 12.1,
kernel 2.6.24.5 and DRBD 8.2.5.
Here starts the problem:

DRBD gets stuck in Bitmap-transfer after 15 minutes and freezes
the complete resource, and a disconnect is needed. I fixed this
with using 8.2.6rc1 and now the resource is now working.

Now, the next problem. Freezing ATA disks instead.
This might have been a problem before the update but
was pretty hard to find behind the freezing resource...

May 19 13:55:36 mother2 kernel: ata1.01: exception Emask 0x0 SAct 0x0 SErr 0x0
action 0x2 frozen
May 19 13:55:36 mother2 kernel: ata1.01: cmd
a0/00:00:00:08:00/00:00:00:00:00/b0 tag 0 pio 8 in
May 19 13:55:36 mother2 kernel:          cdb 25 00 00 00 00 00 00 00  00 00 00
00 00 00 00 00
May 19 13:55:36 mother2 kernel:          res
40/00:03:00:08:00/00:00:00:00:00/b0 Emask 0x4 (timeout)
May 19 13:55:36 mother2 kernel: ata1.01: status: { DRDY }


This first looked like a problem with the hardware 'cos there was 
no messages like this on the secondary node. So, switching primary...
Now, there is no messages like this on that machine but suddenly
there's a lot of messages on the new primary...

This rules out a hardware problem, right? (no errors on secondary, even if a
copy a lot of files on the other filesystems on the same physical disk).


Anyone seen this before? Known issues?
Should I go back to DRBD 8.0.x? (disk-compatible?)
I've tried to different SATA controllers with the same result.
No problems on the secondary node.
I'm running the resource on raid1 (/dev/md0) on both machines...
Any other ideas?

Thanks in advance...

/Stefan





More information about the drbd-user mailing list