[DRBD-user] Secondary SCSI Errors causing Primary Unresponsiveness
tony.willoughby at bigbandnet.com
Wed Sep 15 17:43:34 CEST 2004
We've had an incident that I am trying to understand.
The setup is two IBM E-Server x330s running Heartbeat/DRBD (0.6.4).
(I know that 0.6.4 is old, but we have a rather staggered release
cycle and our customers tend to upgrade infrequently.)
At some point the secondary machine started reporting SCSI errors (the
disk eventually failed). It is not known how long the system was
having these errors.
The primary machine started to become unresponsive.
Here is the odd thing: Any command that accessed the filesystem above
DRBD (e.g. "ls /the/mirrored/partition") would hang. Once the
secondary was shut down, the commands that were hung suddenly
completed.
I'm not necessarily looking for a fix (although if I were told this
was fixed in a later release you'd make my day :^); I'm trying to
understand why this would happen.
Anyone have any ideas?