On Thu, May 03, 2007 at 01:23:19PM +0200, Håkan Engblom wrote:
> Hi,
>
> Some background: drbd-version is 0.7.22, running on a Montavista Linux
> distribution 2.6.10_mvl4
>
> I've seen that sometimes when doing mkfs on a drbd-partition, the system
> seems to hang in a drbd-function in kernel-space.
> The problem has been reported once before to this mailing-list, in February
> 2006, in a thread called "mkfs hangs with lastest drbd branch build and FC4
> kernel" (I think it is the same problem), and it has also been observed by
> others (seen when searching for "drbd_al_begin_io hangs" on Google).
>
> However, I've not seen any solution to the problem.
>
> So far, what I've been able to establish is that the process seems to hang in
> the drbd-function mentioned above, and I also know that it hangs 640 bytes
> into the function. When looking at the source code of this function, my
> guess is that it hangs on "spin_lock_irq(&mdev->al_lock);".
>
> Is this a known problem, and does anyone know of a solution?
hanging in "spin_lock_irq" translates to a hard lockup of the machine.
so, this is most likely not the correct guess.
what exactly are the symptoms of that "hang"?
do the numbers in /proc/drbd move, still?
can you reproduce this with some different kernel,
preferably plain kernel.org?
does it hang only when "Connected" or also when "StandAlone"?
does running "while true; do sync; usleep 1; done" help?
when run on the Primary?
Secondary?
both?
is this on a software raid?
does it help doing this without software raid?
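
a minimal sketch for the "do the numbers in /proc/drbd move" check above:
sample the file twice and compare. the function name and the 5 second
default are made up for illustration; it only compares the raw text and
assumes nothing about the layout of /proc/drbd.

```shell
#!/bin/sh
# Hypothetical helper, not part of drbd: report whether a status file
# changes between two samples taken some seconds apart.
#   $1  file to sample (default: /proc/drbd)
#   $2  seconds to wait between the samples (default: 5, an arbitrary choice)
# Returns 0 if the content changed, 1 if it is identical,
# 2 if the file could not be read.
drbd_counters_moved() {
    file=${1:-/proc/drbd}
    interval=${2:-5}

    # first sample
    before=$(cat "$file") || return 2
    sleep "$interval"
    # second sample
    after=$(cat "$file") || return 2

    # any difference in the raw text counts as "still moving"
    [ "$before" != "$after" ]
}
```

run it while the mkfs appears stuck, for example:
drbd_counters_moved && echo "still moving" || echo "unchanged or unreadable"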
--
: Lars Ellenberg Tel +43-1-8178292-0 :
: LINBIT Information Technologies GmbH Fax +43-1-8178292-82 :
: Vivenotgasse 48, A-1120 Vienna/Europe http://www.linbit.com :
__
please use the "List-Reply" function of your email client.