[DRBD-user] [Fwd: Re: [Linux-HA] heartbeat 2.0.8: lockups]
kerneloops
Ross S. W. Walker
rwalker at medallion.com
Wed Feb 21 22:43:56 CET 2007
> -----Original Message-----
> From: drbd-user-bounces at lists.linbit.com
> [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Gerry Reno
> Sent: Wednesday, February 21, 2007 10:41 AM
> To: drbd-user at lists.linbit.com
> Subject: [DRBD-user] [Fwd: Re: [Linux-HA] heartbeat 2.0.8:
> lockups] kerneloops
>
>
> Forwarding this from linux-ha list:
>
> -------- Original Message --------
> Subject: Re: [Linux-HA] heartbeat 2.0.8: lockups
> Date: Mon, 19 Feb 2007 09:57:16 -0500
> From: Gerry Reno <greno at verizon.net>
> Reply-To: General Linux-HA mailing list
> <linux-ha at lists.linux-ha.org>
> To: General Linux-HA mailing list <linux-ha at lists.linux-ha.org>
> References:
> <12392854.6367231171759462886.JavaMail.root at vms074.mailsrvcs.net>
> <26ef5e70702190352p4d6d24cajb31b28edbe0d1885 at mail.gmail.com>
> <45D9B8CD.70907 at verizon.net>
>
>
>
> Gerry Reno wrote:
> > Andrew Beekhof wrote:
> >> so what are we looking at here? what time did the lockup occur?
> >>
> >> On 2/18/07, greno at verizon.net <greno at verizon.net> wrote:
> >>> I've been running heartbeat on my two nodes for almost
> two weeks and
> >>> everything is functioning as it is supposed to with the exception
> >>> that I am getting frequent lockups on the primary server. It
> >>> doesn't matter which server that I make the primary it will
> >>> eventually be locked up. The lockups are very hard. There is no
> >>> response of any kind out of the locked up machine. Sometimes the
> >>> drive light will be on and sometimes not. The lockups
> are occurring
> >>> at times of disk access such as during backups or right
> after I ftp
> >>> a file or tar file over to another machine from the drbd array.
> >>> There is very little in the logs. It just shows a big
> gap and then
> >>> a syslog restart for when I cold booted the server to
> bring it back
> >>> up. I'm going to attach dmesg output and
> /var/log/messages output
> >>> for both servers. What should I do to track down the
> source of this
> >>> problem?
> >>>
> >>> heartbeat-2.0.8-1.fc6
> >>> drbd-0.7.23-15.fc6.at
> >>>
> >>> Other info:
> >>> drbd is running over logical volume which is over a
> RAID-1 md array
> >>> on each server.
> >>>
> >>> Both servers were rock stable prior to installing HA.
> >>>
> >>>
<snip>
Are you per-chance doing "internal" storage of the meta-data on these
RAID-1 arrays?
-Ross
______________________________________________________________________
This e-mail, and any attachments thereto, is intended only for use by
the addressee(s) named herein and may contain legally privileged
and/or confidential information. If you are not the intended recipient
of this e-mail, you are hereby notified that any dissemination,
distribution or copying of this e-mail, and any attachments thereto,
is strictly prohibited. If you have received this e-mail in error,
please immediately notify the sender and permanently delete the
original and any copy or printout thereof.
More information about the drbd-user
mailing list