> -----Original Message-----
> From: drbd-user-bounces at lists.linbit.com
> [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Gerry Reno
> Sent: Wednesday, February 21, 2007 10:41 AM
> To: drbd-user at lists.linbit.com
> Subject: [DRBD-user] [Fwd: Re: [Linux-HA] heartbeat 2.0.8: lockups] kerneloops
>
> Forwarding this from linux-ha list:
>
> -------- Original Message --------
> Subject: Re: [Linux-HA] heartbeat 2.0.8: lockups
> Date: Mon, 19 Feb 2007 09:57:16 -0500
> From: Gerry Reno <greno at verizon.net>
> Reply-To: General Linux-HA mailing list <linux-ha at lists.linux-ha.org>
> To: General Linux-HA mailing list <linux-ha at lists.linux-ha.org>
> References:
> <12392854.6367231171759462886.JavaMail.root at vms074.mailsrvcs.net>
> <26ef5e70702190352p4d6d24cajb31b28edbe0d1885 at mail.gmail.com>
> <45D9B8CD.70907 at verizon.net>
>
> Gerry Reno wrote:
> > Andrew Beekhof wrote:
> >> so what are we looking at here? what time did the lockup occur?
> >>
> >> On 2/18/07, greno at verizon.net <greno at verizon.net> wrote:
> >>> I've been running heartbeat on my two nodes for almost two weeks
> >>> and everything is functioning as it is supposed to with the
> >>> exception that I am getting frequent lockups on the primary
> >>> server. It doesn't matter which server that I make the primary
> >>> it will eventually be locked up. The lockups are very hard.
> >>> There is no response of any kind out of the locked up machine.
> >>> Sometimes the drive light will be on and sometimes not. The
> >>> lockups are occurring at times of disk access such as during
> >>> backups or right after I ftp a file or tar file over to another
> >>> machine from the drbd array. There is very little in the logs.
> >>> It just shows a big gap and then a syslog restart for when I
> >>> cold booted the server to bring it back up. I'm going to attach
> >>> dmesg output and /var/log/messages output for both servers.
> >>> What should I do to track down the source of this problem?
> >>>
> >>> heartbeat-2.0.8-1.fc6
> >>> drbd-0.7.23-15.fc6.at
> >>>
> >>> Other info:
> >>> drbd is running over logical volume which is over a RAID-1 md
> >>> array on each server.
> >>>
> >>> Both servers were rock stable prior to installing HA.
> >>>
> >>> <snip>

Are you per-chance doing "internal" storage of the meta-data on these
RAID-1 arrays?

-Ross