[DRBD-user] [Fwd: Re: [Linux-HA] heartbeat 2.0.8: lockups] kerneloops

Ross S. W. Walker rwalker at medallion.com
Wed Feb 21 22:43:56 CET 2007

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


> -----Original Message-----
> From: drbd-user-bounces at lists.linbit.com 
> [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Gerry Reno
> Sent: Wednesday, February 21, 2007 10:41 AM
> To: drbd-user at lists.linbit.com
> Subject: [DRBD-user] [Fwd: Re: [Linux-HA] heartbeat 2.0.8: 
> lockups] kerneloops
> 
> 
> Forwarding this from linux-ha list:
> 
> -------- Original Message --------
> Subject: 	Re: [Linux-HA] heartbeat 2.0.8: lockups
> Date: 	Mon, 19 Feb 2007 09:57:16 -0500
> From: 	Gerry Reno <greno at verizon.net>
> Reply-To: 	General Linux-HA mailing list 
> <linux-ha at lists.linux-ha.org>
> To: 	General Linux-HA mailing list <linux-ha at lists.linux-ha.org>
> References: 
> <12392854.6367231171759462886.JavaMail.root at vms074.mailsrvcs.net> 
> <26ef5e70702190352p4d6d24cajb31b28edbe0d1885 at mail.gmail.com> 
> <45D9B8CD.70907 at verizon.net>
> 
> 
> 
> Gerry Reno wrote:
> > Andrew Beekhof wrote:
> >> so what are we looking at here?  what time did the lockup occur?
> >>
> >> On 2/18/07, greno at verizon.net <greno at verizon.net> wrote:
> >>> I've been running heartbeat on my two nodes for almost 
> two weeks and 
> >>> everything is functioning as it is supposed to with the exception 
> >>> that I am getting frequent lockups on the primary server.  It 
> >>> doesn't matter which server that I make the primary it will 
> >>> eventually be locked up.  The lockups are very hard.  There is no 
> >>> response of any kind out of the locked up machine.  Sometimes the 
> >>> drive light will be on and sometimes not.  The lockups 
> are occurring 
> >>> at times of disk access such as during backups or right 
> after I ftp 
> >>> a file or tar file over to another machine from the drbd array.  
> >>> There is very little in the logs.  It just shows a big 
> gap and then 
> >>> a syslog restart for when I cold booted the server to 
> bring it back 
> >>> up.  I'm going to attach dmesg output and 
> /var/log/messages output 
> >>> for both servers.  What should I do to track down the 
> source of this 
> >>> problem?
> >>>
> >>> heartbeat-2.0.8-1.fc6
> >>> drbd-0.7.23-15.fc6.at
> >>>
> >>> Other info:
> >>> drbd is running over logical volume which is over a 
> RAID-1 md array 
> >>> on each server.
> >>>
> >>> Both servers were rock stable prior to installing HA.
> >>>
> >>>

<snip>

Are you per-chance doing "internal" storage of the meta-data on these
RAID-1 arrays?

-Ross

______________________________________________________________________
This e-mail, and any attachments thereto, is intended only for use by
the addressee(s) named herein and may contain legally privileged
and/or confidential information. If you are not the intended recipient
of this e-mail, you are hereby notified that any dissemination,
distribution or copying of this e-mail, and any attachments thereto,
is strictly prohibited. If you have received this e-mail in error,
please immediately notify the sender and permanently delete the
original and any copy or printout thereof.




More information about the drbd-user mailing list