[DRBD-user] HA-NFS & drbd

Dave Dykstra dwdha at drdykstra.us
Fri Mar 4 18:20:54 CET 2005


[I haven't seen Paul's message come through drbd-user yet but he did
send it there in addition to direct to me.]

On Thu, Mar 03, 2005 at 03:53:45PM -0500, Paul Nowoczynski wrote:
> Hi,
> 
> >
> > I need some clarification on this.  I have tried it both ways and have
> > had a little better luck with having /var/lib/nfs be separate on the two
> > servers than having it be a symlink to the shared filesystem.  I have not
> > seen EPERM errors after failover, but I do sometimes have NFS filesystems
> > show up in 'df' on some clients as having some huge number of free blocks
> > (which I assumed was a misinterpretation of -1) until I do "exportfs -r"
> > on the new active server.  Is that an EPERM error? It usually is not on
> > all clients, just some of the ones in the netgroup that the filesystem is
> > exported-to (and I've even seen it when explicitly listing all the hosts
> > in /etc/exports).  I'm quite sure I saw these problems with /var/lib/nfs
> > as a symlink to the shared fileserver or not.  If I make /var/lib/nfs
> > a symlink to the shared filesystem, then it disappears on the standby
> > server and 'df' hangs there when it gets up to the shared filesystem
> > which I have mounted from the activer server by NFS.
> 
> After some trip-ups, I've got a similar config working very well.
> I've been sharing varlibnfs the entire time and have not seen any
> problems - even after 20 or 30 failovers (hard and soft).  One thing
> I have learned is that running an nfs server on your standby machine
> is a bad idea.  I chkconfig nfs off on both of the machines and let
> heartbeat start nfsd after it has mounted drbd.  If the machine is
> on standby, nfsd is not started at all.  One time, after a machine was
> rebuilt, nfsd was on be default and failovers ceased to work at all.

I'm not running nfsd on the standby server either.  I was only talking
about acting as an NFS client.

> I have not been mounting the ha share on the failover cluster nodes,
> but I don't think that would be a bad thing - unless the nfs client
> translates the vip to the loopback.  Are you explicitly mounting the
> vip via the nfs client or the real ip?  I'd recommend that you don't allow
> any non-vital processes on your failover cluster.

I'm mounting the filesystem via the virtual IP address so it can continue
to work before & after a failover.

> paul

- Dave

> >
> > Is it in general a Very Bad Thing to NFS-mount the shared filesystem on
> > the HA-NFS servers?  I have never seen anybody explicitly state that,
> > although I'm beginning to come to that conclusion.  I've had problems with
> > fuser hanging on failover, and even after avoiding that I still sometimes
> > see hangs on shutdown that I'm quite sure are related to operations
> > attempting to access the non-responding NFS mountpoint.  In my case the
> > shared filesystem holds almost everybody's home directories so it's rather
> > a pain to not be able to access them on the standby shared file server.
> > I need to allow people to log in to the active server so I'd have to
> > have their home directories be set up there to be symlinks directly to
> > the mounted filesystem (because that's how we do CVS accesses to avoid
> > problems with CVS over NFS), but that means that when a failover happens
> > every process that is directly accessing the filesystem will get killed
> > which isn't very friendly.
> >
> > - Dave Dykstra
> > _______________________________________________
> > drbd-user mailing list
> > drbd-user at lists.linbit.com
> > http://lists.linbit.com/mailman/listinfo/drbd-user
> >



More information about the drbd-user mailing list