[DRBD-user] Pacemaker - DRBD fails on node every couple hours

Lars Ellenberg lars.ellenberg at linbit.com
Mon Feb 27 17:40:02 CET 2012

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Mon, Feb 27, 2012 at 05:15:29PM +0100, Christoph Roethlisberger wrote:
> We use a simple 2node active-passive cluster with DRBD and NFS services.
> 
> Right now the cluster monitor detects a drbr failure every couple
> hours (~ 2-40) and will fail over.
> syslog shows the following lines just before pacepaker initiates the
> failover:
> 
> --------------------------------------
> Feb 24 20:55:54 drbdnode1 lrmd: [1659]: info: RA output:
> (p_drbd_r0:0:monitor:stderr) <1>error creating netlink socket
> Feb 24 20:55:54 drbdnode1 lrmd: [1659]: info: RA output:
> (p_drbd_r0:0:monitor:stderr) Could not connect to 'drbd' generic
> netlink family


Check that you really have loaded the DRBD 8.4.1 kernel module.

My guess is that you have some drbd 8.3 module loaded.

find /lib/modules/`uname -r` -name "drbd.ko"

You probably have more than one.
Make sure you load the one you want.


-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com



More information about the drbd-user mailing list