Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
drbd-user-bounces at lists.linbit.com wrote on 03/18/2010 03:27:10 PM: > From: "Matt Graham" <danceswithcrows at usa.net> > To: <drbd-user at lists.linbit.com> > Date: 03/18/2010 03:33 PM > Subject: Re: [DRBD-user] DRBD module won't load > Sent by: drbd-user-bounces at lists.linbit.com > > From: Wood.Chris at tatravelcenters.com > > I'm trying to start drbd on an Oracle Virtual Server (Xen) machine. > > Starting DRBD resources: Can not load the drbd module. > > $ rpm -qa|grep drbd > > drbd-8.3.6-1.el5 > > kmod-drbd-8.0.16-5.el5_3 > > drbd 8.3.6, drbd kernel module 8.0.16. The two really should match, no? > And which /lib/modules/2.6.* dir did the module get installed in? If it > got put in the wrong dir (possible) then modprobe won't find it. Where > did the kernel module RPM come from? It has to be built against the > kernel that's running, otherwise it won't work. > > > Any help would be much appreciated - do I have to build the module from > > scratch? > > Building the kernel module from source should be pretty easy; just go > into the drbd source dir and do "make rpm KDIR=/usr/src/kernels/2.6.18-blah" > and you should get RPMs for userland and kernelspace drbd components. > Not applicable to very recent kernels since drbd is now in the vanilla > kernel source, but CentOS 5 doesn't have a recent kernel. Ok, I uninstalled all the other packages... so I have no idea if I'll be able to get this to work with heartbeat and openais... but I built the 8.3.4 versions and installed on both hosts. I set up dedicated interfaces on each, they are directly connected. I opened the firewall for that interface. I can ping each server from the other over this link, although the xenbr1 interface is bridging it - I don't really want that, but can't find anything on how to disable xen from grabbing it... so I can ping each server, but when I start drbd on each node, it just waits and waits and waits for the other node to come up... Here's what I see in the log on server0 Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: initialized. Version: 8.3.4 (api:88/proto:86-91) Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by root at admin-lab-ovs0, 2010-03-18 11:28:37 Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: registered as block device major 147 Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: minor_table @ 0xdf50c0c0 Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Starting worker thread (from cqueue/0 [120]) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: disk( Diskless -> Attaching ) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: No usable activity log found. Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Method to ensure write ordering: barrier Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: max_segment_size ( = BIO size ) = 32768 Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: drbd_bm_resize called with capacity == 1942729272 Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: resync bitmap: bits=242841159 words=7588788 Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: size = 926 GB (971364636 KB) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: recounting of set bits took additional 15 jiffies Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: 926 GB (242841159 bits) marked out-of-sync by on disk bit-map. Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: disk( Attaching -> Inconsistent ) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: conn( StandAlone -> Unconnected ) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Starting receiver thread (from drbd0_worker [2871]) Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: receiver (re)started Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: conn( Unconnected -> WFConnection ) Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: Handshake successful: Agreed network protocol version 91 Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: conn( WFConnection -> WFReportParams ) Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: Starting asender thread (from drbd0_receiver [2881]) Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: data-integrity-alg: <not-used> Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: drbd_sync_handshake: Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: self 0000000000000004:0000000000000000:0000000000000000:0000000000000000 bits:242841159 flags:0 Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: peer 0000000000000004:0000000000000000:0000000000000000:0000000000000000 bits:242841159 flags:0 Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: uuid_compare()=0 by rule 10 Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: No resync, but 242841159 bits in bitmap! Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> Connected ) pdsk( DUnknown -> Inconsistent ) Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: peer( Secondary -> Unknown ) conn( Connected -> Disconnecting ) Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: short read expecting header on sock: r=-512 Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: asender terminated Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Terminating asender thread Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Connection closed Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( Disconnecting -> StandAlone ) Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: receiver terminated Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Terminating receiver thread Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( StandAlone -> Unconnected ) Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Starting receiver thread (from drbd0_worker [2871]) Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: receiver (re)started Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( Unconnected -> WFConnection ) [root at admin-lab-ovs0 cwood]# cat /proc/drbd version: 8.3.4 (api:88/proto:86-91) GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by root at admin-lab-ovs0, 2010-03-18 11:28:37 0: cs:WFConnection ro:Secondary/Unknown ds:Inconsistent/Inconsistent C r---- ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:971364636 And server1 Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: initialized. Version: 8.3.4 (api:88/proto:86-91) Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by root at admin-lab-ovs0, 2010-03-18 11:28:37 Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: registered as block device major 147 Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: minor_table @ 0xf718b6c0 Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Starting worker thread (from cqueue/0 [120]) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: disk( Diskless -> Attaching ) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: No usable activity log found. Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Method to ensure write ordering: barrier Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: max_segment_size ( = BIO size ) = 32768 Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: drbd_bm_resize called with capacity == 1942729272 Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: resync bitmap: bits=242841159 words=7588788 Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: size = 926 GB (971364636 KB) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: recounting of set bits took additional 18 jiffies Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: 926 GB (242841159 bits) marked out-of-sync by on disk bit-map. Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: disk( Attaching -> Inconsistent ) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: conn( StandAlone -> Unconnected ) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Starting receiver thread (from drbd0_worker [3206]) Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: receiver (re)started Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: conn( Unconnected -> WFConnection ) Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: Handshake successful: Agreed network protocol version 91 Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: conn( WFConnection -> WFReportParams ) Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: Starting asender thread (from drbd0_receiver [3216]) Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: data-integrity-alg: <not-used> Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: drbd_sync_handshake: Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: self 0000000000000004:0000000000000000:0000000000000000:0000000000000000 bits:242841159 flags:0 Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: peer 0000000000000004:0000000000000000:0000000000000000:0000000000000000 bits:242841159 flags:0 Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: uuid_compare()=0 by rule 10 Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: No resync, but 242841159 bits in bitmap! Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> Connected ) pdsk( DUnknown -> Inconsistent ) Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: peer( Secondary -> Unknown ) conn( Connected -> TearDown ) Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: asender terminated Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Terminating asender thread Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Connection closed Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: conn( TearDown -> Unconnected ) Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: receiver terminated Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Restarting receiver thread Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: receiver (re)started Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: conn( Unconnected -> WFConnection ) [root at admin-lab-ovs1 cwood]# cat /proc/drbd version: 8.3.4 (api:88/proto:86-91) GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by root at admin-lab-ovs0, 2010-03-18 11:28:37 0: cs:WFConnection ro:Secondary/Unknown ds:Inconsistent/Inconsistent C r---- ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:971364636