[DRBD-user] RHEL Kernel Crash/Panic with DRBD 0.8pre3 ...

Sébastien CRAMATTE s.cramatte at wanadoo.fr
Wed Jul 12 17:03:28 CEST 2006

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hello,

yesterday I've tested  drbd 0.8pre3  on a debian server with an RHEL 
2.6  kernel  + freevps  patch (see www.freevps.com)
meta-disk was created  correctly  using this conf  but when I've 
started  /etc/init.d/drbd   my server have crashed with kernel panic error !

the IP 192.168.0.10  is installed on  tagged VLAN   (ID 500).  I've 
patched  drbd_receiver.c   sock_create  functions.  Redhat/Fedora kernel 
seem that require 5 arguments instead of 4 in a vanilla Kernel.

--------drbd.conf ---------
common {
  syncer { rate 10M; }
}

resource r0 {
  protocol C;

  on appli1 {
    address 192.168.0.10:7790;
    disk /dev/mapper/volume1-backup;
    device /dev/drbd0;
    meta-disk "internal";
    }

  on appli2 {
    address 192.168.0.11:7791;
    disk /dev/mapper/volume1-backup;
    device /dev/drbd0;
    meta-disk "internal";
  }
}

----------

----- drbd crash with this infos ----

Jul 11 19:39:13 appli1 kernel: drbd0: disk( Diskless -> Attaching )
Jul 11 19:39:13 appli1 kernel: drbd0: No usable activity log found.
Jul 11 19:39:13 appli1 kernel: drbd0: drbd_bm_resize called with 
capacity == 204793672
Jul 11 19:39:13 appli1 kernel: drbd0: resync bitmap: bits=25599209 
words=799976
Jul 11 19:39:13 appli1 kernel: drbd0: size = 97 GB (102396836 KB)
Jul 11 19:39:13 appli1 kernel: drbd0: reading of bitmap took 94 jiffies
Jul 11 19:39:13 appli1 kernel: drbd0: recounting of set bits took 
additional 4 jiffies
Jul 11 19:39:13 appli1 kernel: drbd0: 97 GB marked out-of-sync by on 
disk bit-map.
Jul 11 19:39:13 appli1 kernel: drbd0: disk( Attaching -> Inconsistent )
Jul 11 19:39:13 appli1 kernel: drbd0: Writing meta data super block now.
Jul 11 19:39:13 appli1 kernel: drbd0: conn( StandAlone -> Unconnected )
Jul 11 19:39:13 appli1 kernel: drbd0: conn( Unconnected -> WFConnection )
Jul 11 19:39:13 appli1 kernel: Unable to handle kernel paging request at 
virtual address 00001416
Jul 11 19:39:13 appli1 kernel: printing eip:
Jul 11 19:39:13 appli1 kernel: c028bc4d
Jul 11 19:39:13 appli1 kernel: *pde = 32308001
Jul 11 19:39:13 appli1 kernel: Oops: 0002 [#1]
Jul 11 19:39:13 appli1 kernel: SMP
Jul 11 19:39:13 appli1 kernel: Modules linked in: ipt_limit(U) 
ipt_state(U) e752x_edac(U) edac_mc(U) dm_mod(U) drbd(U) 
ip_conntrack_ftp(U) ip_conntrack_tftp(U) ip_conntrack(U) 
iptable_filter(U) ip_tables(U) bonding(U) 8021q(U)
Jul 11 19:39:13 appli1 kernel: CPU:    1
Jul 11 19:39:13 appli1 kernel: EIP:    0060:[<c028bc4d>]    Not tainted VLI
Jul 11 19:39:13 appli1 kernel: EFLAGS: 00010286   (2.6.9-prep-tl)
Jul 11 19:39:13 appli1 kernel: EIP is at sock_alloc+0x4c/0x69
Jul 11 19:39:13 appli1 kernel: eax: 00000000   ebx: f2e9a3ac   ecx: 
f2e9a380   edx: f5c26000
Jul 11 19:39:13 appli1 kernel: esi: 00000002   edi: 00000000   ebp: 
00000002   esp: f5c26f58
Jul 11 19:39:13 appli1 kernel: ds: 007b   es: 007b   ss: 0068
Jul 11 19:39:13 appli1 kernel: Process drbd0_receiver (pid: 3816, 
threadinfo=f5c26000 task=f4f809a0)
Jul 11 19:39:13 appli1 kernel: Stack: c0365460 00000001 c028c8d7 
f6744000 00000000 00000000 f6744000 c028ca20
Jul 11 19:39:13 appli1 kernel: f5c26f90 00000000 00000000 f92c930c 
f5c26f90 00000000 ffffffff 0000000a
Jul 11 19:39:13 appli1 kernel: 0b07f442 f4f80bbe f6744000 f6744000 
00000000 00000000 00000000 f92c9646
Jul 11 19:39:13 appli1 kernel: Call Trace:
Jul 11 19:39:13 appli1 kernel: [<c028c8d7>] __sock_create+0xc8/0x202
Jul 11 19:39:13 appli1 kernel: [<c028ca20>] sock_create+0xf/0x13
Jul 11 19:39:13 appli1 kernel: [<f92c930c>] drbd_try_connect+0x21/0x141 
[drbd]
Jul 11 19:39:13 appli1 kernel: [<f92c9646>] drbd_connect+0x9d/0x450 [drbd]
Jul 11 19:39:13 appli1 kernel: [<f92ce5fc>] drbdd_init+0x41/0x18d [drbd]
Jul 11 19:39:13 appli1 kernel: [<f92d4c7c>] drbd_thread_setup+0xa4/0xd5 
[drbd]
Jul 11 19:39:13 appli1 kernel: [<f92d4bd8>] drbd_thread_setup+0x0/0xd5 
[drbd]
Jul 11 19:39:13 appli1 kernel: [<c01051f5>] kernel_thread_helper+0x5/0xb
Jul 11 19:39:13 appli1 kernel: Code: 66 01 ba 00 f0 ff ff 21 e2 8d 4b d4 
8b 02 8b 80 88 01 00 00 89 43 28 8b 02 8b 80 98 01 00 00 89 b3 40 01 00 
00 89 43 2c 89 71 28 <f0> ff 86 14 14 00 00 8b 52 10 b8 80 cb 3e c0 03 
04 95 20 51 3f
Jul 11 19:39:13 appli1 kernel: <0>Fatal exception: panic in 5 seconds

----- my redhat  drbd_receiver.c  patch  ----
--- drbd-8.0pre3/drbd/drbd_receiver.c   2006-07-11 16:41:54.000000000 +0000
+++ drbd-8.0pre3-patched/drbd/drbd_receiver.c   2006-07-11 
16:19:40.000000000 +0000
@@ -428,7 +428,7 @@
       if (err)
               goto out;

-       if (sock_create(PF_INET, SOCK_STREAM, IPPROTO_TCP, &newsock))
+       if (sock_create(PF_INET, SOCK_STREAM, IPPROTO_TCP, &newsock, 0))
               goto out;

       newsock->type = sock->type;
@@ -547,7 +547,7 @@
       struct socket *sock;
       struct sockaddr_in src_in;

-       err = sock_create(AF_INET, SOCK_STREAM, 0, &sock);
+       err = sock_create(AF_INET, SOCK_STREAM, 0, &sock, 0);
       if (err) {
               ERR("sock_creat(..)=%d\n", err);
               return NULL;
@@ -596,7 +596,7 @@
       int err;
       struct socket *sock,*sock2;

-       err = sock_create(AF_INET, SOCK_STREAM, 0, &sock2);
+       err = sock_create(AF_INET, SOCK_STREAM, 0, &sock2, 0);
       if (err) {
               ERR("sock_creat(..)=%d\n", err);
               return NULL;










More information about the drbd-user mailing list