[DRBD-user] 8.3.8 Online Verify Oops on kernel 2.6.34

Roland Friedwagner roland.friedwagner at wu.ac.at
Fri Sep 3 16:11:03 CEST 2010

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hello Michael,

I can confirm this issue.
Our secondary crashed last night after started an online verify via cron.
I had to push The Button...

Found these last messages in syslog:
Sep  2 00:18:01 bach-s52 kernel: block drbd0: Online Verify start sector: 0
Sep  2 00:18:01 bach-s52 kernel: block drbd1: conn( Connected -> VerifyT )
Sep  2 00:18:01 bach-s52 kernel: block drbd1: Online Verify start sector: 0
Sep  2 00:18:04 bach-s52 kernel: Unable to handle kernel NULL pointer dereference at 0000000000000000 RIP:
Sep  2 00:18:04 bach-s52 kernel:  [<ffffffff88440fbf>] :drbd:w_e_end_ov_req+0x29/0x136
Sep  2 00:18:04 bach-s52 kernel: PGD 0
Sep  2 00:18:04 bach-s52 kernel: Oops: 0000 [1] SMP
Sep  2 00:18:04 bach-s52 kernel: last sysfs file: /devices/pci0000:00/0000:00:1c.2/0000:03:00.1/irq

DRBD Version: 8.3.8.1
HW: HP DL380G6 (1 x Xeon X5570)
OS: RHEL 5.5 x86_64
Kernel: 2.6.18-194.11.3.el5 #1 SMP Mon Aug 23 15:51:38 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux

Does not reproduce until now.

Kind Regards,
Roland



DRBD resource configs here:

# drbdsetup 0 show | grep -v _is_default
disk {
        on-io-error             detach;
        no-disk-barrier ;
        no-disk-flushes ;
        no-md-flushes   ;
}
net {
        max-epoch-size          20000;
        max-buffers             32000;
        unplug-watermark        16;
        ko-count                6;
        allow-two-primaries;
        cram-hmac-alg           "sha1";
        shared-secret           "39urXnII331";
        after-sb-0pri           discard-zero-changes;
        after-sb-1pri           discard-secondary;
        data-integrity-alg      "md5";
}
syncer {
        rate                    33792k; # bytes/second
        al-extents              3389;
        al-extents              3389;
        csums-alg               "md5";
        verify-alg              "md5";
        cpu-mask                "255";
}
protocol C;
_this_host {
        device                  minor 0;
        disk                    "/dev/vg_drbdsec_raid10/lv_drbd_disk_01";
        meta-disk               internal;
        address                 ipv4 10.0.0.2:7789;
}
_remote_host {
        address                 ipv4 10.0.0.1:7789;
}
# drbdsetup 1 show | grep -v _is_default
disk {  
        on-io-error             detach;
        no-disk-barrier ;
        no-disk-flushes ;
        no-md-flushes   ;
}
net {   
        max-epoch-size          20000;
        max-buffers             32000;
        unplug-watermark        16;
        ko-count                6;
        allow-two-primaries;
        cram-hmac-alg           "sha1";
        shared-secret           "39urXnII332";
        after-sb-0pri           discard-zero-changes;
        after-sb-1pri           discard-secondary;
        data-integrity-alg      "md5";
}
syncer {
        rate                    33792k; # bytes/second
        after                   0;
        al-extents              3389;
        csums-alg               "md5";
        verify-alg              "md5";
        cpu-mask                "255";
}
protocol C;
_this_host {
        device                  minor 1;
        disk                    "/dev/vg_drbdsec_raid6/lv_drbd_disk_02";
        meta-disk               internal;
        address                 ipv4 10.0.0.2:7790;
}
_remote_host {
        address                 ipv4 10.0.0.1:7790;
}


Am Sonntag 04 Juli 2010 schrieben Sie:
> Hi All,
>
> I've installed ( as always ) from source 8.3.8
> All seems to work fine but when I've tried to start online-verify -
> I"ve got oops on secondary node:
> Is anyone having that issues?
>
> Jul  5 07:38:45 vhost2 kernel: block drbd3: conn( Connected ->
> VerifyT ) Jul  5 07:38:45 vhost2 kernel: block drbd3: Online Verify
> start sector: 0 Jul  5 07:38:46 vhost2 kernel: BUG: unable to handle
> kernel NULL pointer dereference at 0000000000000030
> Jul  5 07:38:46 vhost2 kernel: IP: [<ffffffffa029ce5f>]
> w_e_end_ov_req+0x36/0x154 [drbd]
> Jul  5 07:38:46 vhost2 kernel: PGD 41aa0d067 PUD 4178f8067 PMD 0
> Jul  5 07:38:46 vhost2 kernel: Oops: 0000 [#1] SMP
> Jul  5 07:38:46 vhost2 kernel: last sysfs file:
> /sys/module/drbd/parameters/cn_idx
> Jul  5 07:38:46 vhost2 kernel: CPU 3
> Jul  5 07:38:46 vhost2 kernel: Modules linked in: ppdev vmnet(P)
> parport_pc parport vmmon(P) drbd ext4 jbd2 crc16 crc32c ac battery cn
> coretemp w83627hf w83793 hwmon_vid bonding loop rtc_cmos rtc_core
> i2c_i801 iTCO_wdt i5000_edac serio_raw ioatdma container rtc_lib
> i2c_core e1000e tpm_tis tpm tpm_bios dca edac_core shpchp pci_hotplug
> psmouse pcspkr i5k_amb button processor evdev ext3 jbd mbcache
> dm_mirror dm_region_hash dm_log dm_snapshot dm_mod raid456 async_pq
> async_xor xor async_memcpy async_raid6_recov raid6_pq async_tx raid10
> raid1 md_mod ide_cd_mod cdrom sd_mod pata_acpi ata_generic ata_piix
> ide_pci_generic ahci sata_mv piix ehci_hcd ide_core libata scsi_mod
> uhci_hcd floppy thermal fan [last unloaded: vmnet]
> Jul  5 07:38:46 vhost2 kernel:
> Jul  5 07:38:46 vhost2 kernel: Pid: 7314, comm: drbd3_worker Tainted:
> P        W  2.6.34-vs2.3.0.36.30.4.pre8 #2 X7DB8/X7DB8
> Jul  5 07:38:46 vhost2 kernel: RIP: 0010:[<ffffffffa029ce5f>]
> [<ffffffffa029ce5f>] w_e_end_ov_req+0x36/0x154 [drbd]
> Jul  5 07:38:46 vhost2 kernel: RSP: 0000:ffff8804131c5e20  EFLAGS:
> 00010202 Jul  5 07:38:46 vhost2 kernel: RAX: 0000000000000008 RBX:
> ffff88042b09c9c0 RCX: ffff88036b08c430
> Jul  5 07:38:46 vhost2 kernel: RDX: 0000000000000000 RSI:
> 0000000000000010 RDI: ffff880416900000
> Jul  5 07:38:46 vhost2 kernel: RBP: ffff880416900000 R08:
> 0000000000000004 R09: ffff8800016cd1c0
> Jul  5 07:38:46 vhost2 kernel: R10: 0000000000000001 R11:
> ffffffff8126109b R12: ffff880416900138
> Jul  5 07:38:46 vhost2 kernel: R13: ffff8804169006b8 R14:
> ffff88036b08c430 R15: ffff880416900118
> Jul  5 07:38:46 vhost2 kernel: FS:  0000000000000000(0000)
> GS:ffff8800016c0000(0000) knlGS:0000000000000000
> Jul  5 07:38:46 vhost2 kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
> 000000008005003b
> Jul  5 07:38:46 vhost2 kernel: CR2: 0000000000000030 CR3:
> 00000004179ed000 CR4: 00000000000006e0
> Jul  5 07:38:46 vhost2 kernel: DR0: 0000000000000000 DR1:
> 0000000000000000 DR2: 0000000000000000
> Jul  5 07:38:46 vhost2 kernel: DR3: 0000000000000000 DR6:
> 00000000ffff0ff0 DR7: 0000000000000400
> Jul  5 07:38:46 vhost2 kernel: Process drbd3_worker (pid: 7314,
> threadinfo ffff8804131c4000, task ffff88042b09c9c0)
> Jul  5 07:38:46 vhost2 kernel: Stack:
> Jul  5 07:38:46 vhost2 kernel:  ffff880416900118 ffff88042b09c9c0
> ffff880416900000 ffff880416900138
> Jul  5 07:38:46 vhost2 kernel: <0> ffff8804169006b8 0000000000000000
> ffff880416900118 ffffffffa029c0f8
> Jul  5 07:38:46 vhost2 kernel: <0> 0000000101fbdd14 ffffffff81040c79
> ffff88042e4dc000 ffff88042b09c9c0
> Jul  5 07:38:46 vhost2 kernel: Call Trace:
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffffa029c0f8>] ?
> drbd_worker+0x284/0x4b4 [drbd]
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffff81040c79>] ?
> del_timer_sync+0xc/0x16
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffff81040c83>] ?
> process_timeout+0x0/0x5
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffffa02b63b7>] ?
> drbd_thread_setup+0x166/0x227 [drbd]
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffff810036d4>] ?
> kernel_thread_helper+0x4/0x10
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffffa02b6251>] ?
> drbd_thread_setup+0x0/0x227 [drbd]
> Jul  5 07:38:46 vhost2 kernel:  [<ffffffff810036d0>] ?
> kernel_thread_helper+0x0/0x10
> Jul  5 07:38:46 vhost2 kernel: Code: 55 48 89 fd 53 48 83 ec 08 85 d2
> 0f 85 c9 00 00 00 f6 46 48 10 0f 85 bf 00 00 00 48 8b 87 60 06 00 00
> be 10 00 00 00 48 83 c0 08 <8b> 58 28 48 63 fb e8 29 51 e3 e0 48 85
> c0 49 89 c5 0f 84 98 00
> Jul  5 07:38:46 vhost2 kernel: RIP  [<ffffffffa029ce5f>]
> w_e_end_ov_req+0x36/0x154 [drbd]
> Jul  5 07:38:46 vhost2 kernel:  RSP <ffff8804131c5e20>
> Jul  5 07:38:46 vhost2 kernel: CR2: 0000000000000030
> Jul  5 07:38:46 vhost2 kernel: ---[ end trace 11e0b3ada4e13922 ]---



-- 
Roland.Friedwagner at wu.ac.at            Phone: +43 1 31336 5377
IT Services - WU (Vienna University of Economics and Business) 



More information about the drbd-user mailing list