Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi, I am having a problem with two DRBD machines.These machines are exactly the same in hardware and software, and are both running SUSE Linux(the kernel is 2.6.16.60-0.21-bigsmp) and DRBD 0.7.24. The primary node is "ATCAX86_F0S5" and the secondary node is "ATCAX86_F0S9".When I detach the DRBD resource on Primary node and then copy a file to the DRBD device,the linux is panic and then restart. First, I do "drbdadm primary all" on the Primary node,and the state of drbd is: ATCAX86_F0S5:/ # cat /proc/drbd version: 0.7.24 (api:79/proto:74) SVN Revision: 2875 build by root at ccf01, 2008-10-24 06:23:46 0: cs:Unconfigured 1: cs:DiskLessClient st:Primary/Secondary ld:Consistent ns:68 nr:0 dw:68 dr:172 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 And then,I copy a file to the DRBD device of primary node.Then the OS of primary node is panic,and then restart.The system message is: Mar 31 10:07:34 ATCAX86_F0S4 kernel: ReiserFS: drbd1: Using r5 hash to sort names Mar 31 10:12:24 ATCAX86_F0S4 kernel: Unable to handle kernel NULL pointer dereference at virtual address 0000001e Mar 31 10:12:24 ATCAX86_F0S4 kernel: printing eip: Mar 31 10:12:24 ATCAX86_F0S4 kernel: f92a3d25 Mar 31 10:12:24 ATCAX86_F0S4 kernel: *pde = 35c54001 Mar 31 10:12:24 ATCAX86_F0S4 kernel: Oops: 0000 [#1] Mar 31 10:12:25 ATCAX86_F0S4 syslog-ng[3184]: Changing permissions on special file /dev/console Mar 31 10:12:24 ATCAX86_F0S4 kernel: SMP Mar 31 10:12:24 ATCAX86_F0S4 kernel: last sysfs file: /devices/pci0000:00/0000:00:00.0/irq Mar 31 10:12:24 ATCAX86_F0S4 kernel: Modules linked in: drbd kbox_V100R001C01B003_20090211114813_24945 pmcint dpukernel_V100R003C01B612_20090312062515_21878 mcenonfatal_V100R003C02B015_20090113102406_26186 mchesb_V100R003C02B015_20090113102413_4859 lpcbios_V100R003C02B015_20090113102412_13838 nfsd exportfs lockd nfs_acl sunrpc ipv6 gab af_packet llt intermodule ipmi_watchdog ipmi_si ipmi_devintf ipmi_poweroff ipmi_msghandler tg3 e1000 dock button battery ac loop dm_mod usbhid i2c_i801 i2c_core mptctl qla2xxx uhci_hcd ehci_hcd firmware_class usbcore scsi_transport_fc reiserfs ext3 jbd mppVhba edd fan thermal processor mptsas mptscsih mptbase scsi_transport_sas ata_piix libata mppUpper sg sd_mod scsi_mod Mar 31 10:12:24 ATCAX86_F0S4 kernel: CPU: 0 Mar 31 10:12:24 ATCAX86_F0S4 kernel: EIP: 0060:[<f92a3d25>] Tainted: PF U VLI Mar 31 10:12:24 ATCAX86_F0S4 kernel: EFLAGS: 00010202 (2.6.16.60-0.21-bigsmp #1) Mar 31 10:12:24 ATCAX86_F0S4 kernel: EIP is at drbd_send_dblock+0x1c9/0x36a [drbd] Mar 31 10:12:24 ATCAX86_F0S4 kernel: eax: 00000001 ebx: f4ad1904 ecx: 00008000 edx: 00000000 Mar 31 10:12:24 ATCAX86_F0S4 kernel: esi: c593e000 edi: f4ad1528 ebp: f61935a4 esp: c593fbcc Mar 31 10:12:24 ATCAX86_F0S4 kernel: ds: 007b es: 007b ss: 0068 Mar 31 10:12:24 ATCAX86_F0S4 kernel: Process pdflush (pid: 212, threadinfo=c593e000 task=c593d6b0) Mar 31 10:12:25 ATCAX86_F0S4 kernel: Stack: <0>00000000 00000000 00000001 00000082 ffffffff ffffffff 67027483 10100000 Mar 31 10:12:25 ATCAX86_F0S4 kernel: 00000000 00400400 f61935a4 00000000 f7445bec f110b0c0 00000001 00000000 Mar 31 10:12:25 ATCAX86_F0S4 kernel: f929de40 00001000 00000001 f4ad1528 01e39800 f61935a4 0000000a 00000000 Mar 31 10:12:25 ATCAX86_F0S4 kernel: Call Trace: Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f929de40>] drbd_make_request_common+0x727/0x961 [drbd] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f929e2bd>] drbd_make_request_26+0x1cc/0x1d5 [drbd] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01bdf34>] generic_make_request+0x29c/0x2ac Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01c02d2>] blk_do_ordered+0x18c/0x2b6 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c011b486>] find_busiest_group+0x13c/0x2fa Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c014b20d>] mempool_alloc+0x28/0xc5 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01bfa5a>] submit_bio+0xa6/0xad Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c016b144>] bio_alloc_bioset+0xb2/0x117 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01680e0>] submit_bh+0xe3/0x101 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a21277>] write_ordered_chunk+0x47/0x6d [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a2164b>] write_ordered_buffers+0x1a7/0x28c [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c0148610>] find_get_page+0x18/0x47 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01683e3>] __find_get_block_slow+0x10b/0x115 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c0168730>] __find_get_block+0x185/0x18f Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a219a2>] flush_commit_list+0x190/0x5a6 [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a247b3>] do_journal_end+0xbe5/0xc31 [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c011b486>] find_busiest_group+0x13c/0x2fa Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a24865>] journal_end_sync+0x66/0x6b [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<f8a15e97>] reiserfs_sync_fs+0x32/0x54 [reiserfs] Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c016cd5a>] sync_supers+0x72/0xd2 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c014dc92>] wb_kupdate+0x2a/0xf4 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c014e58e>] pdflush+0x116/0x1ad Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c014dc68>] wb_kupdate+0x0/0xf4 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c013436f>] kthread+0xca/0xf7 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c014e478>] pdflush+0x0/0x1ad Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c01342a5>] kthread+0x0/0xf7 Mar 31 10:12:25 ATCAX86_F0S4 kernel: [<c0102005>] kernel_thread_helper+0x5/0xb Mar 31 10:12:25 ATCAX86_F0S4 kernel: Code: ff 89 44 24 10 89 f0 e8 e1 6c e7 c6 5e 58 83 7c 24 08 00 74 34 b9 01 00 00 00 ba 10 00 00 00 89 e8 e8 23 a8 ff ff eb 21 8b 55 28 <0f> b7 42 1e 6b c0 0c 03 42 34 8b 48 08 8b 10 ff 70 04 89 f8 e8 Mar 31 10:15:19 ATCAX86_F0S4 syslog-ng[3209]: syslog-ng version 1.6.8 starting Could you tell me why this problem happen? How to solve it? Thanks, Jason ### /etc/drbd.conf ### # # please have a a look at the example configuration file in # /usr/share/doc/packages/drbd.conf # resource r0 { protocol C; startup { wfc-timeout 20; degr-wfc-timeout 60; } syncer { rate 50M; } on ATCAX86_F0S5 { device /dev/drbd1; disk /dev/sda5; address 172.30.128.125:7789; meta-disk internal; } on ATCAX86_F0S9 { device /dev/drbd1; disk /dev/sda5; address 172.30.128.131:7789; meta-disk internal; } } -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20090401/59a7e7a8/attachment.htm>