[DRBD-user] SOLVED : DRBD stalled connection / state mismatch between primary and secondary

Support WVNET hilfe at wvnet.at
Thu May 12 00:31:15 CEST 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


>I'm not able to re-sync my primary and secondary storage server .
>Both servers are identical , Setup looks like this
>
>-System: Slackware 13.1 64bit , kernel 2.6.33.12 , OFED 1.5.2 Stack , drdb
>8.3.10 from source
>-Storage:
>	Adaptec raid controller 52445 ( with BBU ) , 24x SAS
>	raid Partitions
>	drbd
>	scst ib srpt_target ( vdisk blockio )
>-Replication-Link: IPoIB Interface
>-Cluster: 
>	Pacemaker 1.1.4
>  	Corosync 1.3.0
>	2 Communication Links ( 1x crossover Gigbit ethernet , 1x IPoIB Link
>
>State on the secondary node : ( storage-node-b )
>---------------------------------------------------------
>version: 8.3.10 (api:88/proto:86-96)
>GIT-hash: 5c0b0469666682443d4785d90a2c603378f9017b build by
root at storage-node-b.cluster.lokal, 2011-05-05 12:48:19
>
>1: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
>    ns:0 nr:1175532 dw:1175532 dr:0 al:0 bm:8386 lo:0 pe:0 ua:0 ap:0 ep:1
wo:n oos:0
>
>10: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
>    ns:0 nr:480569816 dw:480569816 dr:0 al:0 bm:29452 lo:0 pe:0 ua:0 ap:0
>ep:1 wo:n oos:0
>
>13: cs:Unconfigured
>
>State on the primary node : ( storage-node-a )
>------------------------------------------------------
>version: 8.3.10 (api:88/proto:86-96)
>GIT-hash: 5c0b0469666682443d4785d90a2c603378f9017b build by
root at storage-node-a.san.lokal, 2011-04-27 11:33:30
>
> 1: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
>    ns:516 nr:0 dw:2601253 dr:1083196 al:0 bm:23 lo:0 pe:0 ua:0 ap:0 ep:1
wo:n oos:635196
>
>10: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r-----
>    ns:480359772 nr:0 dw:5831668 dr:483513740 al:0 bm:29323 lo:0 pe:0 ua:0
ap:0 ep:1 wo:n oos:210044
>        [===================>] sync'ed:100.0% (204/469100)M
>        finish: 3:41:03 speed: 12 (3,812) K/sec (stalled)
>11: cs:WFConnection ro:Primary/Unknown ds:UpToDate/Outdated C r-----
>    ns:0 nr:0 dw:38456751 dr:62544271 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1
wo:n oos:16370960
>12: cs:WFConnection ro:Primary/Unknown ds:UpToDate/Outdated C r-----
>    ns:0 nr:0 dw:9660832 dr:7130755 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b
>oos:2684524
>13: cs:WFConnection ro:Primary/Unknown ds:UpToDate/Inconsistent C r-----
>    ns:10052 nr:0 dw:14567958 dr:81131454 al:0 bm:3319 lo:0 pe:0 ua:0 ap:0
>ep:1 wo:n oos:3465252

Track down problem to scst vdisk_blockio .

Switchted from vdisk_blockio to vdisk_fileio and now everything is working
fine

-Steve




More information about the drbd-user mailing list