[DRBD-user] DRBDmanage (re)initialization

Julien Escario escario at azylog.net
Mon Jun 12 09:39:08 CEST 2017


On 09/06/2017 at 14:24, Julien Escario wrote:
> On 09/06/2017 at 09:59, Robert Altnoeder wrote:
>> On 06/08/2017 04:14 PM, Julien Escario wrote:
>>> Hello,
>>> A drbdmanage cluster is currently stuck in this state:
>>> .drbdctrl role:Secondary
>>>   volume:0 disk:UpToDate
>>>   volume:1 disk:UpToDate
>>>   vm4 connection:NetworkFailure
>>>   vm7 role:Secondary
>>>     volume:0 replication:WFBitMapS peer-disk:Inconsistent
>>>     volume:1 peer-disk:Outdated
>>> [...]
>>> Is there any way to restart this resource without losing all the other resources?
>> on vm4 and vm7, try 'drbdadm down .drbdctrl' followed by 'drbdadm up
>> .drbdctrl'.
>> In most cases, it just reconnects and fixes itself.

[Sorry for the double post]

In the end, I was able to fully restore vm4 and vm5 (drbdsetup and drbdmanage
both work again), but not vm7.

I did that by firewalling port 6999 (the port used by the .drbdctrl resource) and
issuing a down/up of .drbdctrl on vm4 and vm5.
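
For readers who want to reproduce that step, here is a minimal Python sketch of
one isolate / down / up / restore cycle. The message does not give the exact
firewall rules or say when they were removed, so the iptables rules (dropping
TCP traffic to port 6999 in both chains) and the step order are assumptions.
The drbdadm commands are the ones suggested earlier in the thread. By default
the script only prints the commands.

```python
import subprocess

CTRL_PORT = 6999        # port of the .drbdctrl resource, as stated in the message
RESOURCE = ".drbdctrl"


def reconnect_commands(port=CTRL_PORT, resource=RESOURCE):
    """Return the argv lists for one isolate / down / up / restore cycle.

    Assumption: the control volume is isolated by dropping TCP packets whose
    destination port is `port`, in both the INPUT and OUTPUT chains.
    """
    rule = ["-p", "tcp", "--dport", str(port), "-j", "DROP"]
    block = [["iptables", "-I", chain] + rule for chain in ("INPUT", "OUTPUT")]
    unblock = [["iptables", "-D", chain] + rule for chain in ("INPUT", "OUTPUT")]
    cycle = [["drbdadm", "down", resource], ["drbdadm", "up", resource]]
    return block + cycle + unblock


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False (needs root)."""
    for argv in commands:
        print(" ".join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    run(reconnect_commands())   # dry run: prints the six commands
```

Keeping the command list separate from the runner makes it easy to review the
plan on each node before passing dry_run=False.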

So far, so good.

It seems that the DRBD layer itself is stuck on vm7. I can't run any
drbdadm/drbdsetup command: they all hang and keep running, and they can't be
killed, either by a timeout or by kill (even -9).
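
A process that ignores kill -9 is often blocked inside the kernel in
uninterruptible sleep (state D). Whether that is the case here is an
assumption, but it is easy to check. The sketch below reads the state letter
from /proc/<pid>/stat and lists every process currently in state D. The
function names are illustrative only.

```python
import os


def proc_state(stat_line):
    """Return the one-letter process state from a /proc/<pid>/stat line."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so the state is the first field after the LAST ')'.
    return stat_line[stat_line.rindex(")") + 2:].split()[0]


def uninterruptible_pids(proc_root="/proc"):
    """Yield (pid, command name) for every process currently in state D."""
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                line = f.read()
        except OSError:
            continue  # the process exited while we were scanning
        if proc_state(line) == "D":
            comm = line[line.index("(") + 1:line.rindex(")")]
            yield int(entry), comm


if __name__ == "__main__":
    for pid, comm in uninterruptible_pids():
        print(pid, comm)
```

If the hung drbdsetup and drbdadm processes show up in this list, signals will
not be delivered to them until the kernel call they are blocked in returns.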

The strace output of 'drbdsetup status' is in the attached file. It seems to hang
while writing to a socket (the last write() never returns), but I'm not really
familiar with strace output.

It would be really nice to get back to normal without a reboot. Any advice?

Thanks for your help,
Julien
-------------- next part --------------
root@vm7:~# strace drbdsetup status
execve("/usr/sbin/drbdsetup", ["drbdsetup", "status"], [/* 15 vars */]) = 0
brk(0)                                  = 0x19c0000
access("/etc/ld.so.nohwcap", F_OK)      = -1 ENOENT (No such file or directory)
mmap(NULL, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f267af3b000
access("/etc/ld.so.preload", R_OK)      = -1 ENOENT (No such file or directory)
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=48549, ...}) = 0
mmap(NULL, 48549, PROT_READ, MAP_PRIVATE, 3, 0) = 0x7f267af2f000
close(3)                                = 0
access("/etc/ld.so.nohwcap", F_OK)      = -1 ENOENT (No such file or directory)
open("/lib/x86_64-linux-gnu/libc.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\3\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0P\34\2\0\0\0\0\0"..., 832) = 832
fstat(3, {st_mode=S_IFREG|0755, st_size=1738176, ...}) = 0
mmap(NULL, 3844640, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7f267a972000
mprotect(0x7f267ab14000, 2093056, PROT_NONE) = 0
mmap(0x7f267ad13000, 24576, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3, 0x1a1000) = 0x7f267ad13000
mmap(0x7f267ad19000, 14880, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -1, 0) = 0x7f267ad19000
close(3)                                = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f267af2e000
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f267af2d000
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f267af2c000
arch_prctl(ARCH_SET_FS, 0x7f267af2d700) = 0
mprotect(0x7f267ad13000, 16384, PROT_READ) = 0
mprotect(0x61a000, 4096, PROT_READ)     = 0
mprotect(0x7f267af3d000, 4096, PROT_READ) = 0
munmap(0x7f267af2f000, 48549)           = 0
chdir("/")                              = 0
stat("/proc/drbd", {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
open("/proc/drbd", O_RDONLY)            = 3
brk(0)                                  = 0x19c0000
brk(0x19e2000)                          = 0x19e2000
read(3, "version: 9.0.3-1 (api:2/proto:86"..., 4095) = 162
close(3)                                = 0
brk(0x19e1000)                          = 0x19e1000
socket(PF_NETLINK, SOCK_DGRAM, NETLINK_GENERIC) = 3
setsockopt(3, SOL_SOCKET, SO_SNDBUF, [1048576], 4) = 0
setsockopt(3, SOL_SOCKET, SO_RCVBUF, [1048576], 4) = 0
bind(3, {sa_family=AF_NETLINK, pid=0, groups=00000000}, 12) = 0
getsockname(3, {sa_family=AF_NETLINK, pid=5040, groups=00000000}, [12]) = 0
write(3, " \0\0\0\20\0\1\0\5D>Y\260\23\0\0\3\2\0\0\t\0\2\0drbd\0\0\0\0", 32
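
The 32 bytes passed to that last write() can be decoded by hand. The sketch
below assumes the standard Linux netlink layout (a 16-byte nlmsghdr, a 4-byte
genlmsghdr, then one attribute) and little-endian byte order, since the trace
comes from an x86_64 host. The constant names are taken from the kernel's
netlink and genetlink headers. The function name is illustrative only.

```python
import struct

# Bytes of the final write(), copied from the trace (strace prints
# non-printable bytes as octal escapes, which Python reads the same way).
MSG = b" \0\0\0\20\0\1\0\5D>Y\260\23\0\0\3\2\0\0\t\0\2\0drbd\0\0\0\0"

GENL_ID_CTRL = 0x10          # message type of the generic netlink controller
NLM_F_REQUEST = 0x01
CTRL_CMD_GETFAMILY = 3
CTRL_ATTR_FAMILY_NAME = 2


def decode_genl_request(msg):
    """Decode nlmsghdr + genlmsghdr + the first attribute of a request."""
    nl_len, nl_type, nl_flags, nl_seq, nl_pid = struct.unpack_from("<IHHII", msg, 0)
    cmd, version, _reserved = struct.unpack_from("<BBH", msg, 16)
    attr_len, attr_type = struct.unpack_from("<HH", msg, 20)
    # attr_len covers the 4-byte attribute header plus the payload.
    payload = msg[24:20 + attr_len]
    return {
        "len": nl_len,
        "type": nl_type,
        "flags": nl_flags,
        "seq": nl_seq,
        "pid": nl_pid,
        "cmd": cmd,
        "version": version,
        "attr_type": attr_type,
        "attr_value": payload.rstrip(b"\0").decode("ascii"),
    }


if __name__ == "__main__":
    fields = decode_genl_request(MSG)
    print(fields)
    is_family_lookup = (
        fields["type"] == GENL_ID_CTRL
        and fields["flags"] & NLM_F_REQUEST
        and fields["cmd"] == CTRL_CMD_GETFAMILY
        and fields["attr_type"] == CTRL_ATTR_FAMILY_NAME
    )
    print("generic netlink family lookup:", bool(is_family_lookup))
```

Read this way, the message is a request to the generic netlink controller to
look up the family named "drbd", and its pid field (5040) matches the
getsockname() result a few lines earlier. That would mean drbdsetup blocks
before it sends any DRBD-specific request, though the trace alone does not
show why the write never returns.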