[DRBD-user] First Linstor bug encountered

Julien Escario julien.escario at altinea.fr
Tue Aug 21 18:23:31 CEST 2018


Hello,
I just hit a bug after creating and deleting resources several times on my
two-node cluster.

Syslog reports:
Aug 21 17:31:28 dedie83 kernel: [350254.337961] drbd vm-102-disk-5 dedie82: Preparing remote state change 63686478
Aug 21 17:31:28 dedie83 kernel: [350254.338166] drbd vm-102-disk-5 dedie82: Committing remote state change 63686478 (primary_nodes=1)
Aug 21 17:31:28 dedie83 kernel: [350254.338170] drbd vm-102-disk-5 dedie82: conn( Connected -> TearDown ) peer( Secondary -> Unknown )
Aug 21 17:31:28 dedie83 kernel: [350254.338172] drbd vm-102-disk-5/0 drbd1009 dedie82: pdsk( UpToDate -> DUnknown ) repl( Established -> Off )
Aug 21 17:31:28 dedie83 kernel: [350254.338185] drbd vm-102-disk-5 dedie82: ack_receiver terminated
Aug 21 17:31:28 dedie83 kernel: [350254.338187] drbd vm-102-disk-5 dedie82: Terminating ack_recv thread
Aug 21 17:31:28 dedie83 kernel: [350254.338471] drbd vm-102-disk-5/0 drbd1009: new current UUID: 6E533D317A5115E9 weak: FFFFFFFFFFFFFFFE
Aug 21 17:31:28 dedie83 Satellite[15917]: 17:31:28.828 [MainWorkerPool_0016] ERROR LINSTOR/Satellite - Problem of type 'java.lang.NullPointerException' logged to report number 5B770066-000000
Aug 21 17:31:28 dedie83 Satellite[15917]: 17:31:28.833 [MainWorkerPool_0016] ERROR LINSTOR/Satellite - Access to deleted resource [Report number 5B770066-000001]
Aug 21 17:31:28 dedie83 kernel: [350254.398545] drbd vm-102-disk-5 dedie82: Connection closed
Aug 21 17:31:28 dedie83 kernel: [350254.398570] drbd vm-102-disk-5 dedie82: conn( TearDown -> Unconnected )
Aug 21 17:31:28 dedie83 kernel: [350254.398575] drbd vm-102-disk-5 dedie82: Restarting receiver thread
Aug 21 17:31:28 dedie83 kernel: [350254.398577] drbd vm-102-disk-5 dedie82: conn( Unconnected -> Connecting )


The command that triggered this:
linstor resource delete dedie82 vm-102-disk-5

The VM 102 resource is now stuck:
vm-102-disk-5 role:Primary
  disk:UpToDate
  dedie82 connection:Connecting

I was able to run drbdadm disconnect vm-102-disk-5, so the resource is
currently in the StandAlone state.

There was some load on the nodes at the time. I was tuning NVMe speed, so I
deleted and recreated the resource a few times to compare performance with
and without the connection.

All other resources are fine.

Running drbdadm adjust puts the resource back in the Connecting state.
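
For anyone who wants to watch for the same symptom, here is a minimal Python
sketch that flags peers whose connection is not established. It assumes status
text laid out like the vm-102-disk-5 output quoted earlier in this message
(resource line unindented, peer lines indented and starting with the peer
name). The names parse_status and unconnected_peers are made up for
illustration and are not part of LINSTOR or drbd-utils.

```python
# Sample text copied from the status output quoted in this message.
SAMPLE = """\
vm-102-disk-5 role:Primary
  disk:UpToDate
  dedie82 connection:Connecting
"""


def _fill(target, fields):
    """Store 'key:value' tokens into the target dict."""
    for field in fields:
        key, _, value = field.partition(":")
        target[key] = value


def parse_status(text):
    """Parse status text into {resource: {field: value, 'peers': {...}}}.

    Assumption: an unindented line starts a resource, an indented line whose
    first token has no ':' names a peer, and any other indented line carries
    fields for the most recent peer (or for the resource if no peer yet).
    """
    resources = {}
    current = None    # resource currently being filled in
    last_peer = None  # most recent peer of that resource, if any
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if not line[0].isspace():
            current = {"peers": {}}
            _fill(current, tokens[1:])
            resources[tokens[0]] = current
            last_peer = None
        elif current is None:
            continue  # indented line before any resource: ignore
        elif ":" not in tokens[0]:
            last_peer = {}
            current["peers"][tokens[0]] = last_peer
            _fill(last_peer, tokens[1:])
        else:
            _fill(last_peer if last_peer is not None else current, tokens)
    return resources


def unconnected_peers(resources):
    """Return (resource, peer, state) for peers not in the Connected state.

    Assumption: a peer line without a 'connection' field means Connected.
    """
    stuck = []
    for res_name, res in resources.items():
        for peer_name, peer in res["peers"].items():
            state = peer.get("connection", "Connected")
            if state != "Connected":
                stuck.append((res_name, peer_name, state))
    return stuck


if __name__ == "__main__":
    print(unconnected_peers(parse_status(SAMPLE)))
```

Feeding it the output of drbdadm status (captured however you prefer) should
list this resource for as long as it stays in Connecting or StandAlone.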

Best regards,
Julien Escario

