<p class="MsoNormal"><u><span style="color: black;">Hi All,</span></u><span style="color: black;"></span></p><p class="MsoNormal"><br><span style="color: black;"></span></p><p class="MsoNormal"><span style="color: black;">We've been facing this issue since last one year. We didn't get any solution for that. Please can you help us out here. The problem is described as below: </span><u><span style="color: black;"><br>
</span></u></p><p class="MsoNormal"><u><span style="color: black;"><br></span></u></p><p class="MsoNormal"><u><span style="color: black;"><br></span></u></p><p class="MsoNormal"><b><u><span style="COLOR: black">Problem summary</span></u></b><span style="COLOR: black"></span></p>
<p class="MsoNormal"><span style="COLOR: black"></span> </p>
<p class="MsoNormal"><span style="COLOR: black">The problem is that when
performing an HA failover from server A to server B, a DRBD resource is
sometimes not shut down properly on server A. Several attempts are made to stop
the DRBD resource, but finally it gives up and the server is rebooted. The
failover to server B works properly; B becomes the Active server. After the
reboot, server A comes up properly as the Standby server. </span></p>
<p class="MsoNormal"><span style="COLOR: black"></span> </p>
<p class="MsoNormal"><span style="color: black;">So everything ends up in a good
state (B is Active, A is Standby as expected). The issue is that server A is
unexpectedly rebooted during the failover. We are vulnerable for the time period
that server A is rebooting (a few minutes) in the sense that there is no Standby
server to failover to. <br></span></p><p class="MsoNormal"><br><span style="color: black;"></span></p><p class="MsoNormal"><span style="COLOR: black"><br></span></p>
<p class="MsoNormal"><span style="COLOR: black"></span> </p>
<p class="MsoNormal"><span style="COLOR: black">The problem is intermittent. Most
HA failovers work as expected (server A does not reboot). </span></p>
<p class="MsoNormal"><span style="COLOR: black"></span> </p>
<p class="MsoNormal"><span style="COLOR: black">When the problem does occur, the
following lines are logged in the logs and displayed on the server console:</span></p>
<p class="MsoNormal"><span style="COLOR: black"></span> </p>
<p style="margin-bottom: 12pt;" class="MsoNormal"><span style="font-family: 'Arial','sans-serif'; color: black; font-size: 10pt;">drbd0:
State change failed: Device is held open by someone<br>drbd0: state = {
cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate r--- }<br>drbd0: wanted
= { cs:Connected st:Secondary/Secondary ds:UpToDate/UpToDate r--- }</span></p><p style="margin-bottom: 12pt;" class="MsoNormal">Thanks,<br></p><p style="margin-bottom: 12pt;" class="MsoNormal">Regards,</p><p style="MARGIN-BOTTOM: 12pt" class="MsoNormal">
Dileep Nayak<br><span style="FONT-FAMILY: 'Arial','sans-serif'; COLOR: black; FONT-SIZE: 10pt"></span><span style="COLOR: black"></span></p>