Deadlock while acquiring and releasing lock in HazelcastClusterManager in vertx 3.3.2 #41

Closed
pkadam1989 opened this issue Aug 4, 2016 · 2 comments

pkadam1989 commented Aug 4, 2016

import io.vertx.core.Context;
import io.vertx.core.Vertx;
import io.vertx.core.VertxOptions;

public class Test {

    public static void main(String[] args) {
        Vertx.clusteredVertx(new VertxOptions(), result -> testFunction(result.result()));
    }

    private static void testFunction(Vertx vertx) {
        Context context = vertx.getOrCreateContext();
        // Three requests for the same lock, all submitted on the same context
        context.runOnContext(v -> testFunctionOnContext(vertx, "r1"));
        context.runOnContext(v -> testFunctionOnContext(vertx, "r2"));
        context.runOnContext(v -> vertx.setTimer(20000L,
                event -> context.runOnContext(v1 -> testFunctionOnContext(vertx, "r3"))));
    }

    private static void testFunctionOnContext(Vertx vertx, String req) {
        System.out.println("************ TRY TO GET LOCK " + req + " ************");
        // Wait up to 15 seconds for the clustered lock "abc"
        vertx.sharedData().getLockWithTimeout("abc", 15000L, lockResult -> {
            if (lockResult.succeeded()) {
                System.out.println("************ " + req + " GOT LOCK ************");
                // Simulate 10 seconds of work, then release
                vertx.setTimer(10000L, event -> {
                    System.out.println("************ " + req + " TRYING TO RELEASE LOCK ************");
                    lockResult.result().release();
                });
            } else {
                lockResult.cause().printStackTrace();
            }
        });
    }
}

Using vertx 3.3.2

In the above case, getLockWithTimeout and the release call on the lock both go through executeBlocking, with ordered = true by default.
We are running on a single context.

So:
When r1 tries to get the lock, it acquires the lock for "abc" and starts its work; assume the work takes 10 seconds.

When r2 tries to get the lock before r1 has finished, r2 cannot acquire it and waits up to 15 seconds for r1 to release it.

But r1's release is queued behind r2's acquisition, because executeBlocking runs with ordered = true.

So r2 never gets the lock and fails with a timeout exception, even though r1 finished its work after 10 seconds. Only after r2 fails with the timeout does r1 release the lock, because the release task was waiting for r2's executeBlocking task to finish.

This would work if executeBlocking were called with ordered = false. Why is ordered true by default, which makes the execution of executeBlocking sequential?
As it stands, I cannot use a lock on a single context when multiple requests arrive for the same resource.
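
For context, here is a minimal sketch (not part of the original report; the class name, task durations and print statements are illustrative) of how ordered executeBlocking serializes blocking tasks submitted from the same context. This is the mechanism behind the timeout described above: task B cannot start until task A completes, just as the lock release cannot run until the pending acquisition times out.

import io.vertx.core.Vertx;

public class OrderedBlockingSketch {

    public static void main(String[] args) {
        Vertx vertx = Vertx.vertx();
        vertx.runOnContext(v -> {
            // Both blocking tasks are submitted from the same context.
            // With ordered = true, task B does not start until task A completes.
            vertx.<Void>executeBlocking(fut -> {
                sleep(5000); // task A blocks a worker thread for 5 seconds
                fut.complete();
            }, true, res -> System.out.println("task A done"));

            vertx.<Void>executeBlocking(fut -> {
                fut.complete(); // task B is trivial...
            }, true, res -> System.out.println("task B done")); // ...but still waits for task A
        });
    }

    private static void sleep(long ms) {
        try {
            Thread.sleep(ms);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}

With ordered = false the two tasks could run concurrently on the worker pool and "task B done" would print immediately.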

pkadam1989 (Author) commented:

This issue was not seen in Vertx version 3.2.1

pkadam1989 pushed a commit to pkadam1989/vertx-hazelcast that referenced this issue Aug 23, 2016
…d=true by default, which enables sequential execution of executeBlocking. Changed the ordering to false, since ordered execution causes a race condition when the lock is acquired and released on the same event loop.

Details have been provided in the issue, Deadlock while acquiring and releasing lock in HazelcastClusterManager in vertx 3.3.2 vert-x3#41
vietj (Contributor) commented Sep 4, 2016

Note on what using executeBlocking with ordered=false might improve: #43

tsegismont added a commit to tsegismont/vert.x that referenced this issue Dec 5, 2016
The existing clustered lock tests always involve two different Vert.x instances.
In vert-x3/vertx-hazelcast#41, it has been reported that concurrent requests for a clustered lock on the same instance can lead to a deadlock.
This happens because locks are acquired with ordered executeBlocking, and released the same way.

The fix consists in applying the technique used in JDBC client connections: a dedicated worker executor is created for each lock instance; the lock is acquired in order but released freely. Consequently, we never find ourselves in a situation where a lock release task is waiting for a lock acquire task to complete.

Pull requests for the cluster managers are ready and will be pushed soon.

Signed-off-by: Thomas Segismont <[email protected]>
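
For illustration, here is a sketch of the pattern the commit message describes (this is not the actual patch; the class name, method names and executor name are hypothetical, and the pattern is shown with the public WorkerExecutor API): a dedicated worker executor per lock, with ordered acquisition and unordered release.

import io.vertx.core.Vertx;
import io.vertx.core.WorkerExecutor;

public class LockExecutorSketch {

    private final WorkerExecutor lockExecutor;

    public LockExecutorSketch(Vertx vertx) {
        // Dedicated executor for this lock's operations (name is illustrative).
        this.lockExecutor = vertx.createSharedWorkerExecutor("lock-ops");
    }

    // Acquisitions use ordered = true: acquisitions submitted from the same
    // context run serially, in submission order.
    public void acquire(Runnable blockingAcquire, Runnable onAcquired) {
        lockExecutor.<Void>executeBlocking(fut -> {
            blockingAcquire.run(); // stands in for the blocking cluster-manager lock call
            fut.complete();
        }, true, res -> onAcquired.run());
    }

    // Releases use ordered = false: a release is never queued behind a pending
    // ordered task, so it can run even while an acquisition is still blocked.
    public void release(Runnable blockingRelease) {
        lockExecutor.<Void>executeBlocking(fut -> {
            blockingRelease.run();
            fut.complete();
        }, false, res -> { });
    }
}

With this split, a release task never waits for an acquire task to complete, which is exactly the situation the commit message above sets out to avoid.
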
tsegismont added a commit to tsegismont/vertx-hazelcast that referenced this issue Dec 5, 2016
…castClusterManager

The expected behavior is tested with new tests introduced in eclipse-vertx/vert.x#1732
tsegismont self-assigned this Dec 5, 2016
tsegismont added a commit that referenced this issue Jan 3, 2017
Fixes #41 Deadlock while acquiring and releasing lock in HazelcastClusterManager