lock not propagated between instances #13

ChaosEternal · 2017-04-14T02:55:04Z

The nfs mount does not propagate the locks on file between instances.

test:
on same instance:
run cmd: (flock 9 && echo sucess1; sleep 100) 9> $NFS_MOUNT_POINT/lockfile &
then run: (flock 9 && echo sucess2; sleep 100) 9> $NFS_MOUNT_POINT/lockfile &
will see: sucess1 and after 100 seconds, see sucess2.
the lock works on same instance.

on different instances:
instance 1:
run cmd: (flock 9 && echo sucess1; sleep 100) 9> $NFS_MOUNT_POINT/lockfile &
will see sucess1 immediately
then on instance 2:
run cmd: (flock 9 && echo sucess2; sleep 100) 9> $NFS_MOUNT_POINT/lockfile &
we still will see sucess2 immediately
that means flock does not propage to instance 2

in CF, all applications have multiple instances and they will mount the same volume, then these application wont have proper mechanisms to avoid competing write which will cause data corruption.

cf-gitbot · 2017-04-14T02:55:05Z

We have created an issue in Pivotal Tracker to manage this:

https://www.pivotaltracker.com/story/show/143703131

The labels on this github issue will be updated when the story is started.

julian-hj · 2017-04-14T23:11:38Z

Please see the linux man page on flock:

Notes

flock() does not lock files over NFS. Use fcntl(2) instead: that does work over NFS, given a sufficiently recent version of Linux and a server which supports locking.

ChaosEternal · 2017-04-16T15:09:11Z

sorry.

In Linux kernels up to 2.6.11, flock() does not lock files over NFS (i.e., the scope of locks was limited to the local system). Instead, one could use fcntl(2) byte-range lock‐
ing, which does work over NFS, given a sufficiently recent version of Linux and a server which supports locking. Since Linux 2.6.12, NFS clients support flock() locks by emulat‐
ing them as byte-range locks on the entire file. This means that fcntl(2) and flock() locks do interact with one another over NFS. Since Linux 2.6.37, the kernel supports a com‐
patibility mode that allows flock() locks (and also fcntl(2) byte region locks) to be treated as local; see the discussion of the local_lock option in nfs(5).

julian-hj · 2017-04-19T01:05:11Z

Right you are--we did some testing here on our end comparing kernel mounts with the fuse mount package we use in order to introduce uid mapping

Here's what we found:

server type	fuse-nfs	kernel mount
ubuntu nfs	fails to lock	flock locks correctly (blocks until lock is free)
isilon	fails to lock	flock fails when lock cannot be acquired (does not block)

So it seems that even for kernel mounts there is some discrepancy in the way that flock behaves, but nonetheless, our nfs client is not behaving as well as the kernel nfs libraries. We will investigate further.

ChaosEternal · 2017-04-19T09:49:41Z

OT: why fuse? why not just a mount? even further, why not just an automount ?

julian-hj · 2017-04-19T23:03:22Z

fuse-nfs allows us to do uid mapping. We could use the kernel mount, but then any buildpack application will connect to the nfs server as uid 2000 (the uid of the vcap user in the container)

We wanted the ability for CF apps to be able to declare a uid and have that uid transmitted to the nfs server. This facilitates re-use of existing shares for app replatforming and gives us something sort of resembling security for shares used by multiple apps.

julian-hj · 2017-04-19T23:04:37Z

We opened a corresponding issue in fuse-nfs, as the root cause is in the fuse-nfs library which does not implement the lock() or flock() methods of the fuse interface:

sahlberg/fuse-nfs#20

paulcwarren · 2017-04-19T23:08:35Z

By way of bigger picture we are curious what sorts of apps/use cases you are considering? Is there is a specific use case where a filesystem and filesystem locking is a better approach than some other alternative. This may help us with this feature and testing moving forwards.

ChaosEternal · 2017-06-09T09:14:35Z

@paulcwarren In CF, apps can have more than one instances running, and all these instances will access the same nfs-share, there must be some mechanisms to prevent race conditions.
Traditionally, nfs uses locks to solve such problem.

ChaosEternal · 2017-06-12T01:46:01Z

and this issue is an defect, not a enhancement request. this issue will prevent our customers to adopt this component in their production environments.

julian-hj · 2017-06-12T16:06:53Z

@ChaosEternal -- I think we understand the high level issue, but it would be helpful for us to have a bit more detail about your specific use case in order to prioritize this work against some of the other possible remedies we might consider.

For example, if NFSv4 were supported, would you consider to use that over NFSv3? My impression is that locking is more cleanly integrated in a V4 context as it is incorporated into the NFS protocol itself rather than being implemented in a parallel protocol. OTOH, some folks still have issues with NFS4 reliability when networks are slow or flaky, which is maybe why it hasn't got better adoption.

As another example, do you require locking because your apps must be scaled up in order to work in a production environment, or is this a requirement because CF today cannot guarantee that it will not start more than one instance of an application? In other words, if CF were to support more "pessimistic" application scheduling, and single-attach semantics for file volumes, would that meet the requirement? Or you require >1 instances for scalability to prod?

These are fairly specific questions, but a general understanding of the use cases and environment you're working with would also be helpful for us.

julian-hj · 2018-07-31T22:59:08Z

Nearly a year and a half later, this issue is finally resolved when using the nfs-experimental service. We will be migrating all mounts to use the "experimental" mount flow in the next couple of months, but I am going to go ahead and close this issue.

The fix is available in nfs-volume-release v1.5.0.

julian-hj · 2018-08-17T20:27:54Z

Thanks to @jandubois for pointing out that the comment above is a little inaccurate. The fix that makes locking work is actually in mapfs-release v1.1.0

cf-gitbot added the unscheduled label Apr 14, 2017

cf-gitbot added scheduled in progress and removed unscheduled scheduled labels Apr 19, 2017

julian-hj added the enhancement label Apr 20, 2017

cf-gitbot added scheduled unscheduled and removed in progress scheduled labels Apr 20, 2017

cf-gitbot added scheduled and removed unscheduled labels Feb 22, 2018

cf-gitbot added in progress and removed scheduled labels Mar 14, 2018

cf-gitbot added scheduled and removed in progress labels Jun 13, 2018

cf-gitbot added in progress and removed scheduled labels Jul 2, 2018

cf-gitbot removed the in progress label Jul 24, 2018

cf-gitbot added delivered accepted and removed delivered labels Jul 24, 2018

julian-hj closed this as completed Jul 31, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lock not propagated between instances #13

lock not propagated between instances #13

ChaosEternal commented Apr 14, 2017

cf-gitbot commented Apr 14, 2017

julian-hj commented Apr 14, 2017

ChaosEternal commented Apr 16, 2017

julian-hj commented Apr 19, 2017

ChaosEternal commented Apr 19, 2017 •

edited

Loading

julian-hj commented Apr 19, 2017

julian-hj commented Apr 19, 2017

paulcwarren commented Apr 19, 2017

ChaosEternal commented Jun 9, 2017

ChaosEternal commented Jun 12, 2017

julian-hj commented Jun 12, 2017

julian-hj commented Jul 31, 2018

julian-hj commented Aug 17, 2018

lock not propagated between instances #13

lock not propagated between instances #13

Comments

ChaosEternal commented Apr 14, 2017

cf-gitbot commented Apr 14, 2017

julian-hj commented Apr 14, 2017

ChaosEternal commented Apr 16, 2017

julian-hj commented Apr 19, 2017

ChaosEternal commented Apr 19, 2017 • edited Loading

julian-hj commented Apr 19, 2017

julian-hj commented Apr 19, 2017

paulcwarren commented Apr 19, 2017

ChaosEternal commented Jun 9, 2017

ChaosEternal commented Jun 12, 2017

julian-hj commented Jun 12, 2017

julian-hj commented Jul 31, 2018

julian-hj commented Aug 17, 2018

ChaosEternal commented Apr 19, 2017 •

edited

Loading