-
Notifications
You must be signed in to change notification settings - Fork 181
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ls: cannot open directory '...': Transport endpoint is not connected #630
Comments
Addtitionally, we are observing a CPU spike every minute with --enable-metadata-caching --metadata-cache-ttl 60. I was hoping the listing would be lazy e.g if the users don't list or interact with the mount, no listing is done. |
Hi @tchaton, thanks for raising the issue. I see you were using a custom build of 1.1.1 with caching. Have you since upgraded to 1.2.0? Note that the flags to configure caching are different from the pre-release version. Once you upgrade, could you report if you are still observing the issue on 1.2.0? Are you able to share more details on the workload you ran before seeing the error on EDIT: for help with the new configuration flags, see this section in the docs. |
About the CPU spikes: Mountpoint does not proactively refresh metadata when it expires. So it should behave just as you were expecting. I suspect that the activity you are observing is due to applications accessing the filesystem and the kernel in turn requesting updated metadata from Mountpoint. |
Hey @passaro Let me update and give you more feedbacks. |
@passaro But if you want to see some failures, you can do something like this. Create 1 bucket with 1M files with random sizes ranging from 100kb to 10GB. And copy all the files from the mount to another bucket while trying to maximize the CPU usage of the machine to 100%( I am using a machine with 32 or 64 CPU cores).
This always fails for me. However, other open source solutions are more reliable under that same stress. |
@tchaton, unfortunately, I was not able to reproduce the issue with the command you suggested. It may depend on specific factors like the content of your bucket or the load on your instance. However, my (unconfirmed) suspicion is that you are seeing the result of an out of memory issue, similar to that reported in #502.
|
Hey @passaro I will try again. For the |
You can probably use journalctl. For example, the lines I copied above were extracted from the output of this command:
|
I also encountered this error when using s3fs and now mountpoint-s3. I am applying a solution that I described in this comment: s3fs-fuse/s3fs-fuse#2356 (comment) |
Mountpoint v1.10.0 has been released with some prefetcher improvements and might reduce memory usage. Could you please try upgrading to see if it provides any improvements for you? |
I'm getting the same issue and just tried v.1.10.0. It doesn't appear to have helped. (I can reproduce by copying a file - using 'cp' - from a mounted S3 bucket to the local filesystem where the file is about the same size as the free available RAM on the system. At some point part way through the copy it fails and I get "Transport endpoint is not connected" displayed as the error.) The 'temporary workaround' in #1021 does address the issue. |
Hey, @jmccl! Thanks for taking time to report an issue that you're facing in connection with memory usage on version 1.10.0, we're particularly interested in this. It seems that you've found a workaround that is suitable for your use case, but please note, that this approach is not stable, meaning that extra caution should be exercised when updating Mountpoint on workloads that use it. In case, your problem isn't solved or you'll be interested to help us improve the memory limiting in Mountpoint, consider opening a new bug report. It would be helpful if you could describe in more detail the environment where you're facing the problem:
|
Closing this issue since there is no activity on the original problem from 2023. We suspect that the crash was occurring because of Mountpoint getting killed by OOM killer. Starting from version 1.10.0 Mountpoint will target to use no more than 95% of the installed memory on the host, which may solve the problem in some cases. |
Mountpoint for Amazon S3 version
1.1.1 with caching
AWS Region
us-east-1
Describe the running environment
Running on Amazon EC2
What happened?
This is happening quite frequentally ~ 7/10 for us in our filesystem tests.
ls: cannot open directory `....`: Transport endpoint is not connected
Relevant log output
The only log line I can see is the following.
cc @dannycjones @passaro
The text was updated successfully, but these errors were encountered: