Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove client initialization for fast listing from dataflux-pytorch #111

Merged
merged 3 commits into from
Sep 18, 2024

Conversation

akansha1812
Copy link
Collaborator

@akansha1812 akansha1812 commented Sep 16, 2024

To fix #58

When multiprocess starts with spawn or forkserver, passing storage client to fast-listing from dataflux pytorch will result in pickling error. Multi-processing fails as it cannot serialize/pickle storage client.

  • remove storage client creation for fast listing from dataflux_pytorch/dataflux_mapstyle_dataset.py.

The change got add in PR #109

  • Tests pass
  • Appropriate changes to documentation are included in the PR

@akansha1812 akansha1812 requested a review from a team as a code owner September 16, 2024 23:16
@akansha1812 akansha1812 merged commit bdd0db7 into main Sep 18, 2024
2 checks passed
@akansha1812 akansha1812 deleted the amaloo/remove-storage-initalization-listing branch September 18, 2024 18:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Pickling client objects is explicitly not supported (Iterable Dataset)
3 participants