Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support opening datasets with file-like objects in a Beam pipeline #37

Open
alxmrs opened this issue Sep 15, 2021 · 0 comments
Open

Support opening datasets with file-like objects in a Beam pipeline #37

alxmrs opened this issue Sep 15, 2021 · 0 comments

Comments

@alxmrs
Copy link
Contributor

alxmrs commented Sep 15, 2021

I experimented a bit more with this based on @mjwillson's suggestion.

Amazingly, it seems that uses file-like objects in Xarray does actually work as used here, though making a local copy might still have better performance.

What doesn't work yet -- but hopefully with small upstream changes to Xarray could work -- is passing xarray datasets opened with these file-like objects into a Beam pipeilne. That could let us do the actual data loading from netCDF in separate workers, which could be quite a win!

Originally posted by @shoyer in #31 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant