Refactor: use Apache Arrow compute for string function #885
FYI: If you have a bit of patience (about 24 hours), you could use the Arrow conda packages from the arrow-nightlies channel instead of building it yourself.
Great, I didn't know it existed, and it was difficult to find. Thanks a lot!
@JovanVeljanoski it would be great if you could add/finish the str->boolean functions added in apache/arrow#7656.
@JovanVeljanoski I think I want to merge this early and leave the rest for you to do in a different PR. We need some of this in #865, and I also want to merge that soon.
packages/vaex-core/vaex/dataframe.py
Outdated
@@ -134,6 +134,7 @@ def get_main_executor():
 _doc_snippets['chunk_size_export'] = 'Number of rows to be written to disk in a single iteration'
 _doc_snippets['evaluate_parallel'] = 'Evaluate the (virtual) columns in parallel'
 _doc_snippets['array_type'] = 'Type of output array, possible values are None/"numpy" (ndarray), "xarray" for a xarray.DataArray, or "list" for a Python list'
+_doc_snippets['ascii'] = 'Transform only ascii character (usually faster).'
character -> characters
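For readers unfamiliar with what an `ascii` flag like the one documented above typically means, here is a minimal pure-Python sketch. It is not the vaex or Arrow implementation: `ascii_upper` is a hypothetical helper written only for illustration, and it assumes the flag restricts the transformation to the letters a-z while leaving every other character untouched.

```python
def ascii_upper(text: str) -> str:
    """Uppercase only ASCII letters a-z; leave all other characters unchanged.

    Hypothetical helper for illustration, not part of vaex or Arrow.
    """
    return "".join(
        chr(ord(ch) - 32) if "a" <= ch <= "z" else ch
        for ch in text
    )


words = ["vaex", "straße", "élan"]

# ASCII-only: non-ASCII characters such as 'ß' and 'é' pass through as-is.
ascii_result = [ascii_upper(w) for w in words]

# Unicode-aware: Python's str.upper() applies full Unicode case mapping.
unicode_result = [w.upper() for w in words]

print(ascii_result)
print(unicode_result)
```

The two results differ only for strings containing non-ASCII characters. An ASCII-only pass can skip Unicode case mapping entirely, which is presumably why the doc snippet describes it as usually faster.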
Windows CI has become crazy slow, btw; we'll have to trace back when and why that happened. It seems the conda env creation takes ages.
This is a draft PR to check the status of Arrow compute with vaex. I think we will likely cherry-pick from this branch as Arrow makes new releases.