-
Notifications
You must be signed in to change notification settings - Fork 853
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comparison for nested type #5407
Comments
I'm not sure we want to support this within these kernels, in particular the null behaviour is not well-defined. I added support to the comparator for this, which will allow systems like DataFusion to define an ordering for nulls based on a config setting, as Spark and Postgres use different ordering. There is an example of this here - https://docs.rs/arrow-ord/latest/arrow_ord/ord/fn.make_comparator.html#postgres-compatible-nested-comparison This will also make it more obvious that the nested comparison is not vectorised Edit: Filed #5942 to make this more discoverable |
Are you suggesting that if datafusion requires nested comparison, we could implement them in datafusion with This is the code that utilize |
Yes, DataFusion should decide what null semantics it wants / make this configurable, and implement it using make_comparator if appropriate |
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Ref #9252
I want to be able to compare ListArray. The comparison order is left to right and if the prefix is the same then it is considered.
lesser.
For example,
Describe the solution you'd like
Support nested types in
compare_op
, currently failed on this line.Describe alternatives you've considered
Additional context
We can support most of the ops in the future,
lt
andeq
are two that I would like to work on firstThe text was updated successfully, but these errors were encountered: