-
Notifications
You must be signed in to change notification settings - Fork 464
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] array_size(null)
results inconsistent with vanilla spark
#5248
Comments
spark replaces |
I want to add a new |
Hi @wForget, thanks for bringing up this issue! Looks velox has a config to control the behavior. https://github.com/facebookincubator/velox/blob/main/velox/functions/sparksql/Size.cpp#L35 I note Gluten sets it according to Spark's config to align with Spark's "Size" function. But, for "ArraySize" function, we expect it's always false. For performance consideration, it may be better to directly do some changes in velox's size function, e.g., add support for two args here. The extra arg is |
@PHILO-HE Thanks for your guidance, this makes sense to me, I will try it. |
This is an example to let a function struct cover different inputs. Similarly, you need to add an extra |
Thanks, I'm trying to do it that way, and will send a pr later. |
I think before the wForget's PR ready in velox, at least we can fallback if |
@wForget, do you have any progress? I can take over it if you have no bandwidth. @gaoyangxiaozhu, let's wait two or three days. |
Sorry, I was interrupted by something else, please feel free to send PR. |
Backend
VL (Velox)
Bug description
array_size(null)
results inconsistent with vanilla spark.test sql:
native engine returns:
-1
vanilla spark returns:
null
Spark version
None
Spark configurations
No response
System information
No response
Relevant logs
No response
The text was updated successfully, but these errors were encountered: