[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression #5570
@@ -2267,6 +2267,8 @@ def _prepare_for_python_RDD(sc, command, obj=None):
# The broadcast will have same life cycle as created PythonRDD
broadcast = sc.broadcast(pickled_command)
pickled_command = ser.dumps(broadcast)
# There is a bug in py4j.java_gateway.JavaClass with auto_convert
Can you document which bug this is?
Added a link here.
@JoshRosen can you take a look at this? I don't really know the py4j stuff.
Test build #30514 has finished for PR 5570 at commit
Test build #30515 has finished for PR 5570 at commit
Test build #30525 has finished for PR 5570 at commit
Looks like the change broke something in MLlib.
Test build #30541 has finished for PR 5570 at commit
Test build #30545 has finished for PR 5570 at commit
Thanks. I'm going to merge this into master.
This PR enables auto_convert in JavaGateway, so that we can register a converter for a given type, for example date and datetime.

There are two bugs related to auto_convert, see [1] and [2]; we work around them in this PR.

[1] py4j/py4j#160
[2] py4j/py4j#161

cc rxin JoshRosen

Author: Davies Liu

Closes apache#5570 from davies/py4j_date and squashes the following commits:

eb4fa53 [Davies Liu] fix tests in python 3
d17d634 [Davies Liu] rollback changes in mllib
2e7566d [Davies Liu] convert tuple into ArrayList
ceb3779 [Davies Liu] Update rdd.py
3c373f3 [Davies Liu] support date and datetime by auto_convert
cb094ff [Davies Liu] enable auto convert
This PR enables auto_convert in JavaGateway, so that we can register a converter for a given type, for example date and datetime.
There are two bugs related to auto_convert, see [1] and [2]; we work around them in this PR.
[1] py4j/py4j#160
[2] py4j/py4j#161
cc @rxin @JoshRosen
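For readers unfamiliar with the converter idea, here is a minimal pure-Python sketch of the pattern the description refers to. It does not use py4j or a JVM: the registry (`CONVERTERS`, `register`, `auto_convert`) and the tuple return values are hypothetical stand-ins invented for illustration, and the real converters in this PR may look different. py4j's converter objects expose `can_convert` and `convert` methods; the sketch mirrors those names but simplifies `convert` to take only the object.

```python
import datetime


class DateConverter:
    """Hypothetical converter for datetime.date values."""

    def can_convert(self, obj):
        return isinstance(obj, datetime.date)

    def convert(self, obj):
        # Stand-in for building a java.sql.Date: return the target type
        # name and a yyyy-mm-dd string instead of a real Java object.
        return ("java.sql.Date", obj.isoformat())


class DatetimeConverter:
    """Hypothetical converter for datetime.datetime values."""

    def can_convert(self, obj):
        return isinstance(obj, datetime.datetime)

    def convert(self, obj):
        # Stand-in for building a java.sql.Timestamp.
        return ("java.sql.Timestamp", obj.strftime("%Y-%m-%d %H:%M:%S.%f"))


# Hypothetical registry: the first converter that accepts a value wins.
CONVERTERS = []


def register(converter):
    CONVERTERS.append(converter)


def auto_convert(obj):
    """Return the converted form of obj, or obj unchanged if nothing matches."""
    for converter in CONVERTERS:
        if converter.can_convert(obj):
            return converter.convert(obj)
    return obj


# datetime.datetime is a subclass of datetime.date, so the more specific
# converter has to be registered first in a first-match registry.
register(DatetimeConverter())
register(DateConverter())

print(auto_convert(datetime.date(2015, 4, 17)))
print(auto_convert(datetime.datetime(2015, 4, 17, 10, 30, 0)))
print(auto_convert(42))
```

The registration order is the one design point worth noting: with the two converters swapped, a `datetime` value would be picked up by `DateConverter` and lose its time component.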