-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-21165][SQL] FileFormatWriter should handle mismatched attribute ids between logical and physical plan #19483
Conversation
cc @gatorsmile |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Test build #82683 has finished for PR 19483 at commit
|
Test build #82685 has finished for PR 19483 at commit
|
It sounds like we are facing various issues because we are using the analyzed plan. Is that possible we just add an extra Project using the analyzed plan's output at the end of optimizer? |
I'll refactor it later, to use |
The hive bucketing PR does that : #19001 I can isolate that piece and put out a PR |
that will be great, thanks @tejasapatil ! |
Test build #82712 has finished for PR 19483 at commit
|
…ry schema ## What changes were proposed in this pull request? #18386 fixes SPARK-21165 but breaks SPARK-22252. This PR reverts #18386 and picks the patch from #19483 to fix SPARK-21165. ## How was this patch tested? new regression test Author: Wenchen Fan <[email protected]> Closes #19484 from cloud-fan/bug.
thanks for the review, merging to master! |
…ry schema ## What changes were proposed in this pull request? apache#18386 fixes SPARK-21165 but breaks SPARK-22252. This PR reverts apache#18386 and picks the patch from apache#19483 to fix SPARK-21165. ## How was this patch tested? new regression test Author: Wenchen Fan <[email protected]> Closes apache#19484 from cloud-fan/bug.
What changes were proposed in this pull request?
Due to optimizer removing some unnecessary aliases, the logical and physical plan may have different output attribute ids. FileFormatWriter should handle this when creating the physical sort node.
How was this patch tested?
new regression test.