[SPARK-11691][SQL] Support setting hadoop compression codecs in DataFrameWriter#option #11324
Test build #51775 has finished for PR 11324 at commit
@rxin Could you check this?
@@ -58,7 +59,8 @@ import org.apache.spark.util.Utils
 private[sql] case class InsertIntoHadoopFsRelation(
     @transient relation: HadoopFsRelation,
     @transient query: LogicalPlan,
-    mode: SaveMode)
+    mode: SaveMode,
+    codec: Option[Class[_ <: CompressionCodec]] = None)
Rename `codec` to `compressionCodec` to make it clearer.
Okay, I'm fixing it now.
Test build #51867 has finished for PR 11324 at commit
Test build #51872 has finished for PR 11324 at commit
Jenkins, retest this please.
Test build #51875 has finished for PR 11324 at commit
ISTM the failure is not related to this change.
Take a look at the end of ParquetRelation. It would be great if we could consolidate the two and specify the short compression codec names in a single place.
Yeah, I reverted a patch that broke the test.
Okay, I'll try.
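For illustration, the consolidation suggested above could look roughly like the sketch below: one lookup table that maps short codec names to Hadoop codec class names. This is not the code from this PR. The object name `CodecNames`, the method `resolve`, and the exact set of short names are assumptions made for the example; only the Hadoop class names are real.

```scala
// Hypothetical helper, not the code from this PR: a single place that maps
// short compression codec names to Hadoop codec class names.
object CodecNames {
  // Fully qualified names of codec classes that ship with Hadoop.
  // The choice of short names here is an assumption for this sketch.
  private val shortNames: Map[String, String] = Map(
    "bzip2"  -> "org.apache.hadoop.io.compress.BZip2Codec",
    "gzip"   -> "org.apache.hadoop.io.compress.GzipCodec",
    "lz4"    -> "org.apache.hadoop.io.compress.Lz4Codec",
    "snappy" -> "org.apache.hadoop.io.compress.SnappyCodec")

  // Resolve a user-supplied name case-insensitively. Anything that is not a
  // known short name is passed through unchanged, on the assumption that it
  // is already a fully qualified class name.
  def resolve(name: String): String =
    shortNames.getOrElse(name.toLowerCase(java.util.Locale.ROOT), name)
}
```

A caller could then load the class named by `CodecNames.resolve("gzip")` to obtain the `Class[_ <: CompressionCodec]` value that the new constructor parameter in the diff expects. Keeping the table in one object means every data source accepts the same short names.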
14ef39a to 86c8c0c
Test build #51934 has finished for PR 11324 at commit
Jenkins, retest this please.
Test build #51935 has finished for PR 11324 at commit
Test build #51941 has finished for PR 11324 at commit
Jenkins, retest this please.
Test build #51949 has finished for PR 11324 at commit
@rxin okay, ready to review again.
@rxin @HyukjinKwon No problem ;) Is it okay to apply my diffs on top of @HyukjinKwon's commit?
Yes, please.
@HyukjinKwon Is it okay to include documentation for the codec options in this PR?
Closing this PR and moving to a new one.
…ent in ParquetRelation

## What changes were proposed in this pull request?
This PR makes the short names of compression codecs in `ParquetRelation` consistent with the other ones. This PR comes from #11324.

## How was this patch tested?
Added more tests in `TextSuite`.

Author: Takeshi YAMAMURO <[email protected]>
Closes #11408 from maropu/SPARK-13528.
What changes were proposed in this pull request?
This PR adds support for Hadoop compression codecs when saving a DataFrame to disk.
It is a rework of #9657, which has gone stale.
Closes #9657