
Added toggle for Parquet V1 and V2 formats #9497

Closed
wants to merge 2 commits

Conversation

joshthoward
Member

This PR creates a toggle to write data in the Parquet V1 format using the native writer. The default version remains unchanged to avoid introducing compatibility issues.

With this change:

  • Hive cannot read Parquet V1 files created with the native Trino writer; all tested Hive versions fail with org.apache.hadoop.io.BytesWritable cannot be cast to org.apache.hadoop.hive.serde2.io.HiveVarcharWritable (there are mixed errors for Parquet V2 files)
  • Spark can read Parquet V2 files created with the native Trino writer with Spark's vectorized reader turned off
  • Spark can read Parquet V1 files created with the native Trino writer with Spark's vectorized reader turned on or off

Why does this seem to work? (TBH I wouldn't have expected it to)

  • ValuesWriter is injected into PrimitiveColumnWriter here.
  • valuesWriterFactory creates a new ValuesWriter here.
  • Trino's native parquet writer still uses DefaultValuesWriterFactory here.
  • DefaultValuesWriterFactory respects the Parquet V1 or V2 flag when constructing ValuesWriters here (see the sketch after this list).
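
A minimal sketch of the parquet-mr calls involved (not Trino's actual code; valuesWriterFactoryFor is a hypothetical helper named only for illustration):

import org.apache.parquet.column.ParquetProperties;
import org.apache.parquet.column.ParquetProperties.WriterVersion;
import org.apache.parquet.column.values.factory.DefaultValuesWriterFactory;
import org.apache.parquet.column.values.factory.ValuesWriterFactory;

// Hypothetical helper: build a factory whose ValuesWriters follow the requested format version.
static ValuesWriterFactory valuesWriterFactoryFor(boolean writeV2)
{
    ParquetProperties properties = ParquetProperties.builder()
            .withWriterVersion(writeV2 ? WriterVersion.PARQUET_2_0 : WriterVersion.PARQUET_1_0)
            .build();
    ValuesWriterFactory factory = new DefaultValuesWriterFactory();
    factory.initialize(properties);
    // factory.newValuesWriter(columnDescriptor) will now construct V1- or V2-style encodings.
    return factory;
}

If that chain holds, flipping the WriterVersion passed to ParquetProperties is what switches the value encodings, which would explain why the toggle appears to work.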

This PR is still a WIP, but I wanted to post this for discussion.

@findepi
Member

findepi commented Oct 5, 2021

Hive cannot read Parquet V1 files created with the native Trino writer; all tested Hive versions fail with org.apache.hadoop.io.BytesWritable cannot be cast to org.apache.hadoop.hive.serde2.io.HiveVarcharWritable (there are mixed errors for Parquet V2 files)

This sounds like an important piece of solving #6377

This PR creates a toggle to write data in the Parquet V1 format using the native writer.

It seems like we didn't make a very deliberate decision to use file format version V2 and we later realized this is causing problems (#7953)

We should switch to v1 for now, unless we can name actual benefits of having a toggle.
Requiring users to switch the toggle so that Trino writes data that can be read by others is not a good deal for me, but maybe I am missing something.

@findepi
Member

findepi commented Oct 5, 2021

per @anjalinorwood #7953 (comment)

changing the withWriterVersion might not be enough.

@anjalinorwood
Member

+1000

It seems like we didn't make a very deliberate decision to use file format version V2 and we later realized this is causing problems (#7953)

We should switch to v1 for now, unless we can name actual benefits of having a toggle. Requiring users to switch the toggle so that Trino writes data that can be read by others is not a good deal for me, but maybe I am missing something.

@martint
Member

martint commented Oct 5, 2021

We should switch to v1 for now, unless we can name actual benefits of having a toggle.

Yes, I agree that we should switch to V1 since that's what's supported more broadly. I think we should keep V2 as an experimental flag. It's useful to be able to write data in the new format so that others can test compatibility as they develop support in their engines.

@joshthoward
Member Author

@anjalinorwood Based on your comments in #7953, did you have an explicit test case where something else was needed to support the V1 spec?

Writing DataPageHeaderV1 in PrimitiveColumnWriter#flushCurrentPageToBuffer (example below) results in Spark thinking that the page is compressed... I think this is a bug with Spark missing the compression flag.

parquetMetadataConverter.writeDataPageV1Header((int) uncompressedSize,
                    (int) compressedSize,
                    currentPageRowCount,
                    Encoding.RLE, // Encoding.RLE matches rlEncoding value of RunLengthBitPackingHybridEncoder
                    Encoding.RLE, // Encoding.RLE matches dlEncoding value of RunLengthBitPackingHybridEncoder
                    primitiveValueWriter.getEncoding(),
                    pageHeaderOutputStream);
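
For context on the compression confusion: in the parquet-format Thrift definitions, the V1 DataPageHeader has no is_compressed field, so readers decide whether to decompress based on the column chunk's codec, while DataPageHeaderV2 carries an explicit is_compressed flag (default true). A rough sketch of that reader-side decision (hypothetical method, not Spark's actual code; names mirror the parquet-format Thrift model):

import org.apache.parquet.format.CompressionCodec;
import org.apache.parquet.format.PageHeader;

// Hypothetical reader-side check, not Spark's implementation.
static boolean shouldDecompressPage(PageHeader pageHeader, CompressionCodec chunkCodec)
{
    if (pageHeader.isSetData_page_header_v2()) {
        // V2 headers state explicitly whether the page body is compressed
        return pageHeader.getData_page_header_v2().isIs_compressed();
    }
    // V1 headers have no such flag; compression is implied by the chunk codec alone
    return chunkCodec != CompressionCodec.UNCOMPRESSED;
}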

@findepi
Member

findepi commented Oct 5, 2021

so that others can test compatibility as they develop support in their engines.

It would be awesome if Trino could become the reference implementation for the Parquet v2 format, but I would assume parquet-mr is going to be that no matter what we do.
We don't know whether the current implementation is 100% compliant with the Parquet spec, do we?

Toggles are not something to add haphazardly. There is a cost to them in terms of code and cognitive complexity that affects future maintainability and ability to safely evolve the system.

@anjalinorwood Based on your comments in #7953, did you have an explicit test case where something else was needed to support the V1 spec?

Good question. While we look for the answer to that, I think we should also take a look at what it would take to produce files that are readable by Hive.

@alexjo2144
Member

If the other PR gets merged first, we'll have a small merge conflict with https://github.com/trinodb/trino/pull/9569/files/fd0c06389c6977de7ecc96a70ec6959086f90446#r725263578

But it should just be another case of switching a "v2" to a "v1".
