When the source table is well organized AND has large files, the DISTRIBUTE BY and dynamic-partition sorting options, which introduce a REDUCE phase, get choked up on the writes: each partition is written by a single reducer as a single file, when we really want multiple writers.
Added a CLI option: -so|--skip-optimizations. This sets hive.optimize.sort.dynamic.partition=false and does NOT add DISTRIBUTE BY to the generated SQL statements.
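To make the difference concrete, here is a rough sketch of the two statement shapes involved. The table and column names (target_tbl, source_tbl, dt) are hypothetical; this is not the tool's literal output.

    -- Sketch only: target_tbl, source_tbl and the partition column dt are hypothetical.

    -- Default path: the sort/distribute optimization adds a reduce phase, so each
    -- partition ends up written by a single reducer.
    SET hive.optimize.sort.dynamic.partition=true;
    INSERT OVERWRITE TABLE target_tbl PARTITION (dt)
    SELECT col1, col2, dt FROM source_tbl
    DISTRIBUTE BY dt;

    -- With -so|--skip-optimizations: the property is disabled and DISTRIBUTE BY is
    -- omitted, leaving a map-only plan with multiple writers per partition.
    SET hive.optimize.sort.dynamic.partition=false;
    INSERT OVERWRITE TABLE target_tbl PARTITION (dt)
    SELECT col1, col2, dt FROM source_tbl;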
When the source table is well compressed and the files are large, a map-only job is fine for the transfer. In that case no files are combined, so each mapper writes only to the partition its source file came from.
Also consider setting hive.exec.orc.split.strategy=BI to help with file organization.
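As a rough illustration, the session settings ahead of the transfer INSERT might look like the following. Only hive.exec.orc.split.strategy and hive.optimize.sort.dynamic.partition come from the notes above; the dynamic-partition properties are a common extra requirement for dynamic partition inserts, not something the tool is documented to set.

    -- BI split strategy: generate per-file splits without reading ORC footers, so each
    -- large source file keeps its own mapper and lands in its original partition.
    SET hive.exec.orc.split.strategy=BI;
    -- Keep the plan map-only (what -so|--skip-optimizations sets).
    SET hive.optimize.sort.dynamic.partition=false;
    -- Assumption: typically needed for dynamic partition inserts; not part of the notes above.
    SET hive.exec.dynamic.partition=true;
    SET hive.exec.dynamic.partition.mode=nonstrict;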