
ORC-1060: Reduce memory usage when vectorized reading dictionary string encoding columns #971

Merged
merged 2 commits into apache:main on Dec 25, 2021

Conversation

expxiaoli
Contributor

@expxiaoli expxiaoli commented Dec 14, 2021

What changes were proposed in this pull request?

In the old code, when dictionary-encoded string columns are read with the vectorized reader, two copies of the current stripe's dictionary data and one copy of the next stripe's dictionary data are held in memory when reading across stripe boundaries. That can make the vectorized reader use more memory than the row reader. This patch fixes the issue so that only one copy of the current stripe's dictionary data is held.

The patch has three parts:

  1. Read data directly into a primitive byte array instead of using DynamicByteArray as an intermediate buffer. Using DynamicByteArray as an intermediate causes two copies of the current stripe's dictionary data to be held in memory.

  2. Read dictionary data lazily, only when the current batch's data is actually read. Previously, the RecordReaderImpl class's nextBatch method read the next stripe's dictionary data through the advanceToNextRow method, so memory held two stripes' dictionary data at once. With lazy reading, only one stripe's dictionary data is held in memory when reading across stripe boundaries.

  3. Before lazily reading the current stripe's dictionary data, clear the batch data's reference to the previous stripe's dictionary data, allowing the GC to reclaim that memory.
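The three parts above can be sketched in simplified form. This is a minimal illustration of the pattern only; the class and method names (LazyDictionary, startStripe, getDictionary, readStream) are hypothetical and do not match the actual ORC internals:

```java
import java.util.Arrays;

// Hypothetical sketch: one lazily-loaded dictionary copy per stripe.
public class LazyDictionary {
    private byte[] dictBytes;   // single copy of the current stripe's dictionary
    private boolean loaded = false;

    // Part 3: drop the reference to the previous stripe's dictionary
    // before the next one is (lazily) loaded, so GC can reclaim it.
    public void startStripe() {
        dictBytes = null;
        loaded = false;
    }

    // Part 2: the dictionary is only materialized when a batch needs it.
    public byte[] getDictionary(int totalLength) {
        if (!loaded) {
            // Part 1: fill one exact-size primitive byte[] directly,
            // with no DynamicByteArray intermediate copy.
            dictBytes = readStream(totalLength);
            loaded = true;
        }
        return dictBytes;
    }

    // Stand-in for draining the dictionary stream.
    private byte[] readStream(int totalLength) {
        byte[] buf = new byte[totalLength];
        Arrays.fill(buf, (byte) 'x'); // placeholder data
        return buf;
    }

    public static void main(String[] args) {
        LazyDictionary d = new LazyDictionary();
        d.startStripe();
        byte[] first = d.getDictionary(8);
        // repeated reads within the same stripe reuse the same array
        System.out.println(first == d.getDictionary(8)); // true
        d.startStripe(); // next stripe: the old array becomes collectable
        System.out.println(first == d.getDictionary(8)); // false: freshly loaded
    }
}
```

The key property is that at any moment at most one stripe's dictionary bytes are reachable, which is what keeps vectorized reading close to row reading in memory usage.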

Why are the changes needed?

Reduce memory usage.

How was this patch tested?

Pass the existing CIs.

@github-actions github-actions bot added the JAVA label Dec 14, 2021
@expxiaoli expxiaoli changed the title ORC-1060: reduce memory usage when vectorized reading dictionary stri… ORC-1060: reduce memory usage when vectorized reading dictionary string encoding columns Dec 14, 2021
@expxiaoli
Contributor Author

see background info here: https://issues.apache.org/jira/browse/ORC-1060

@dongjoon-hyun
Member

Thank you for making a PR, @expxiaoli .

@dongjoon-hyun
Member

cc @pgaref

@expxiaoli
Contributor Author

Here is a perf test.
I created an ORC table named src_table with a map<string, string> column named mapping, whose per-stripe string dictionary can occupy 466 MB of memory.
Then I ran a Spark query to read this column:
insert overwrite table res_table select mapping['tag_a'] from src_table;

With the old ORC lib, the query only succeeds when executor-memory is set to 2500M or more; otherwise it fails with an OOM exception. Here is the result from the MAT tool when running the query with 2500M executor-memory:
[Screenshot: memory usage (DynamicByteArray)]

With the ORC lib including this patch, the query succeeds with executor-memory reduced to 1200M.

@expxiaoli
Contributor Author

@pgaref @wgtmac Could you review and verify it?
For reading string-dictionary-encoded columns, this PR reduces batch reading's memory usage to nearly that of row reading, which solves the OOM issue when migrating from row reading to batch reading.

Member

@dongjoon-hyun dongjoon-hyun left a comment


Is there any potential regression in terms of speed?

@expxiaoli
Contributor Author

expxiaoli commented Dec 23, 2021

@dongjoon-hyun In my perf test there is no speed regression with this patch. Here are the scan-time results for the Spark query "insert overwrite table res_table select mapping['tag_a'] from src_table".

"scan time" is the Spark metric in the FileSourceScanExec class's doExecuteColumnar method, which measures only the scan operator.

| executor-memory | new ORC with this patch | old ORC |
| --------------- | ----------------------- | ------- |
| 2500M           | 37.2s                   | 34.6s   |
| 2250M           | 36.5s                   | OOM     |
| 1100M           | 30.2s                   | OOM     |
| 1000M           | OOM                     | OOM     |

Besides, this patch removes the memory allocation for DynamicByteArray as well as the memory copy from DynamicByteArray into the primitive byte array, and it adds no other time-consuming logic, so I see no potential speed regression.

@guiyanakuang
Member

Before:
InStream ----> DynamicByteArray(data[][]) ----> byte[]

After:
InStream ----> byte[]

DynamicByteArray plays no other role in this context and seems completely redundant. I approve of this PR.
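The "after" path above can be sketched as a single exact-size read, assuming the dictionary's total byte length is known up front from the stripe metadata. This is a minimal illustration, not the actual ORC code; a plain InputStream stands in for ORC's InStream, and the class and method names are hypothetical:

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical sketch of the direct read path: InStream ----> byte[].
public class DirectRead {
    // The total length is known, so one exact-size byte[] is filled in
    // place -- a single copy, with no growable intermediate buffer.
    static byte[] readDirect(InputStream in, int totalLength) throws IOException {
        byte[] buf = new byte[totalLength];
        int off = 0;
        while (off < totalLength) {
            int n = in.read(buf, off, totalLength - off);
            if (n < 0) {
                throw new IOException("unexpected end of stream");
            }
            off += n;
        }
        return buf;
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "hello-dictionary".getBytes("UTF-8");
        byte[] out = readDirect(new ByteArrayInputStream(data), data.length);
        System.out.println(new String(out, "UTF-8")); // hello-dictionary
    }
}
```

With a growable intermediate (the "before" path), the chunked buffer and the final byte[] coexist during the copy, which is where the second in-memory copy of the dictionary came from.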

@dongjoon-hyun dongjoon-hyun added this to the 1.8.0 milestone Dec 25, 2021
@dongjoon-hyun dongjoon-hyun changed the title ORC-1060: reduce memory usage when vectorized reading dictionary string encoding columns ORC-1060: Reduce memory usage when vectorized reading dictionary string encoding columns Dec 25, 2021
Member

@dongjoon-hyun dongjoon-hyun left a comment


+1, LGTM. Merged to master. Thank you, @expxiaoli.
Merry Christmas!

@dongjoon-hyun dongjoon-hyun merged commit 3a2cb60 into apache:main Dec 25, 2021
@dongjoon-hyun
Member

@expxiaoli . I added you to the Apache ORC contributor group and assigned ORC-1060 to you.
Welcome to the Apache ORC community.

dongjoon-hyun pushed a commit that referenced this pull request Dec 29, 2021
(cherry picked from commit 3a2cb60)
Signed-off-by: Dongjoon Hyun <[email protected]>
@dongjoon-hyun
Member

dongjoon-hyun commented Dec 29, 2021

I cherry-picked this to branch-1.7 for the next Apache ORC 1.7.3 release.
We will test the performance impact with the downstream projects before releasing.

@dongjoon-hyun dongjoon-hyun modified the milestones: 1.8.0, 1.7.3 Dec 29, 2021
@expxiaoli
Contributor Author

Thanks @dongjoon-hyun
