[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files #37359

caican00 · 2022-08-01T12:12:09Z

What changes were proposed in this pull request?

From this pr:#22112, we learn that currently we can't rollback and rerun a result stage, and just fail.

And this new pr is designed to solve some scenarios of this problem. When the analysis result from the result stage of a job will be output to a storage system, it can be written to a file system or database system.

If the result was written to a file system, it was stored in a temporary directory until the result stage run successfully. If the result stage whose map stage is indeterminate failed but had committed output for some partitions, we can delete these temporary files and roll back the result stage.
If the result was written to a database system, it will be written directly to the database and therefore if the result stage whose map stage is indeterminate failed but some result tasks were successful, the result has been written successfully can not be rolled back
Therefore, the main purpose of this new pr is to support Result Stage rollback in the scenarios of writing to any file system.
I added a new identifier isResultStageRetryAllowed in RDD class to indicate whether its corresponding Result stage supports retries.
It is a Boolean variable and the default value is false，indicating that result stage rollback is not supported and corresponds to the scenario of writing to the database.
And in the case of writing to the file system, the result stage supports retries, and isResultStageRetryAllowed will be changed to true.

Does this PR introduce any user-facing change?

No

How was this patch tested?

new tests and manually test

write to hive

write to iceberg

write to hdfs

write to mysql

…ning all result tasks when writing files

caican00 · 2022-08-01T13:02:17Z

gently ping @cloud-fan
Can you help to review this PR

AmplabJenkins · 2022-08-01T20:23:04Z

Can one of the admins verify this patch?

caican00 · 2022-08-09T10:06:55Z

gently ping @cloud-fan Can you help to review this PR

@cloud-fan Hi, could you help to review this pr? Thanks

github-actions · 2022-11-18T00:24:23Z

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

caican00 added 2 commits August 1, 2022 16:56

[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerun…

ab850bf

…ning all result tasks when writing files

fix

f452963

github-actions bot added CORE SQL labels Aug 1, 2022

caican00 changed the title ~~Support rollback result stage~~ [SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files Aug 1, 2022

fix

8d5fe74

github-actions bot added the Stale label Nov 18, 2022

github-actions bot closed this Nov 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files #37359

[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files #37359

caican00 commented Aug 1, 2022 •

edited

Loading

caican00 commented Aug 1, 2022

AmplabJenkins commented Aug 1, 2022

caican00 commented Aug 9, 2022

github-actions bot commented Nov 18, 2022

[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files #37359

[SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files #37359

Conversation

caican00 commented Aug 1, 2022 • edited Loading

What changes were proposed in this pull request?

Does this PR introduce any user-facing change?

How was this patch tested?

caican00 commented Aug 1, 2022

AmplabJenkins commented Aug 1, 2022

caican00 commented Aug 9, 2022

github-actions bot commented Nov 18, 2022

caican00 commented Aug 1, 2022 •

edited

Loading