
[SPARK-24552][core][sql] Use unique id instead of attempt number for writes [branch-2.3]. #21615

Closed
vanzin wants to merge 2 commits into apache:branch-2.3 from vanzin:SPARK-24552-2.3

Conversation

vanzin (Contributor) commented Jun 22, 2018

This passes a unique attempt id, instead of the attempt number, to v2 data sources and Hadoop APIs, because attempt numbers are reused when stages are retried. When attempt numbers are reused, sources that track data by partition id and attempt number may incorrectly clean up data, because the same attempt number can be both committed and aborted.
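
The failure mode described above can be demonstrated with a small, self-contained sketch (not Spark's actual writer code; `WriterId` and `TrackingSink` are hypothetical names, and the unique id stands in for something like `TaskContext.taskAttemptId`): a sink that keys cleanup on (partition id, attempt number) can abort the very output it already committed once a stage retry reuses attempt number 0, while a globally unique id keeps the two task attempts distinct.

```scala
import scala.collection.mutable

// Identity a sink uses to track a write task: partition id plus either the
// per-stage attempt number or a globally unique id.
case class WriterId(partitionId: Int, id: Long)

// Toy sink that records which writer identities were committed and aborted.
class TrackingSink {
  private val committed = mutable.Set.empty[WriterId]
  private val aborted   = mutable.Set.empty[WriterId]

  def commit(writer: WriterId): Unit = committed += writer
  def abort(writer: WriterId): Unit  = aborted += writer

  // An identity that was both committed and aborted is the danger case:
  // the abort's cleanup can delete output the committed task already produced.
  def conflicts: Set[WriterId] = committed.intersect(aborted).toSet
}

object AttemptIdDemo extends App {
  // Keyed by attempt number: stage attempt 0 commits partition 3 / attempt 0,
  // the stage is retried, and the retried task (also attempt number 0) aborts.
  val byAttemptNumber = new TrackingSink
  byAttemptNumber.commit(WriterId(partitionId = 3, id = 0L))
  byAttemptNumber.abort(WriterId(partitionId = 3, id = 0L))
  println(s"attempt-number conflicts: ${byAttemptNumber.conflicts}") // non-empty

  // Keyed by a globally unique id: the retried task gets a different id,
  // so its abort can never collide with the earlier commit.
  val byUniqueId = new TrackingSink
  byUniqueId.commit(WriterId(partitionId = 3, id = 17L))
  byUniqueId.abort(WriterId(partitionId = 3, id = 42L))
  println(s"unique-id conflicts: ${byUniqueId.conflicts}") // empty
}
```

This is exactly what the description argues: passing a unique id, rather than the stage-local attempt number, to v2 sources and Hadoop APIs ensures a commit and an abort can never refer to the same identity across stage retries.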

tgravescs (Contributor) commented

+1 pending tests. @rdblue

rdblue (Contributor) commented Jun 22, 2018

+1

SparkQA commented Jun 22, 2018

Test build #92226 has finished for PR 21615 at commit a80b57b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Jun 23, 2018

Test build #92234 has finished for PR 21615 at commit f9b134e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

asfgit pushed a commit that referenced this pull request Jun 25, 2018
… number for writes.

Author: Marcelo Vanzin <[email protected]>

Closes #21615 from vanzin/SPARK-24552-2.3.
vanzin closed this Jun 25, 2018
vanzin (Contributor, Author) commented Jun 25, 2018

Merged to 2.3.

jzhuge pushed a commit to jzhuge/spark that referenced this pull request Aug 20, 2018
… number for writes.

Author: Marcelo Vanzin <[email protected]>

Closes apache#21615 from vanzin/SPARK-24552-2.3.
vanzin deleted the SPARK-24552-2.3 branch on August 24, 2018 at 19:56