-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Spark] Fix metadata cleanup by retaining a checkpoint before the cutoff window. Alternative fix #4146
base: master
Are you sure you want to change the base?
Conversation
Signed-off-by: Felipe Pessoto <[email protected]>
0394394
to
b02045a
Compare
Signed-off-by: Felipe Pessoto <[email protected]>
Signed-off-by: Felipe Pessoto <[email protected]>
@andreaschat-db, could you help with the I couldn't find any info in Delta protocol about checkpointProtection table feature
|
Hi @felipepessoto, please check the doc in TableFeature. AFAIU, your PR is changing which checkpoints are removed and this changes the expectations of the test. |
Which Delta project/connector is this regarding?
Description
Delete eligible delta log files only if there's a checkpoint newer than them before the cutoff window.
Resolves #606
Unit tests based on #2673
How was this patch tested?
Unit Tests
Does this PR introduce any user-facing changes?
Yes, tables with low rate of commit/checkpoints will have increased log retention beyond the cutoff window