Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Imporve][Connector-V2] Imporve iceberg source connector #2865

Closed
wants to merge 1 commit into from

Conversation

hailin0
Copy link
Member

@hailin0 hailin0 commented Sep 23, 2022

Purpose of this pull request

  • Lock output row and update offset in pollNext method to support at-least-once
  • Add synchronized to IcebergStreamSplitEnumerator#run method to avoid update conflicts with readers access handleSplitRequest
  • Fix duplicate read splits when batch job state restore

Check list

@hailin0 hailin0 force-pushed the fix-iceberg-source branch 2 times, most recently from 7c46b58 to e0ffe4e Compare September 24, 2022 08:00
@hailin0 hailin0 changed the title [Imporve][Connector-V2] Support AT_LEAST_ONCE for iceberg source connector [Imporve][Connector-V2] Imporve iceberg source connector Sep 24, 2022
@hailin0
Copy link
Member Author

hailin0 commented Sep 24, 2022

@ashulin PTAL

@hailin0 hailin0 force-pushed the fix-iceberg-source branch 2 times, most recently from 83ff631 to 190f12e Compare September 27, 2022 13:49
@hailin0 hailin0 force-pushed the fix-iceberg-source branch from fbdee90 to 66d61ea Compare November 4, 2022 06:35
@TyrantLucifer TyrantLucifer added this to the 2.3.0 milestone Dec 7, 2022
@hailin0 hailin0 force-pushed the fix-iceberg-source branch from 0e1c5bd to ec38fd0 Compare December 8, 2022 04:30
@hailin0 hailin0 force-pushed the fix-iceberg-source branch from ec38fd0 to 95a0bfa Compare December 8, 2022 05:45
@hailin0 hailin0 force-pushed the fix-iceberg-source branch from 95a0bfa to ce4c5dd Compare December 8, 2022 06:18
@EricJoy2048 EricJoy2048 removed this from the 2.3.0 milestone Dec 8, 2022
@hailin0 hailin0 force-pushed the fix-iceberg-source branch 2 times, most recently from 901cb4d to 3b219cf Compare December 8, 2022 08:49
@hailin0 hailin0 force-pushed the fix-iceberg-source branch 2 times, most recently from 47a5814 to cce8cbb Compare January 10, 2023 15:53
@@ -124,6 +126,7 @@ public void testIcebergSource(TestContainer container) throws IOException, Inter
}

private void initializeIcebergTable() {
FileUtil.fullyDelete(new File(CATALOG_DIR));
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would this be a problem if it wasn't completely deleted?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Test cases can interfere with each other

CalvinKirs
CalvinKirs previously approved these changes Jan 17, 2023
Copy link
Member

@CalvinKirs CalvinKirs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Generally LGTM

@EricJoy2048 EricJoy2048 self-requested a review April 1, 2023 07:28
Copy link
Member

@EricJoy2048 EricJoy2048 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please resolve the conflict

* Lock output row and update offset in pollNext method to support at-least-once
* Add `synchronized` to IcebergStreamSplitEnumerator#run method to avoid update conflicts with readers access `handleSplitRequest`
* Fix duplicate read splits when batch job state restore
* Fix assign split owner(negative number)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants