Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improve tiered storage system test #10471

Merged
merged 1 commit into from
Aug 22, 2024
Merged

Conversation

showuon
Copy link
Member

@showuon showuon commented Aug 20, 2024

Type of change

Select the type of your PR

  • Enhancement=
  • Documentation

Description

  1. In testTieredStorageWithAivenPlugin, we'll wait for the log uploaded or deleted in remote storage, this can be speeded up by reducing remote.log.manager.task.interval.ms config value (default is 30 sec).
  2. In testTieredStorageWithAivenPlugin, we'll wait for the local log got deleted, this can be speeded up by reducing log.retention.check.interval.ms config value (default is 5 min).
  3. Update the dev guide because we don't have testCustomAndUpdatedValues test in KafkaST now.

Checklist

Please go through this checklist and make sure all applicable tasks have been done

  • Write tests
  • Make sure all tests pass
  • Update documentation
  • Check RBAC rights for Kubernetes / OpenShift roles
  • Try your changes from Pod inside your Kubernetes and OpenShift cluster, not just locally
  • Reference relevant issue(s) and close them after merging
  • Update CHANGELOG.md
  • Supply screenshots for visual changes, such as Grafana dashboards

@showuon showuon force-pushed the improveTieredTest branch from 2e88150 to f1f3ddf Compare August 20, 2024 08:22
@scholzj scholzj added this to the 0.44.0 milestone Aug 20, 2024
@scholzj scholzj requested a review from a team August 20, 2024 08:43
Copy link
Member

@im-konge im-konge left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks

Copy link
Member

@see-quick see-quick left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, just out of curiosity how quicker will test go now? (compared to previous runs?) I assumed you have tested it.

@showuon
Copy link
Member Author

showuon commented Aug 20, 2024

LGTM, just out of curiosity how quicker will test go now? (compared to previous runs?) I assumed you have tested it.

@see-quick , I've removed the test terminals. But the most of time in the test is:

  1. waiting for data uploaded to S3
  2. waiting for local log deletion
  3. waiting for S3 data is empty

For (1), the related config is: remote.log.manager.task.interval.ms, whose default is 30 sec. So we might need additional 30 sec if we missed the previous check.
For (2), the topic config (file.delete.delay.ms=1000) has fixed it.
For (3), the related config is log.retention.check.interval.ms, whose default is 5 min. This has the most influence because if we missed the previous check, the test will need to wait for another 5 min.

So, basically, in the worst case, we can save (1) 30 sec + (3)300 sec - (new 1) 5 sec - (new 3) 5 sec = 320 sec, which is more than 5 mins.

@see-quick
Copy link
Member

LGTM, just out of curiosity how quicker will test go now? (compared to previous runs?) I assumed you have tested it.

@see-quick , I've removed the test terminals. But the most of time in the test is:

  1. waiting for data uploaded to S3
  2. waiting for local log deletion
  3. waiting for S3 data is empty

For (1), the related config is: remote.log.manager.task.interval.ms, whose default is 30 sec. So we might need additional 30 sec if we missed the previous check. For (2), the topic config (file.delete.delay.ms=1000) has fixed it. For (3), the related config is log.retention.check.interval.ms, whose default is 5 min. This has the most influence because if we missed the previous check, the test will need to wait for another 5 min.

So, basically, in the worst case, we can save (1) 30 sec + (3)300 sec - (new 1) 5 sec - (new 3) 5 sec = 320 sec, which is more than 5 mins.

Thanks, and good job with such optimization! 👍

@see-quick
Copy link
Member

/azp run regression

Copy link

Azure Pipelines successfully started running 1 pipeline(s).

@scholzj
Copy link
Member

scholzj commented Aug 22, 2024

Thanks for the PR.

@scholzj scholzj merged commit 13be378 into strimzi:main Aug 22, 2024
21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants