[BUG] BWC Tests get failed with different shard number for text chunking processor #690
Comments
Changing the index setting may not affect other bwc tests. I will close the PR: #685 |
Reopened PR for debugging: #685 |
I will take a look into this feature in the next few days. Feel free to assign this issue to me. |
Latest observation: BWC tests pass for 2.14.0-SNAPSHOT but still fail for 2.13.0. |
BWC tests are failing in version 2.13.0 because the neural search code is fetched from the URL https://ci.opensearch.org/ci/dbc/distribution-build-opensearch. Here are a few options to resolve this issue:
|
I personally prefer Option 1 and 3. |
@martin-gaievski and @navneet1v what are your thoughts on this? Shall we go for 2.13.1? |
I think we need a proper fix for this issue. BWC tests should be executed; otherwise we're flying blind. It sounds like options 1 and 3 essentially ignore the failure. |
https://artifacts.opensearch.org/releases/bundle/opensearch is for unreleased versions. These snapshots are updated to reflect our latest code. https://ci.opensearch.org/ci/dbc/distribution-build-opensearch is for released versions. Those artifacts are fixed once a given version is released. |
Actually, the failure has been fixed since 2.14. Only open source 2.13 has this failure, when users misconfigure their index with an improper shard number. |
Hi @martin-gaievski and @vibrantvarun , what's your suggestion on the "proper fix" for this issue? |
Releasing 2.13.1 is the only way for the BWC tests to pick up the artifact from the CI URL. However, you can try running the tests once with 2.13.0-SNAPSHOT. If they pass with 2.13.0-SNAPSHOT, I have no issue with closing this. cc: @martin-gaievski |
+1 to @vibrantvarun's comment; a successful test run for 2.13.0 should be enough. |
I can raise a PR changing the snapshot to 2.13.0-SNAPSHOT, but I am not sure whether it makes sense to change the workflow due to a single test, even when the test is actually passing with the current setting. |
I am not sure whether you mean we need to update our CI workflow with this PR: #752 |
From my observation, CI still fails in https://github.com/opensearch-project/neural-search/actions/runs/9157136803/job/25172907185?pr=684 with the updated CI workflow. We can deep dive into this error during the meeting. |
I've restarted that run for 2.13.0-SNAPSHOT and this time it's successful: https://github.com/opensearch-project/neural-search/actions/runs/9157136803/job/25247195168?pr=684. The same versions on a different platform are also green: https://github.com/opensearch-project/neural-search/actions/runs/9157136803/job/25247196478?pr=684. It looks OK at first glance, though I understand we may have flaky tests. @vibrantvarun do you have context? Is this a flaky test issue, or was it failing constantly? |
Here is the failing link: https://github.com/opensearch-project/neural-search/actions/runs/9157136803/job/25172907185?pr=684. This is the first time this issue has been flaky. I am wondering whether it is related to the BWC workflow. Does @vibrantvarun know more details? |
I've made a few test runs for #684 and got mixed results: https://github.com/opensearch-project/neural-search/actions/runs/9157136803/job/25255088325?pr=684. I think it's now more like a flaky test; as I understand it, this is different from what we had initially, when the test always failed.
|
Thanks @martin-gaievski for providing more results. Can we check the snapshots in https://artifacts.opensearch.org/releases/bundle/opensearch? |
Our conclusion: |
Latest update: 2.13.0-SNAPSHOT does not include the PR fixing the bug. |
Option 5 is not valid. |
Please try the following steps:
|
Hi @martin-gaievski and @vibrantvarun! I have double-checked that the 2.13 branch fixes the BWC tests for the PR. First, here is the error log from running the BWC tests directly: bwc_test_error_logs.txt. Then I followed your steps: I built a 2.13.0 jar file for the neural search plugin and then replaced it. Here is the successful log:
|
I have also tested by running ./gradlew publishToMavenLocal and then the BWC tests. You can confirm that the BWC tests pass after publishing to the .m2 repository. |
Since the 2.13.0-SNAPSHOT BWC tests are good, can we close this issue? |
Closing this issue as it has been resolved. |
What is the bug?
The BWC tests fail after changing the shard number of the index. The error is "index not found", even though we assert that the index creation response is true.
How can one reproduce the bug?
Follow the code changes in #684 and #685 to change the index setting, then run the BWC tests.
What is the expected behavior?
The BWC tests should behave consistently regardless of the index setting.
What is your host/environment?
Linux environment on GitHub.
Do you have any screenshots?
Error log in these two issues: #684 #685
Do you have any additional context?
No