Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] MinimumMasterNodesIT.testThreeNodesNoMasterBlock (random test failure) #1563

Closed
CEHENKLE opened this issue Nov 16, 2021 · 4 comments
Closed
Assignees
Labels
bug Something isn't working Cluster Manager flaky-test Random test failure that succeeds on second run

Comments

@CEHENKLE
Copy link
Member

Describe the bug
Random test failure :( Please dig in and figure out what went wrong :(

https://fork-jenkins.searchservices.aws.dev/job/OpenSearch_CI/job/PR_Checks/job/Gradle_Check/1079/artifact/gradle_check_1079.log/view/

@CEHENKLE CEHENKLE added bug Something isn't working untriaged labels Nov 16, 2021
@CEHENKLE CEHENKLE changed the title [BUG] [BUG] MinimumMasterNodesIT.testThreeNodesNoMasterBlock (random test failure) Nov 16, 2021
@CEHENKLE CEHENKLE added flaky-test Random test failure that succeeds on second run and removed untriaged labels Nov 16, 2021
@andrross
Copy link
Member

Another failure (different test within same class) here: https://fork-jenkins.searchservices.aws.dev/job/OpenSearch_CI/job/PR_Checks/job/Gradle_Check/1429/artifact/gradle_check_1429.log/*view*/

./gradlew ':server:internalClusterTest' --tests "org.opensearch.cluster.MinimumMasterNodesIT.testTwoNodesNoMasterBlock" -Dtests.seed=8C423154E3EBD680 -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=el-GR -Dtests.timezone=Asia/Aden -Druntime.java=17

@tlfeng
Copy link
Collaborator

tlfeng commented Dec 7, 2022

The test can not be reproduced, and looks like the last failure occurred in CI was on 12/10/2021, it has been almost a year ago.
I ran the command to run the test for 1k times, and there were only 2 failure occurred due to 20-minute general test timeout (the timeout is mentioned in #1477 (comment)).
./gradlew ':server:internalClusterTest' --tests "org.opensearch.cluster.MinimumClusterManagerNodesIT.testTwoNodesNoClusterManagerBlock" -Dtests.iters=1000

REPRODUCE WITH: ./gradlew ':server:internalClusterTest' --tests "org.opensearch.cluster.MinimumClusterManagerNodesIT.testCannotCommitStateThreeNodes {seed=[894AC60C805AC961:A3A096FAFCD862A]}" -Dtests.seed=894AC60C805AC961 -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=ar-EG -Dtests.timezone=America/Yellowknife -Druntime.java=17
  2> java.lang.Exception: Test abandoned because suite timeout was reached.
        at __randomizedtesting.SeedInfo.seed([894AC60C805AC961]:0)
...
BUILD FAILED in 40m 27s
43 actionable tasks: 1 executed, 42 up-to-date

@r1walz
Copy link
Contributor

r1walz commented Nov 19, 2023

Hi, @andrross, @tlfeng. It looks same as #10006. Since, test run history isn't available on this issue. Shall we close it and track progress on #10006 instead?

@andrross
Copy link
Member

Closing in favor of #10006

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Cluster Manager flaky-test Random test failure that succeeds on second run
Projects
None yet
Development

No branches or pull requests

7 participants