Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tests: mitigate MaintenanceTest failures #5409

Merged
merged 1 commit into from
Jul 8, 2022

Conversation

jcsp
Copy link
Contributor

@jcsp jcsp commented Jul 8, 2022

Cover letter

The real fix will be to make the leader balancer aware
of maintenance mode, but the test has become much more
unstable since recent leader balancer changes to do
more movements concurrently, so its worth mitigating
that.

The workaround is to set a short mute timeout so that
muting nodes has no real effect, and a short idle timeout
so that post-maintenance leader movements happen promptly.

Related: #4772

Release notes

  • none

The real fix will be to make the leader balancer aware
of maintenance mode, but the test has become much more
unstable since recent leader balancer changes to do
more movements concurrently, so its worth mitigating
that.

The workaround is to set a short mute timeout so that
muting nodes has no real effect, and a short idle timeout
so that post-maintenance leader movements happen promptly.

Related: redpanda-data#4772
@jcsp jcsp marked this pull request as ready for review July 8, 2022 17:44
@jcsp
Copy link
Contributor Author

jcsp commented Jul 8, 2022

Failures are AWSRoleFetchTests, this presumably raced with landing the fix for that.

@jcsp jcsp merged commit 18fa203 into redpanda-data:dev Jul 8, 2022
@jcsp jcsp deleted the issue-4772-mitigation branch July 8, 2022 20:22
@mmedenjak mmedenjak added kind/bug Something isn't working area/tests ci-failure labels Jul 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants