rpk container smoke test sporadically fails #1967

Closed
ivotron opened this issue Aug 4, 2021 · 4 comments

@ivotron
Member

ivotron commented Aug 4, 2021

Initializing new client
client/metadata fetching metadata for all topics from broker 127.0.0.1:42583
Connected to broker at 127.0.0.1:42583 (unregistered)
client/brokers registered new broker #2 at 127.0.0.1:42583
client/brokers registered new broker #0 at 127.0.0.1:41913
Successfully initialized new client
Cluster didn't stabilize: Expected 3 nodes, got 2.
Retrying (1 retries left)
Error: Expected 3 nodes, got 2.
+ cleanup
+ echo 'Cleaning up'
+ ./rpk container purge -v
Cleaning up
Stopping node 0
Stopping node 1
Stopping node 2
Removed node '1'
Removed container 'rp-node-1'
Removed node '0'
Removed container 'rp-node-0'
Removed node '2'
Removed container 'rp-node-2'
Deleted cluster data.

ref: https://buildkite.com/vectorized/redpanda/builds/1156#624c9c4e-690f-4ce6-a38f-5e579badf261

@ivotron
Member Author

ivotron commented Aug 20, 2021

A related error: the rpk container smoke test runs right after the ducktape tests, and the compose cluster from those tests is left up and running (see this for an example), which in some cases causes the containers started by rpk to fail. A fix has been pushed to vtools (link) that stops the compose cluster before the smoke tests begin. It might also fix the issue in the original post.
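
For illustration, the ordering described above could look roughly like the sketch below. It is not the actual vtools change: it assumes the ducktape cluster is managed with Docker Compose, and the `COMPOSE_FILE` and `COMPOSE_CMD` variables and the `stop_compose_cluster` function name are hypothetical.

```shell
# Hypothetical defaults; the real compose file used by vtools is not shown in this thread.
COMPOSE_FILE="${COMPOSE_FILE:-docker-compose.yml}"
# Overridable so the function can be dry-run without a Docker daemon.
COMPOSE_CMD="${COMPOSE_CMD:-docker compose}"

# Stop any compose cluster left over from the ducktape run, so that its
# containers do not interfere with the nodes that rpk starts afterwards.
stop_compose_cluster() {
  ${COMPOSE_CMD} -f "${COMPOSE_FILE}" down --remove-orphans
}
```

The CI job would call `stop_compose_cluster` once, before the rpk container smoke test starts.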

@ivotron
Member Author

ivotron commented Aug 20, 2021

Oh, and another commit enabled logging on failures (running docker logs for each rp-node container), which will show what is going on at the Docker engine level in case this issue persists.
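
A minimal sketch of that kind of failure logging, assuming the containers follow the `rp-node-<id>` naming seen in the log in the original post. The `DOCKER_CMD` variable and the `dump_rp_node_logs` function name are hypothetical and not taken from the actual commit.

```shell
# Overridable so the function can be exercised without a Docker daemon.
DOCKER_CMD="${DOCKER_CMD:-docker}"

# Print the Docker logs of every rp-node container, including stopped ones.
dump_rp_node_logs() {
  # "ps -a" also lists containers that have already exited.
  for name in $(${DOCKER_CMD} ps -a --filter "name=rp-node-" --format '{{.Names}}'); do
    echo "=== docker logs ${name} ==="
    # Keep going even if one container's logs cannot be read.
    ${DOCKER_CMD} logs "${name}" 2>&1 || true
  done
}
```

It would have to run before `./rpk container purge` in the cleanup step, since the purge removes the containers and their logs with them.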

@twmb twmb self-assigned this Oct 5, 2021
@twmb
Contributor

twmb commented May 4, 2022

This is possibly a duplicate of #2418. There have been no new posts on this specific issue since Oct; closing for now. Please reopen if this is run into again.

@twmb twmb closed this as completed May 4, 2022